Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 23 January 2025
Sec. Plant Breeding
This article is part of the Research Topic Genomics-Driven Advances in Crop Productivity and Stress Resilience View all 6 articles

Combining genotyping approaches improves resolution for association mapping: a case study in tropical maize under water stress conditions

  • 1Postgraduate Program of Plant Science, Federal University of Ceará, Fortaleza, Ceará, Brazil
  • 2College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
  • 3Louisiana State University Agricultural Center, Baton Rouge, LA, United States

Genome-wide Association Studies (GWAS) identify genome variations related to specific phenotypes using Single Nucleotide Polymorphism (SNP) markers. Genotyping platforms like SNP-Array or sequencing-based techniques (GBS) can genotype samples with many SNPs. These approaches may bias tropical maize analyses due to reliance on the temperate line B73 as the reference genome. An alternative is a simulated genome called “Mock,” adapted to the population using bioinformatics. Recent studies show SNP-Array, GBS, and Mock yield similar results for population structure, heterotic groups definition, tester selection, and genomic hybrid prediction. However, no studies have examined the results generated by these different genotyping approaches for GWAS. This study aims to test the equivalence among the three genotyping scenarios in identifying significant effect genes in GWAS. To achieve this, maize was used as the model species, where SNP-Array genotyped 360 inbred lines from a public panel via the Affymetrix platform and GBS. The GBS data were used to perform SNP calling using the temperate inbred line B73 as the reference genome (GBS-B73) and a simulated genome “Mock” obtained in-silico (GBS-Mock). The study encompassed four above-ground traits with plants grown under two levels of water supply: well-watered (WW) and water-stressed (WS). In total, 46, 34, and 31 SNP were identified in the SNP-Array, GBS-B73, and GBS-Mock scenarios, respectively, across the two water levels, associated with the evaluated traits following the comparative analysis of each genotyping method individually. Overall, the identified candidate genes varied along the various scenarios but had the same functionality. Regarding SNP-Array and GBS-B73, genes with functional similarity were identified even without coincidence in the physical position of the SNPs. These genes and regions are involved in various processes and responses with applications in plant breeding. In terms of accuracy, the combination of genotyping scenarios compared to those isolated is feasible and recommended, as it increased all traits under both water conditions. In this sense, it is worth highlighting the combination of GBS-B73 and GBS-Mock scenarios, not only due to the increase in the resolution of GWAS results but also the reduction of costs associated with genotyping and the possibility of conducting genomic breeding methods.

1 Introduction

Water is the most abundant and often limiting of all plant resources needed to grow and function (Taiz et al., 2015). Water availability is considered one of the most influential factors in agricultural productivity, controlling species distribution in different climatic zones on Earth (Turner and Jones, 1980). In the tropical zone, characterized by relatively high temperatures and low rainfall compared to other zones, plants thriving in these environments are often more exposed to prolonged periods of water scarcity, especially in arid and semi-arid regions. According to climate change projections, this scenario will likely continue or worsen over the years, with potentially more drastic effects on plants (Raza et al., 2019; IPCC, 2023).

Stress can be considered a significant deviation from optimal life conditions (Larcher, 2003), inducing changes and responses as the plant fails to complete its physiological processes for growth and production. The lack of adequate water supply causes greater expansion of the root system into deeper and moister zones of the soil profile, reduction in the development of cells in the aerial tissues, resulting in decreased growth and stomatal closure to reduce transpiration rate and, consequently, photosynthetic activity (Frensch and Hsiao, 1994; Hsiao, 1973). Control measures are complex and difficult for humans to manage, and the search for genotypes that will perform better and economically viable yields in water-limited environments has been increasingly important for genetic improvement.

Conventional breeding for water deficit conditions is still time-consuming, laborious, and costly, as experimental conditions must be carefully managed. However, in recent years, with advances in molecular biology, the development of high-throughput genotyping technologies, and progress in platform development, new opportunities have emerged to enhance this process. This is partly due to cost reduction, which has consequently driven advances in genomic sequencing; another factor is the versatility of SNP (Single Nucleotide Polymorphism) markers, most commonly used in this process (Ingvarsson and Street, 2011). SNPs are abundant markers in crop genomes and are ideal for genetic discovery research and molecular improvement (Rasheed et al., 2017). According to the same authors, genotyping platforms involving Next Generation Sequencing (NGS) and SNP-Array technologies are suitable for genotyping hundreds to thousands of samples with many SNP markers in a single assay much more quickly, revolutionizing the study of genomics and molecular biology.

Genotyping techniques by sequencing or GBS (Genotyping by Sequencing) are simple and highly multiplexed systems used for constructing libraries intended for next-generation sequencing. SNP-Array is a technique that uses microarrays designed to pre-select previously identified genetic markers characterized by wide polymorphism. These markers are then incorporated into a specific platform. GBS-scored SNP platforms provide many markers, although they have high rates of missing data. On the other hand, Array-scored SNP platforms are of high quality but have relatively high costs (Elbasyoni et al., 2018) and possible ascertainment bias if the genetic material used for array development is not related to the tested germplasm (Heslot et al., 2013).

Arrays are well-designed and established in the market to assist studies and breeding programs of major commodity crops (Ganal et al., 2011). For minor crops, arrays are still rarely available, and researchers often rely on information from other crops that is already accessible. However, due to the high cost associated with array development, these platforms are preferably employed when it is possible to use a “universal” approach that applies to various germplasms. However, this can be challenging if researchers attempt to identify rare SNPs across various germplasms; a universal design can become large and expensive, resulting in many monomorphic loci for non-target germplasm groups (Thomson, 2014).

The advancement of model genome knowledge and the advent of next-generation sequencing techniques open up the possibility of a great leap in understanding the genome of relatively lesser-known species. The GBS pipelines are based on a reference genome or assembly of a new genome, applied to model organisms and species lacking pre-existing genomic information (Davey et al., 2011; Poland et al., 2012). In cases where a reference genome is not yet available, a simulated genome can be employed for SNP discovery, which can serve as a valid alternative (Melo et al., 2016). The same authors developed a bioinformatics pipeline to construct a simulated genome called “Mock,” adapted to the population and built from GBS data. This genome is already being used in genomic studies and indicated that the Mock produces similar results when it comes to organizing populations, identifying heterotic groups, selecting testers, and predicting genomic characteristics of hybrids compared to standard approaches (SNP-Array and GBS) (MaChado et al., 2023; Sabadin et al., 2022). This suggests that simulated genomes can be a good alternative, especially for species without the reference genome. However, no studies have been identified on the results generated by these different genotyping approaches in Genome-Wide Association Studies (GWAS).

Other studies have compared datasets from different high-throughput genotyping technologies in GWAS. Darrier et al. (2019), using standard platforms, GBS and SNP-Array, demonstrated efficiency in characterizing genetic diversity in barley, although accessing different regions of the genome. Despite capturing different areas, there was a positive correlation between the genetic distance matrices of both approaches, validating the use of either one for the characterization. These authors emphasized that the choice between GBS and SNP-Array genotyping platforms should be based on various factors, including the nature of the research and group preferences. For example, GBS may be preferable for studies requiring broader genomic coverage due to its ability to sequence a large number of genetic markers.

Conversely, SNP-Array may be more appropriate for analyses focused on specific genome regions. Group preferences, previous experience, and practical considerations such as cost and resource availability influence platform choice. In a study with inbred maize lines, Negro et al. (2019) concluded that GBS and SNP-Array were complementary for detecting QTL marking different haplotypes in association studies. Assuming they are complementary, combining these platforms seeks to determine if it will result in greater data accuracy.

To date, no study comparing GBS, SNP-Array, and simulated genome for GWAS has been published yet. The application of studies of this nature is crucial because they provide evidence that the information obtained from various genotyping approaches may be complementary during the genotyping process, thus demonstrating an efficient alternative for identifying polymorphisms. This, in turn, should offer better support to breeding programs that consistently grapple with identifying more efficient and tolerant genotypes against various abiotic and biotic factors. Another relevant point is that, even with advances in whole-genome sequencing and the complete publication of the maize genome, approaches such as SNP-Array and GBS remain important due to their lower cost and efficiency in genotyping large populations. These techniques generate more manageable data, requiring less computational infrastructure, and provide sufficient resolution to address many biological questions. Their effectiveness in GWAS studies and identifying loci in crops like maize is well demonstrated. For breeding programs or projects with limited resources, they represent agile and viable alternatives, balancing cost, accessibility, and quality.

In this context, the objectives of this study were: i) to verify if there is a difference in the identification of genes with significant effects among genotyping platforms, SNP-Array, GBS, and simulated genome (“Mock”) in GWAS; ii) once differences are confirmed, to determine if the identified genomic regions are complementary and if they provide better accuracy.

2 Materials and methods

To enhance the comprehensibility of the analyses conducted in this study, we present a workflow in which the experimental and data analysis components are summarized in Figure 1. The subsequent sections provide detailed explanations.

Figure 1
www.frontiersin.org

Figure 1. The workflow employed in the study. Different colors are used to represent distinct phases of the analysis.

2.1 Genetic material and experimental trials

This study used maize as the model species in a public diversity panel of 360 tropical inbred lines (Yassue et al., 2021). The genomic and phenotypic data of this panel can be found on the Mendeley platform (https://data.mendeley.com/datasets/6pb9prrbbb/1). The data to be explored were obtained from eight experiments conducted in 2020 and 2021, as detailed below. This study involves contrasting water supply conditions, well-watered (WW) and water-stressed (WS), so a pilot experiment was conducted before the main experiments. A water retention curve was established through regression to obtain field capacity and determine the amount of water to be provided via irrigation (De Souza Silveira et al., 2024). This pilot experiment involved five randomly selected lines from the panel and five levels of water supply: 100% of water applied (WA), 80% of WA, 70% of WA, 50% of WA, and 40% of WA. As a result, the WW and WS points were determined, with the 80% WA and 40% WA treatments representing these conditions, respectively.

The main experiments were conducted at experimental fields of the Department of Agriculture at UFC, Campus do Pici, Fortaleza-CE, located at 3°44’24.27” S latitude and 38°34’29.93” W longitude. The main experiments were conducted under WW and WS in augmented partially repeated block design (augmented p-rep designs), with two temporally spaced replicates (Williams et al., 2011). Five common treatments (checks) were used, randomly selected from within the panel and distributed in each block within the WW and WS conditions (Supplementary Figure S1).

These experiments were always conducted in the second semester of each year, following the rainy season in the region, a period that resembles the climate of the semi-arid zone. The sowings were carried out in plastic pots with a capacity of 2000 cm3, containing substrate (easily reproducible) in a ratio of 3:1 (sand: earthworm humus). The earthworm humus was chosen due to its easy obtainability and effectiveness in providing nutrients to the plants. The use of sand is justified by its easy acquisition, availability, and low cost.

Two seeds were sown per pot at an average 3-4 cm depth. Thinning was performed when the seedlings reached the V2 stage, leaving only one seedling per pot (plot). At this same phenological stage, a water deficit was also initiated, which continued until the V6 stage (harvest). Planting and topdressing fertilization were based on the chemical analysis of the substrate, taking into consideration the crop recommendations, to isolate nutritional stress during the experimental conduct.

As the experiment was conducted in an open field, irrigation control for each experiment was carried out manually and daily. Thus, 15 random samples were used to calculate the daily average weight of the pots within each water supply level. Subsequently, the difference between the current and total weights obtained at each water supply level was calculated to replenish the water volume. It is worth noting that, for each vegetative stage, the average plant weight was obtained to subtract it along with the current weight, thus not affecting the volume of water to be replenished.

2.2 Phenotypic data

The phenotypic evaluation was conducted when most plants reached the V6 phenological stage. The traits considered in this study were: Plant height (PH) - measured from the soil to the insertion of the flag leaf, measured using a graduated ruler (cm); Stalk diameter (SD) – an average of two measurements above ground level at the second node of the stem obtained using a caliper (mm); Chlorophyll content estimation - using SPAD, measuring three leaves per plant to get the average. Subsequently, the plants were cut off at ground level, placed in paper bags, and put in a forced-air oven at 65°C for 72 hours to obtain: Shoot dry matter (SDM)- quantified using an electronic analytical balance (0,005 g).

2.3 Phenotypic analysis

The outliers of the phenotypic data for the traits described in section 2.2 were removed. Then, the remaining data were adjusted for normality using the bestNormalize package (Peterson, 2021), and the assumptions of normal distribution were checked via the Shapiro test and Q-Q plots. Subsequently, equations of mixed linear models were fitted to obtain the BLUP by REML for each trait studied under WW and WS conditions, using the sommer package (Covarrubias-Pazaran, 2016). These analyses were performed using the following model:

y=X1t+X2l+X3n+Z1b+Z2g+Z3i+ϵ(1)

where, y is the vector of phenotypic values of the inbred lines panel and checks; X1, X2, and X3 are incidence matrices for t, l, and n fixed effects; Z1, Z2 and Z3 are incidence matrices for b, g e i random effects; t is the water supply fixed effect vector (WW and WS conditions); l is the replicate (season) fixed effect vector within water supply; n is the number of leaves used as a covariate to correct for differences in plant vigor and development; b is the block/water supply/season random effect vector, where g~N(0, Iσb2); g s the genotype random effect vector, where g~N(0, Iσg2); i is the random effect vector of the genotype–water supply interaction, where i~N (0, Iσi2); ε is the experimental error, where ε~N(0, Rσe2), obtained using a structured diagonal matrix to make it possible to estimate two residual variances, one for each water supply level (σeWW2 and σeWS2). The significance of fixed effects was assessed using the Wald test, and random effects using the likelihood ratio test.

The variance components were used to estimate the heritabilities (h2) by the following estimator:

h2=σg2σg2+σge2s+(σeWW2+σeWS2)rs(2)

where h2 refers to the entry-mean heritability; σg2 is the genotypic variance of the inbred lines panel, σge2 is the variance of the genotype–water supply interaction; σeWWe2σeWS2 are the environmental variance components in WW and WS; s are levels of WW and WS; and r is the number of repetitions in each water supply level.

The reliability of selection for each line [R2(α^l) was obtained by the following expression (Gorjanc et al., 2015):

R2(α^l)=1Var(αiα^i)Var(αi)(3)

where Var(αiα^i) is the variance of the prediction error (PEV) of line i and Var(αi) is the genotypic variance of the trait.

The de-regressed BLUPs (dBLUPs) were obtained by calculating the ratio between the BLUPs of each inbred line in WW and WS and their respective average reliabilities. After these analyses, 313 lines remained out of the 360 in the panel. The dBLUPs of these lines in WW and WS were used in the GWAS analyses.

2.4 Genotypic data

The lines were genotyped using two SNP genotyping platforms: Affymetrix® Axiom Maize Genotyping Array with 18.413 SNP markers (SNP-Array) and genotyping-by-sequencing (GBS) process following the sequencing protocol established by Poland et al. (2012). In this method, genomic DNA was digested by two restriction enzymes, PstI and MseI, to reduce the genome complexity. Subsequently, specific adapters for sequencing on the Illumina NextSeq 500 platform (Illumina Inc., San Diego, CA, United States) were attached to the digested fragments.

The primary GBS data were employed for two purposes: firstly, to perform SNP calling using the temperate line B73 as the reference genome (RefGen v4). Secondly, to construct a simulated reference genome (mock genome) for SNP calling, following the pipeline proposed by Melo et al. (2016), considering all the lines in the panel (Mock).

Therefore, the SNP data were subjected to three GWAS approaches: 1) SNP-Array; 2) GBS with SNP calling based on the B73 reference genome (GBS-B73); 3) GBS using the simulated genome as the reference (GBS-Mock). The SNPs for the GBS dataset was identified from raw data using the TASSEL 5.0 GBSv2 pipeline (Glaubitz et al., 2014), considering both GBS-B73 and GBS-Mock as reference genomes, employing the BWA aligner. The BWA aligner (Li and Durbin, 2009) was used to align the tags against the reference genome (GBS-B73 and GBS-Mock).

The SNP sets obtained in these scenarios were submitted to quality control parameters as call rate (CR) and Minor Allele Frequency (MAF) procedures, where markers with CR < 90% and MAF lower than 5%, and non-biallelic markers were removed from the datasets (Morosini et al., 2017). Imputation of missing data was performed using the Beagle 5.0 algorithm (Browning and Browning, 2008).

2.5 Population structure and LD decay

In order to minimize potential bias caused by population structure, a PCA was performed based on the additive genomic relationship matrix among the remaining 313 panel lines, following VanRaden (2008) using the SNPRelate package (Zheng et al., 2012). FarmCPU automatically incorporated the correction via PCA in the association analysis. Two principal components were used to correct the population structure effect, and the best fit for the model was determined based on Q-Q plots. The most likely number of groups within the panel was determined according to Yassue et al. (2021) as it involved the same diversity panel.

The Linkage Disequilibrium (LD) estimation between each pair of SNP within the chromosomes was calculated by the square of the allele frequency correlation (r²) among all SNP within a distance less than 1 Mbp. The r² values were plotted against the base pair distance of the SNP pair to obtain the LD decay by chromosome. This procedure was performed with all SNP retained from the quality control procedures.

2.6 Association analysis

GWAS were performed for each trait under WW and WS conditions using the FarmCPU method (Liu et al., 2016a). The method stands out for its computational efficiency and ability to control false positives, demonstrating greater statistical power in situations where the trait is strongly associated with kinship (Liu et al., 2016a; Segura et al., 2012). The FarmCPU.P.Threshold function was employed to obtain the p-threshold, specific for each trait via a simulation process with 100 permutations. Subsequently, the cutoff point was obtained by the ratio between the p-threshold and the number of markers used. Subsequently, p-values (significance), MAF, and ASE (Average Effect of Allele Substitution) were obtained for each significantly associated SNP, designated hereafter as a potential candidate gene underlying the target trait. Furthermore, the coefficient of determination for each significant SNP (RSNP2 was obtained based on ASE and MAF using equations described in Da et al. (2014). Next, multiple linear regressions were established for each trait using the significant SNPs as predictor variables to quantify the markers’ influence on that trait’s expression (RTOT2). The Manhattan and Q-Q plots graphs were generated using the CMplot package (Yin, 2020), and the graphs showing the proportion of phenotypic variance explained by the SNP were generated using the ggplot2 package (Wickham, 2011) in the R software. Venn diagrams based on the common gene functionality for the traits at each water supply level were created using LucidChart (lucidchart.com).

2.7 Correlation among markers of different scenarios

Given the stability and efficiency of SNP-Array technology in accurately genotyping numerous markers, we conducted Pearson correlation analysis (r) among significant markers with known functions identified in GWAS within the GBS-Mock scenario and markers present in the SNP-Array scenario for each trait under both WW and WS conditions. The analyses were performed using the R software base. This approach aimed to assess the concordance and potential overlap between markers identified through different genotyping methods and their associations with specific traits. By comparing these markers across scenarios, we sought to elucidate common genetic factors contributing to trait variation and explore the utility of integrating data from diverse genotyping platforms in genomic analyses related to crop improvement and adaptation to environmental stressors.

2.8 Gene annotation

A candidate gene association mapping was performed for traits with significant SNP. The physical positions of SNP for GBS-Mock were assigned using BLAST (Altschul et al., 1990) to align them with the maize genome assembly for comparison purposes. These positions were used to obtain 41 bp DNA fragments on a single chromosome (Supplementary Table S2). Subsequently, a BLAST was conducted exclusively for GBS-Mock on MaizeGDB1 via blast, utilizing the B73 RefGen_v4 sequence database to locate the chromosome by inserting the DNA fragment. The MaizeGDB database and its functional information associated with each SNP based on B73 RefGen_v4 were utilized for all scenarios. After defining the region to be considered, potential candidate genes flanking each marker were identified. Candidate genes linked to each trait were determined through annotation within a sliding window of 50 kb around each significant SNP, following a conservative approach described by Yassue et al. (2021). All genes within a range of 50 kb downstream and 50 kb upstream were annotated. Subsequently, they were assessed and considered based on two criteria: proximity to the SNP and functional similarity as per databases available on the Maize eFP Browser (2023)2 and Maize Genomics Resource (2023)3.

3 Results

3.1 Phenotypic analysis

In general, significant effects were detected for all sources of variation, except for the G x WA interaction, in the studied traits (Table 1). The variance components showed a similar pattern for all traits, with a predominance of genotypic variance over the residual variance of the interaction. Except for the PH trait, there was a higher residual variance for the well-watered environment than the low-water availability. The genotypic variance component ranged from 0.09 to 0.19, and the genotype x environment interaction approached zero for all traits, affecting the estimates of heritabilities and accuracy. Heritabilities ranged from moderate to high magnitude, ranging from 0.58 to 0.73. PH was the trait with the highest heritability (0.73) and the least influenced by the environment, showing the highest genotypic coefficient of variation (0.197). The adjusted means fall within the same range observed in other studies.

Table 1
www.frontiersin.org

Table 1. Wald test of fixed effects, likelihood-ratio test (LRT) of random effects, variance components, heritability, accuracy, and adjusted average for SPAD, plant height (PH), stalk diameter (SD), and shoot dry matter (SDM) of the inbred lines evaluated in WW (well-watered) and WS (water-stressed) conditions water supply.

3.2 Genotypic scenarios: number and distribution of SNP

After the quality control, heterozygous markers were eliminated using the MAF and CR procedures, resulting in 12.704 SNP markers for SNP-Array out of a total of 18.413, 11.153 out of 131.350 for GBS-B73, and 4.935 out of 46.926 for GBS-Mock, which were used in the association analyses (Table 2). Approximately 69% of the marker set remained in the SNP-Array, while 10.5% remained in the GBS-Mock and 8.5% in the GBS-B73 scenario. However, there was a balanced distribution of SNP across the chromosomes in the standard scenarios (SNP-Array and GBS-B73). In total, 11 common SNPs were found among the genotypic scenarios (Figure 2). Between SNP-Array and GBS-B73, 8 shared SNPs were observed: 3 for SPAD under well-watered conditions and 1 under water-stressed conditions; 1 SNP in each water supply condition for PH and SDM. In the GBS-B73 and GBS-Mock scenarios, 1 common SNP was identified for PH under well-watered conditions and 1 for SD in both water supply conditions.

Table 2
www.frontiersin.org

Table 2. Number of markers scored (raw data) and the final number of markers (clean data) total and per chromosome (Chr) after quality control for all genotyping scenarios used to assess inbred lines evaluated in WW (well-watered) and WS (water-stressed) conditions water supply.

Figure 2
www.frontiersin.org

Figure 2. Venn diagrams with the number of significant trait SNPs in three genotyping scenarios. (A) WW (well-watered) water supply condition column; (B) WS (water-stressed) water supply condition column. SPAD, PH (plant height), SD (stalk diameter), and SDM (shoot dry matter).

3.3 GWAS analysis

Significant SNP were found on five of the ten maize chromosomes for the SNP-Array scenario and four for GBS-B73 for the SPAD trait under the WW condition (Figures 3A, B). The Q-Q plots showed data fitted to the model (Figures 3C–E). The significant marker/trait association threshold ranged from 4.99 to 12.85 (Supplementary Table S1).

Figure 3
www.frontiersin.org

Figure 3. Manhattan plot and Quantile-Quantile (Q-Q) plots for Genome-Wide Association Study (GWAS) comparing genotyping platforms in tropical maize for SPAD trait in WW (water-stressed) conditions water supply. The Manhattan plot displays GWAS results based on three datasets: SNP-Array and GBS-B73 (A), and Mock (B). The x-axis represents the chromosomal positions, while the y-axis indicates the -log10 P-values, reflecting statistical significance. The horizontal lines denote the genome-wide suggestive significance threshold, with dots above these lines marking significant SNPs. The Q-Q plots illustrate the GWAS results for the same datasets: SNP-Array (C), GBS-B73 (D), and Mock (E). The x-axis corresponds to the -log10 expected P-values derived from the chi-square distribution, while the y-axis represents the -log10 observed P-values. Each dot represents an SNP, with the most significant SNP appearing as the top hit. The red diagonal line shows the expected distribution under the null hypothesis of no association.

A total of 46, 34, and 31 significant SNP were found for SNP-Array, GBS-B73, and GBS-Mock, respectively (Table 3; Supplementary Figures S2S9). There were no SNP common to all three scenarios; however, at least one SNP was shared between two of them (Figure 2). SPAD had the highest number of significant SNP, totaling 34, followed by PH, SDM, 27, and SD, 23. The SNP array presented more markers for SPAD and PH and GBS-B73 for SD, and there was an equivalence among the three scenarios for SDM. Overall, GBS-B73 and GBS-Mock showed some similarity in the quantity of markers.

Table 3
www.frontiersin.org

Table 3. Number, average, and standard deviation (SD) of significant SNPs per trait in WW (well-watered) and WS (water-stressed) conditions water supply and genotyping scenario.

3.4 Correlation among SNP in the GBS-Mock and SNP-Array scenarios

Our results revealed 20 significant markers in the GBS-Mock that positively correlated with the SNP-Array scenario to traits under different environmental conditions (Table 4). Pearson correlation coefficients (r) were observed, ranging from weak to strong. Specifically, for SDM in WW conditions, correlations ranged from 0.94 to 0.30. Similarly, SPAD values showed moderate to strong correlations with markers, ranging from 0.52 to 0.76 in WW conditions and from 0.40 to 0.76 in WS conditions. For PH, correlations were moderate, with values of 0.36 for WW and 0.51 and 0.53 for WS. Notably, SD exhibited correlations ranging from 0.35 to 0.87 in WW conditions and from 0.30 to 0.87 in WS conditions. Additionally, SDM showed moderate to strong correlations, ranging from 0.46 to 0.94 in WW conditions and 0.47 in WS conditions.

Table 4
www.frontiersin.org

Table 4. Pearson correlation among significant markers from the GBS-Mock scenario with known functions and markers from the SNP-Array scenario for traits in WW (well-watered) and WS (water-stressed) conditions water supply.

3.5 Candidate genes and functional annotations

Based on the physical location of significant SNP in the B73 reference genome for SNP-Array and GBS-B73 and the reference genome for GBS-Mock, genomic regions and candidate genes related to significant loci were identified (Supplementary Table S1). In some cases, the same genes and regions were identified for a given trait under both water supply conditions. For example, Zm00001d042735 and Zm00001d001852 in the GBS-Mock scenario for SPAD and SD, respectively; Zm00001d017978 located on chromosome 5 in SNP-Array for PH. Similarly, identical genes and regions were found in different scenarios, for instance, Zm00001d031759 located on chromosome 1 was detected in SNP-Array and GBS-B73 for SPAD in WW and WS. The same gene was also identified for different traits, such as Zm00001d005090 for SD and SDM in GBS-B73.

The genomic regions and candidate genes with similar functions were grouped, considering each trait at the same water supply level across genotyping scenarios (Figure 2; Supplementary Table S3). For SNP-Array and GBS-B73, regions and genes with the same functionality on the same chromosome were observed, such as Zm00001d031445 and Zm00001d027626, both on chromosome 1, which are correlated with ethylene biosynthesis for SDM in WW. Conversely, these platforms also identified genomic regions and candidate genes on different chromosomes but with coinciding functions. For example, Zm00001d026477 on chromosome 10 and Zm00001d027695 on chromosome 1 are responsible for responses to abiotic stress by reactive oxygen species (ROS), jasmonic acid (JA), and ethylene; Zm00001d044194 on chromosome 3 and Zm00001d018127 on chromosome 5 function in the regulation of the circadian cycle for SPAD under WW; Zm00001d017978 on chromosome 5 and Zm00001d008952 on chromosome 8 are involved in endoglucanase activity for PH in WW; and Zm00001d053809 on chromosome 4 and Zm00001d042481 on chromosome 3 for GBS-B73 are associated with ubiquitin proteins for PH in WS; Zm00001d016786 on chromosome 5 and Zm00001d005090 on chromosome 2 act in response to water stress through abscisic acid (ABA) for SDM in WS.

In scenarios involving GBS-B73 and GBS-Mock, genomic regions and candidate genes with similar functions were identified for Zm00001d021708 on chromosome 7 and Zm00001d012719 on the single chromosome, related to plant responses to ABA for PH in WW; Zm00001d014899 on chromosome 5 and Zm00001d001852 on the single chromosome, associated with the phytohormone gibberellin for SD in WW; Zm00001d00509 on chromosome 2 and Zm00001d053262 on the single chromosome, involved in ABA regulation for SD in WS.

3.6 Phenotypic variation explained by SNP in different genotyping scenarios

The proportions of phenotypic variance explained by significant SNP (RTOT2) for the analyzed traits under both water supply conditions, ideal (WW) and deficit (WS), were less explained in the isolated genotyping scenarios for the studied traits (Figure 4). Regarding the isolated scenarios, RTOT2 in SNP-Array ranged from 0.18 for SD (WW in WS) to 0.53 for SPAD (WW), GBS-B73 ranged from 0.11 for SD (WS) to 0.48 for SD (WW), and GBS-Mock from 0.11 for PH (WW) to 0.53 for SPAD (WW). Overall, SNP-Array performed better independently for SPAD and PH, except for SD (WW), where GBS-B73 stood out, and SDM was almost the same among the scenarios. When combined, the value of RTOT2 ranged from 0.26 in SNP-Array + GBS-B73 for SD (WS) to 0.65 in SNP-Array + GBS-Mock for SPAD (WW). The percentages obtained represent the phenotypic variance explained by combining multiple SNPs simultaneously.

Figure 4
www.frontiersin.org

Figure 4. Proportion of phenotypic variance explained by the SNP (RTOT2) per trait in WW (well-watered) and WS (water-stressed) conditions water supply and genotyping scenario). (A) SPAD, (B) PH (plant height), (C) SD (stalk diameter), and (D) SDM (shoot dry matter).

The best scenario combination was SNP-Array + GBS-Mock for SPAD (WW) with an increase of 0.12 in accuracy compared to the best isolated scenario. For PH and SDM under WW condition, SNP-Array + GBS-B73 was superior, increasing accuracy by 0.07 and 0.16, respectively, compared to the best single scenario. For SD, combining GBS-B73 + GBS-Mock increased accuracy by 0.05. Regarding water availability, the ideal water supply condition achieved better overall accuracy, except for PH in isolated SNP-Array and combined with GBS-Mock. In the WS condition, better accuracy was also observed for all traits when combining scenarios.

4 Discussion

Water is one of the most important factors limiting crop growth. Maize requires a large amount of water throughout all stages of development, from seed germination to the reproductive phase. In this context, the significant effect of water supply levels reveals contrasting conditions in WW and WS, indicating that the irrigation treatments used in the present study to generate contrasting environments were sufficient for all traits (Table 1). Moreover, the significance of genotypes suggests that the panel used in this study exhibits genetic variability. Previous studies have also reported genetic diversity for the same tropical maize germplasm panel (Yassue et al., 2021; De Souza Silveira et al., 2024). Genetic variability is a fundamental factor for any breeding program.

However, the interaction effect shows that the responses were not differentiated for the genotypes across environments; they exhibit similar phenotypic responses to environmental changes. Genotype x environment is important when estimating heritability because it influences a trait’s genetic and environmental variation (Falconer and Mackay, 1996). The low effect of interaction also maximizes the accuracy (Assareh et al., 2012); high accuracy estimates indicate good experimental precision. Heritability was higher for plant height, followed by stem diameter, consistent with results from Sabiel et al. (2013), who reported moderate heritabilities for plant height and stem diameter in maize under water stress.

4.1 SNP in genotyping scenarios

Platforms such as SNP-Array and GBS are well-suited for genotyping hundreds to thousands of samples, each containing numerous SNP markers, in a single assay and at a significantly faster pace (Rasheed et al., 2017). This study had a balanced SNP distribution across chromosomes in the SNP-Array and GBS-B73 genotyping scenarios, perhaps attributed to using the same reference genome (Table 2). The inbred line B73 has been utilized as the reference genome for maize sequencing (Schnable et al., 2009), and an example of a reference genome-based pipeline is TASSEL-GBS.

In the GBS-Mock scenario, a smaller number of SNP markers was observed. In cases where a reference genome is not yet available, a simulated genome can perform SNP discovery, serving as a valid alternative, especially for minor crops (MaChado et al., 2023; Sabadin et al., 2022). Regarding the smaller number of markers observed in GBS-B73 compared to SNP-Array, this may be related to the low genomic coverage of GBS resulting in missing SNP (Wang et al., 2020). However, this issue can be partially addressed by using software employed in imputation, as missing SNP are imputed to fill in the gaps in obtaining intermediate genotype information.

4.2 GWAS and candidate genes

GWAS has emerged as a crucial tool, allowing for a systematic approach to identifying associations between thousands of genomic loci and complex traits. In this study, overall, more SNP were identified in association with the trait under ideal water supply conditions than under water deficit conditions in all genotyping scenarios (Table 3). A similar result was found by De Souza Silveira et al (2024), who identified more SNPs associated with root traits of tropical maize under ideal water supply conditions than those subjected to water scarcity. Moreover, Yassue et al (2021); Yassue et al, 2023) found more SNP associated with tropical maize traits not evaluated under inoculation by growth-promoting bacteria, such as plant height, stem diameter, and aboveground dry mass. These authors also consider that growth-related traits, such as plant height, stem diameter, and dry mass, are complex and controlled by many genes with small individual effects.

The genes found in the study have small effects (ASE), revealing the polygenic nature of the traits and controlling a relatively small portion of the genotypic variation (Supplementary Table S1). Complex traits in plants, such as height, diameter, and tolerance to environmental stresses, often have a multifactorial genetic basis involving the interaction of various genes and environmental factors. Thus, knowledge of the genomic regions associated with the traits of interest will provide insight into this genetic basis. Additionally, the study also detected a common marker associated with more than one trait at different water supply levels, indicating a possible pleiotropic effect. Bouchet et al. (2017) reported pleiotropy among phenology-related traits, such as plant height and leaf number, and Zhang et al. (2022) for maize productivity traits. Pleiotropic effects in GWAS studies can increase the complexity of understanding genetic and phenotypic relationships, indicating that phenotypes are more interconnected than initially thought. This complicates the interpretation of study results, as it may need to be clarified which phenotype is directly influenced by the variant and to what extent. In genetic improvement studies, pleiotropic effects can affect the selection of desirable traits, as a single genetic variant can influence multiple agronomic or desirable traits.

The candidate gene Zm00001d005090 is associated with SD under both water conditions and SDM under water deficit, possibly indicating a pleiotropic effect regulating the expression of these two traits. This gene is responsible for the clathrin heavy chain, one of the main subunits of clathrin, an essential protein in eukaryotic cells playing a crucial role in the endocytosis process. Hence, endocytosis occurs in many vital processes for plant development, such as abscisic acid (ABA) responses (Sutter et al., 2007). These authors state that in situations involving ABA, specific proteins in the plasma membrane are negatively regulated through the induction of their endocytosis. It has been demonstrated that ABA and salicylic acid positively regulate a gene encoding a clathrin chain in maize (Zeng et al., 2013). ABA is produced in various parts of plants, including the stem, and it influences gene expression by activating stress-response protein-coding genes and repressing growth-related genes. There is also evidence that clathrin impacts Arabidopsis’s stomatal function, gas exchange, and vegetative growth (Larson et al., 2017). Thus, this gene may have a pleiotropic effect, resulting in reduced height, stem diameter growth, and dry mass.

SNP were found to be associated with the trait simultaneously in both water availability levels, such as the gene Zm00001d017978 identified in association with the PH trait in the SNP-Array scenario and the gene Zm00001d001852 in association with the SD trait in the GBS-Mock scenario. Zm00001d017978 has a putative function in the endoglucanase enzyme, a subgroup of a larger enzyme family called cellulase. Cellulases are part of a superfamily of enzymes called hydrolases that use water to break down molecules. All cellulases are essential to degrading cellulose, a structural polysaccharide found in plant cell walls (Rahman et al., 2018). The cell wall plays a crucial role in plants’ support and mechanical support, allowing them to grow by providing rigidity and resistance. Therefore, any alteration in cellulose degradation, caused by overexpression or underexpression of enzymes can affect structural integrity and, consequently, plant height. The applied water deficit may have negatively affected stem elongation, contributing to plant height, as at the V6 stage, the stem initiates the accelerated elongation phase. The gene Zm00001d001852 has a putative function as Gibberellin-regulated protein 2 (GRP) with expression positively regulated by gibberellin. The plant hormone gibberellin regulates major aspects of plant growth and development (Yamaguchi, 2008), stimulating cell division and growth. The effect of gibberellin on stem diameter may be related to cell division and radial expansion of cells, increasing the number of cell layers. Additionally, there is evidence that biotic stresses impact gibberellin and GRP levels, as it has been reported that a slight increase in temperature can raise endogenous gibberellin concentration (Camut et al., 2019).

The genes Zm00001d042735 and Zm00001d031759 were also identified at both water supply levels and are associated with the SPAD trait. The first one was identified in the GBS-Mock scenario, while the other one was identified in both the SNP-Array and GBS-B73 scenarios, and both belong to the zinc finger family. Zinc finger proteins are named for their three-dimensional structure resembling a finger, binding to zinc ions through amino acids in the peptide sequence and widely distributed in eukaryotic organisms (Han et al., 2020). They bind to specific genetic sequences, interact with various proteins, participate in signal transduction, and regulate gene expression, playing an essential role in growth, development, and environmental adaptation. Zm00001d042735 was described as a RING-type E3 ubiquitin transferase. Ubiquitin is a protein that acts as a molecular marker, signaling various cellular functions such as protein degradation, cell cycle regulation, cellular stress response, and intracellular signaling (Lee and Kim, 2011). E3 ubiquitin proteins respond to water stress by regulating ABA biosynthesis and signal transduction, modifying and degrading stress-related proteins (Han et al., 2022). An example is ZmAIRP4, which is involved in maize’s ABA signaling, and this gene’s overexpression increased water stress tolerance in Arabidopsis (Yang et al., 2018). Changes in water content induced by water stress can directly affect the SPAD index and chlorophyll content, as ABA concentration increases, causing stomatal closure to reduce water loss, which may affect the expression of genes related to stress response.

The gene Zm00001d031759, also belonging to the zinc finger protein family, has a putative function in the Protein shoot gravitropism 5 group, acting in the morphogenesis of aerial organs and responses to gravitropism. Some genes from the shoot gravitropism family have been identified and are involved in the perception and signal transduction for gravity associated with the branching angle (Yamauchi et al., 1997). It has also been found that loss of functionality of the shoot gravitropism five gene (SGR5) resulted in decreased starch accumulation in aerial tissues and consequently reduced gravity sensitivity (Tanimoto et al., 2008). Gravity is an important regulator of plant architecture, allowing plants to optimize their position relative to the soil for nutrient absorption and to light for photosynthesis. Furthermore, some genes and regions manifest for the expression of the trait independently of the water supply level, probably unrelated to water stress.

4.2.1 Genes associated with phytohormone signaling pathway

Genes and regions shared among the genotyping scenarios were identified based on their function for the same trait (Supplementary Table S3). For example, genes Zm00001d026477 in SNP-Array and Zm00001d027695 in GBS-B73 are related to jasmonic acid (JA) response, associated with SPAD in WW traits. Jasmonate ZIM domain proteins, well-known as JAZ proteins, play a crucial role in pathogen responses (Ishiga et al., 2013) and are important signaling molecules in the JA pathway (Liu et al., 2017). Glutaredoxins are associated with water-induced stress response in maize, also participating in the abiotic stress response mediated by JA and ethylene through their interaction with transcription factors (Ding et al., 2019). As JA is involved in various signaling pathways regulating physiological and molecular processes in plants, in defense against biotic and abiotic stresses, such as drought (Rehman et al., 2023), signaling pathways induce stomatal closure, activating potassium efflux in guard cell protoplasts (Evans, 2003) enhancing the plants’ ability to cope with environmental stresses.

Regarding ABA regulation, Zm00001d016786 was associated with SDM in WS in SNP-Array and Zm00001d005090 in GBS-B73. Zm00001d021708 was found in GBS-B73, and Zm00001d012719 in GBS-Mock for PH under WW conditions. Zm00001d005090 and Zm00001d053262 were also identified in GBS-B73 and GBS-Mock, respectively, for SD under WS conditions. Protein disulfide-isomerase (PDI) is a member of the thioredoxin superfamily of redox proteins with multiple physiological functions (Khan and Mutus, 2014), playing a crucial role in abiotic stress tolerance. Thioredoxin (TRXo1) is involved in ABA perception through redox regulation of specific receptors (De Brasi-Velasco et al., 2023). In maize, genes related to PDI were highly responsive to ABA and water stress (Liu et al., 2009). A PDI-like protein strongly associated with aboveground biomass and leaf size was also identified (Kang et al., 2015). According to Tanz et al. (2012), PDI is a family of proteins that affect chlorophyll biosynthesis in Arabidopsis seedlings.

The PPR (pentatricopeptide repeat) proteins are located in mitochondria or chloroplasts. In contrast, the BZIP (basic leucine zipper) proteins constitute a family of transcription factors (TFs) associated with plant growth, development, and stress responses. A typical PPR protein is targeted to mitochondria or chloroplasts, binds to one or several organellar transcripts, and influences their expression by altering RNA sequence, turnover, processing, or translation (Barkan and Small, 2014). It has been found that the PPR96 protein, located in mitochondria, altered the transcription levels of various stress-responsive genes under ABA treatments (Liu et al., 2016b). BZIP proteins are involved in multiple stress responses, primarily through the ABA signaling pathway (Uno et al., 2000). Changes in the transcription levels of maize BZIP TFs were observed in response to ABA treatments (Cao et al., 2019).

As mentioned earlier in SDM and SD, the Clathrin heavy chain indicates possible pleiotropy. Calcium-dependent lipid-binding proteins act in response to abiotic stress, such as drought. The expression of sANN3, a calcium-dependent lipid-protein, increased in response to water stress in rice, inducing various genes in the ABA signaling pathway and promoting root growth to enhance water absorption and stomatal closure to reduce water loss (Li et al., 2019). Therefore, these proteins and the biosynthesis pathways in ABA regulation may influence photosynthesis, plant development, and growth.

The genes Zm00001d014899 in GBS-B73 and Zm00001d001852 in GBS-Mock are associated with the trait SD under WW conditions, involved with the phytohormone gibberellin. The first encodes a protein from the tetratricopeptide repeat (TPR)-like superfamily. Proteins containing tetratricopeptide repeats play an important role in protein-protein interaction and regulating various cellular functions (Rosado et al., 2006). They serve different crucial roles in plants, including their involvement in phytohormone signaling, such as gibberellin (Jacobsen and Olszewski, 1993; Silverstone et al., 2007). Therefore, TPR-repeat-containing proteins are pivotal in signaling phytohormones and regulating various physiological processes, including growth, development, and environmental response. Gibberellin-regulated protein 2 (GRP) was mentioned earlier, occurring at both levels of water availability for SD.

Genes associated with SDM under WW conditions were found on the same chromosome, Zm00001d031445 in the SNP-Array and Zm00001d027626 in the GBS-B73, both involved in ethylene biosynthesis. The ethylene-insensitive3-like/ethylene-insensitive3 (EIL/EIN3) is one of the major regulatory families in ethylene signaling, also serving as a hub for ethylene connections with various plant responses to different environmental conditions (Wu and Yang, 2019). Ethylene is a crucial regulator in stress signaling, and its interaction with a receptor complex triggers the inactivation of kinase response, resulting in the initial dephosphorylation of EIN2, followed by the cleavage of the C-terminal of EIN2. Subsequently, EIN2 translocates to the nucleus, regulating the activation of EIN3/EIL1. These proteins, in turn, exert control over ethylene response factors (Yoshida et al., 2011).

S-adenosyl-L-methionine synthetase, known as SAM, is a donor of methyl groups in the biosynthesis of nucleic acids, proteins, lipids, polysaccharides, and secondary compounds (Heidari et al., 2020). SAM is involved in many important biological processes, such as ethylene biosynthesis. Yu et al. (2012) found that alterations in the expression level of SAM affected protein synthesis, phytohormones (JA and ethylene), and genes related to stress defense response. Ethylene is a volatile compound produced endogenously by plants for growth regulation - roots, stems, leaves, and flowers (Shilev, 2020). Plants increase the synthesis of this hormone when subjected to stressful situations, whether biotic or abiotic. Water deficit, in particular, is one of the main factors related to its increase (Apelbaum and Yang, 1981). Thus, the plant alters its growth rates, decreases biomass, and reduces development (Glick, 2014).

4.2.2 Genes associated with the circadian clock

Zm00001d044194 was identified in the SNP-Array, and Zm00001d018127 in the GBS-B73 under WW condition associated with the SPAD trait acting in the circadian clock. The MYB proteins constitute one of the most extensive families of transcription factors found in plants, playing an important role in growth and development, with widespread expression in the development of corn and soybeans in stress responses, and are closely correlated with the circadian rhythm (Du et al., 2013). MYB-related genes can act as repressors and activators associated with the circadian clock (Kamioka et al., 2016; Hsu et al., 2013; Hu et al., 2024; Schaffer et al., 1998).

The SNW/Ski domain protein is involved in the post-transcriptional regulation of circadian clock genes. SkipP interacts with the serine/arginine-rich spliceosomal protein 45 (SR45) and controls the circadian cycle through alternative splicing of circadian clock genes under biotic stress conditions (Wang et al., 2012). The circadian clock in plants refers to an internal timing system on a cycle of approximately 24 hours that regulates plants’ behavioral and physiological processes, including photosynthesis (Niwa et al., 2009). Likely, each guard cell maintains its circadian rhythm, and a clock controlling stomatal opening seems advantageous for the plant, helping prevent unnecessary water loss through transpiration (Dodd et al., 2005; Gorton et al., 1993). Thus, besides the environmental and internal factors that influence stomatal function, the circadian pattern in regulating stomatal movements is advantageous as it can enhance photosynthetic and water use efficiency.

4.2.3 Genes associated with ubiquitination regulation

The genes associated with the PH trait under WS conditions were Zm00001d053809 in SNP-Array and Zm00001d042481 in GBS-B73, which are related to the regulation of protein ubiquitination. Culins neddylation modulates the ubiquitin ligase activity of the complex, leading to increased ubiquitination and degradation of target proteins by the proteasome (Biswas et al., 2007; Mohanty et al., 2021; Pan et al., 2004). Neddylation is the post-translational protein modification most closely related to the regulation of protein ubiquitination (Rabut and Peter, 2008).

Ubiquitin thioesterases play a fundamental role in regulating the degradation of proteins marked with ubiquitin in plants. The ubiquitin system regulates virtually all aspects of cellular function (Ernst et al., 2013), which is important in controlling abiotic stress and processes that affect agronomic traits. For example, the ubiquitin-proteasome system is an essential pathway for protein degradation in plant growth and development (Linden and Callis, 2020). The ubiquitin-proteasome system is involved in regulating transcription responsive to ABA, allowing plants to respond to abiotic stresses such as drought (Dreher and Callis, 2007). Thus, ubiquitination affects gene expression or protein abundance to determine agronomic traits and stress control, enabling dynamic adjustments in physiological and biochemical responses contributing to plant survival and adaptation under adverse conditions.

4.2.4 Coincident genes among genotyping scenarios

Concerning the SNP-Array and GBS-B73 genotyping scenarios, these platforms are based on the same reference genome (B73) and are physically fixed, making it possible to determine the physical position of the marker in the genome. The coincidence between genes and regions on the same chromosome occurred only for Zm00001d031445 in the SNP-Array and Zm00001d027626 in the GBS-B73, both on chromosome 1. However, it was observed that, even though there was no coincidence regarding the physical position of the markers and chromosomes, there was still similarity regarding the gene functions.

Considering the three scenarios, when identifying the gene and region, it was observed that there was coincidence only for one marker in the SPAD trait under both irrigation conditions. However, possible coincidences were highlighted when deeper analyses were conducted regarding the gene function. Negro et al. (2019) concluded that GBS and SNP-Array were complementary for detecting QTLs in maize, marking different haplotypes. In a study performed in barley by Darrier et al. (2019), GBS and SNP-Array were shown to be efficient in accessing diversity. Still, they accessed different regions of the genome. These are methods that will capture different SNPs, there will be differences in position, density and distribution of the marks. However, even though they captured different regions, there was a positive correlation between the similarity matrices of both approaches. Thus, even when accessing different genome regions, these platforms demonstrate that they can be complementary. In the study, there was also a coincidence for the simulated genome, GBS-Mock, validating the complementarity for this scenario.

4.3 Association of markers in genotyping scenarios

The correlation between the markers in the SNP-Array and GBS-Mock scenarios provides information about the location of the markers on the chromosomes. Identifying a marker highly correlated with the GBS-Mock suggests that this marker is likely on a specific chromosome. The strength of the correlation between two markers is related to their physical proximity; the closer the markers are, the stronger the linkage disequilibrium (LD) (Myles et al., 2009). When markers are closer, they are more likely to be inherited together, leading to a stronger correlation between them. This is because when two markers are very close, they have fewer opportunities for recombination during meiosis, the process of gamete formation, which maintains stable combinations of adjacent alleles across generations. This information can be useful for guiding research and providing an initial direction for investigating the specific position of the marker in the genome.

However, according to the study results, the markers are located throughout the genome and not necessarily physically close. In other words, despite the relationship between the correlation’s strength and the markers’ physical proximity, the results showed that the markers are distributed across the entire genome. This suggests that other factors, besides physical proximity, may influence the correlation between the markers, such as genetic inheritance patterns, recombination rate, and genomic structure, highlighting the importance of considering these aspects.

4.4 Combining genotyping scenarios

Combining genotyping scenarios can be a valid alternative for GWAS studies, providing higher resolution results than those obtained in isolated scenarios. In the approach involving Array and GBS, it was noticed that one tool complements the other, regardless of how GBS data are explored, whether with the referenced genome or in-silico, as there was little difference between SNP-Array + GBS-B73 and SNP-Array + GBS-Mock. Using multiple genotyping platforms, capturing a broader range of genetic markers in linkage disequilibrium with the loci of interest is possible. This can increase the ability to detect significant associations between genetic variants and phenotypes in GWAS studies.

Concerning the use of simulated genomes, MaChado et al. (2023) and Sabadin et al. (2022), assert that it is an excellent strategy for studies on diversity, population structure, heterotic group definition, tester selection, and genomic prediction for minor crops. Another caveat is that using temperate germplasm as a reference genome may introduce a significant bias when analyzing tropical germplasm (Xu et al., 2017). As a result, favorable alleles hidden in tropical maize, in specific tropical genomic regions, may be lost (Rasheed et al., 2017). With GBS, marker discovery and genotyping occur simultaneously, mitigating this bias and enabling the identification of markers in the analyzed diversity panel (Heslot et al., 2013). Furthermore, combining information obtained via conventional approaches with a reference genome obtained from the simulated genome should improve accuracy in association studies and impact the advancement of genetic research and the development of breeding strategies.

5 Final remarks

Negro et al. (2019) and Darrier et al. (2019) highlighted the complementarity between standard genotyping platforms for GWAS, demonstrating that both SNP-Array and GBS can identify markers strongly linked to genes influencing key phenotypic traits. However, adopting different genotyping platforms may incur substantial costs due to their distinct methodologies. Conversely, GBS genotyping offers the flexibility to utilize both the reference genome and in-silico genome, thereby avoiding additional expenses associated with combining these scenarios. In our study, combining GBS-B73 and GBS-Mock datasets resulted in a notable increase in accuracy for several traits compared to the highest accuracy achieved by GBS alone. Specifically, we observed accuracy gains of 0.06, 0.03, 0.05, and 0.15 for SPAD, PH, SD, and SDM, respectively. This integration of datasets allows for more comprehensive analyses, capturing a broader range of SNPs and providing enhanced resolution in explaining phenotypic variation. Ultimately, leveraging a single genotyping method enables more informative and efficient data exploration, facilitating a deeper understanding of the genetic basis of traits and informing crop improvement strategies.

Indeed, when a study aims to uncover greater genetic polymorphism within a species, and SNP-Array technology is unavailable, leveraging GBS approaches becomes a viable alternative. By conducting GWAS using GBS methods, researchers can effectively identify additional polymorphisms, thereby increasing the resolution and depth of the study. This strategy is particularly beneficial for minor or orphan crops with a genome reference but need access to SNP-Array technology. In such cases, GBS offers a cost-effective and accessible means to explore the genetic diversity present within these crops, facilitating a more comprehensive understanding of their genetic architecture and potential avenues for crop improvement. By harnessing the power of GBS-based GWAS, researchers can unlock valuable insights into the genetic factors underlying traits of interest, ultimately contributing to the development of improved varieties tailored to the specific needs of these crops.

Maize, with its high genetic diversity, can target pangenomes to improve the accuracy of genetic and phenotypic analyses (Lu et al., 2015). Pangenomes offer a more comprehensive representation of genomic variations within the species, allowing for the capture of rare or subpopulation-specific variations that may not be present in a single reference genome (Marschall et al., 2018). However, if the resources needed to generate or utilize pangenomes are not available, GBS remains an effective and accessible alternative. Compared to other approaches, GBS provides greater flexibility and sufficient resolution to identify significant polymorphisms, contributing to the exploration of genetic diversity and the advancement of breeding programs. By leveraging the power of GBS-based GWAS, either alongside pangenomes or as an alternative to them, researchers can unlock valuable insights into the genetic factors underlying traits of interest. This ultimately contributes to the development of improved varieties tailored to the specific needs of these crops.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author contributions

FP: Data curation, Investigation, Methodology, Writing – original draft, Writing – review & editing. IM: Data curation, Writing – review & editing. MS: Data curation, Writing – review & editing. AL: Data curation, Writing – review & editing. FS: Data curation, Methodology, Validation, Writing – review & editing. RF-N: Conceptualization, Formal Analysis, Investigation, Writing – review & editing, Data curation, Methodology. JD: Conceptualization, Data curation, Formal Analysis, Investigation, Writing – review & editing, Methodology.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was financially supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)—Process: 409096/2021-0 CNPq/MCTI/FNDCT and by Fundação Cearense de Apoio ao Desenvolvimento Científico e Tecnológico (Funcap) - Process: 10504968/2023 - Edital 08/2023 – Funcap - Internacionalização.

Acknowledgments

We acknowledge the financial support provided by the Coordination for the Improvement of Higher Education Personnel - Brazil (CAPES) - Finance Code 001, and Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) - Process 2017/24327-0. We also extend our gratitude to Limagrain Field Seeds for their technical support. This manuscript was derived from the doctoral thesis of the first author.

Conflict of interest

The authors declare that the research was conducted without any commercial or financial relationships that could potentially create a conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2024.1442008/full#supplementary-material

Footnotes

  1. ^ maizegdb.org
  2. ^ bar.utoronto.ca
  3. ^ maize.uga.edu

References

Altschul, S. F., Gish, W., Miller, W., Myers, E. W., Lipman, D. J. (1990). Basic local alignment search tool. J. Mol. Biol. 215, 403–410. doi: 10.1016/S0022-2836(05)80360-2

PubMed Abstract | Crossref Full Text | Google Scholar

Apelbaum, A., Yang, S. F. (1981). Biosynthesis of stress ethylene induced by water deficit. Plant Physiol. 68, 594–596. doi: 10.1104/pp.68.3.594

PubMed Abstract | Crossref Full Text | Google Scholar

Assareh, A., Volkert, L. G., Li, J. (2012). Interaction Trees: Optimizing Ensembles of Decision Trees for Gene-Gene Interaction Detections. 2012 11th International Conference on Machine Learning and Applications. Boca Raton, FL, USA. pp. 616–621. doi: 10.1109/ICMLA.2012.114

Crossref Full Text | Google Scholar

Barkan, A., Small, I. (2014). Pentatricopeptide repeat proteins in plants. Annu. Rev. Plant Biol. 65, 415–442. doi: 10.1146/annurev-arplant-050213-040159

PubMed Abstract | Crossref Full Text | Google Scholar

Biswas, K. K., Ooura, C., Higuchi, K., Miyazaki, Y., Van Nguyen, V., Rahman, A., et al. (2007). Genetic characterization of mutants resistant to the antiauxin p -chlorophenoxyisobutyric acid reveals that AAR3, a gene encoding a DCN1-like protein, regulates responses to the synthetic auxin 2,4-dichlorophenoxyacetic acid in arabidopsis roots. Plant Physiol. 145, 773–785. doi: 10.1104/pp.107.104844

PubMed Abstract | Crossref Full Text | Google Scholar

Bouchet, S., Bertin, P., Presterl, T., Jamin, P., Coubriche, D., Gouesnard, B., et al. (2017). Association mapping for phenology and plant architecture in maize shows higher power for developmental traits compared with growth influenced traits. Heredity (Edinb). 118, 249–259. doi: 10.1038/hdy.2016.88

PubMed Abstract | Crossref Full Text | Google Scholar

Browning, B. L., Browning, S. R. (2008). A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am. J. Hum. Genet. 84, 210–223. doi: 10.1016/j.ajhg.2009.01.005

PubMed Abstract | Crossref Full Text | Google Scholar

Camut, L., Regnault, T., Sirlin-Josserand, M., Sakvarelidze-Achard, L., Carrera, E., Zumsteg, J., et al. (2019). Root-derived GA12 contributes to temperature-induced shoot growth in Arabidopsis. Nat. Plants 5, 1216–1221. doi: 10.1038/s41477-019-0568-8

PubMed Abstract | Crossref Full Text | Google Scholar

Cao, L., Lu, X., Zhang, P., Wang, G., Wei, L., Wang, T. (2019). Systematic analysis of differentially expressed maize ZmbZIP genes between drought and rewatering transcriptome reveals bZIP family members involved in abiotic stress responses. Int. J. Mol. Sci. 20. doi: 10.3390/ijms20174103

PubMed Abstract | Crossref Full Text | Google Scholar

Covarrubias-Pazaran, G. (2016). Genome-Assisted prediction of quantitative traits using the r package sommer. PloS One 11, 1–15. doi: 10.1371/journal.pone.0156744

PubMed Abstract | Crossref Full Text | Google Scholar

Da, Y., Wang, C., Wang, S., Hu, G. (2014). Mixed model methods for genomic prediction and variance component estimation of additive and dominance effects using SNP markers. PloS One 9. doi: 10.1371/journal.pone.0087666

PubMed Abstract | Crossref Full Text | Google Scholar

Darrier, B., Russell, J., Milner, S. G., Hedley, P. E., Shaw, P. D., Macaulay, M., et al. (2019). A comparison of mainstream genotyping platforms for the evaluation and use of barley genetic resources. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.00544

PubMed Abstract | Crossref Full Text | Google Scholar

Davey, J. W., Hohenlohe, P. A., Etter, P. D., Boone, J. Q., Catchen, J. M., Blaxter, M. L. (2011). Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat. Rev. Genet. 12, 499–510. doi: 10.1038/nrg3012

PubMed Abstract | Crossref Full Text | Google Scholar

De Brasi-Velasco, S., Sánchez-Guerrero, A., Castillo, M. C., Vertommen, D., León, J., Sevilla, F., et al. (2023). Thioredoxin TRXo1 is involved in ABA perception via PYR1 redox regulation. Redox Biol. 63. doi: 10.1016/j.redox.2023.102750

PubMed Abstract | Crossref Full Text | Google Scholar

De Souza Silveira, M. V., de Pontes, F. C. F., MaChado, I. P., Lobo, A. L. A., Fritsche-Neto, R., DoVale, J. C. (2024). Association mapping for image-based root traits in tropical maize under water stress in semi-arid regions. Agron. J. 116 (3), 1250–1264. doi: 10.1002/agj2.21528

Crossref Full Text | Google Scholar

Ding, S., He, F., Tang, W., Du, H. (2019). Identification of maize CC-type glutaredoxins that. Genes (Basel). 10, 610. doi: 10.3390%2Fgenes10080610

Crossref Full Text | Google Scholar

Dodd, A. N., Salathia, N., Hall, A., Kévei, E., Tóth, R., Nagy, F., et al. (2005). Cell biology: Plant circadian clocks increase photosynthesis, growth, survival, and competitive advantage. Science 80)309, 630–633. doi: 10.1126/science.1115581

PubMed Abstract | Crossref Full Text | Google Scholar

Dreher, K., Callis, J. (2007). Ubiquitin, hormones and biotic stress in plants. Ann. Bot. 99, 787–822. doi: 10.1093/aob/mcl255

PubMed Abstract | Crossref Full Text | Google Scholar

Du, H., Wang, Y., Xie, Y., Liang, Z., Jiang, S. J., Zhang, S. S., et al. (2013). Genome-wide identification and evolutionary and expression analyses of MYB-related genes in land plants. DNA Res. 20, 437–448. doi: 10.1093/dnares/dst021

PubMed Abstract | Crossref Full Text | Google Scholar

Elbasyoni, I. S., Lorenz, A. J., Guttieri, M., Frels, K., Baenziger, P. S., Poland, J., et al. (2018). A comparison between genotyping-by-sequencing and array-based scoring of SNPs for genomic prediction accuracy in winter wheat. Plant Sci. 270, 123–130. doi: 10.1016/j.plantsci.2018.02.019

PubMed Abstract | Crossref Full Text | Google Scholar

Ernst, A., Avvakumov, G., Tong, J., Fan, Y., Zhao, Y., Alberts, P., et al. (2013). A strategy for modulation of enzymes in the ubiquitin system. Sci. (80-.). 339, 590–595. doi: 10.1126/science.1230161

PubMed Abstract | Crossref Full Text | Google Scholar

Evans, N. H. (2003). Modulation of guard cell plasma membrane potassium currents by methyl jasmonate. Plant Physiol. 131, 8–11. doi: 10.1104/pp.014266

PubMed Abstract | Crossref Full Text | Google Scholar

Falconer, D. S., Mackay, T. F. C. (1996). Introduction to quantitative genetics. 4.ed (London: Longman).

Google Scholar

Frensch, J., Hsiao, T. C. (1994). Transient responses of cell turgor and growth of maize roots as affected by changes in water potential. Plant Physiol. 104, 247–254. doi: 10.1104/pp.104.1.247

PubMed Abstract | Crossref Full Text | Google Scholar

Ganal, M. W., Durstewitz, G., Polley, A., Bérard, A., Buckler, E. S., Charcosset, A., et al. (2011). A large maize (zea mays L.) SNP genotyping array: Development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome. PloS One 6. doi: 10.1371/journal.pone.0028334

PubMed Abstract | Crossref Full Text | Google Scholar

Glaubitz, J. C., Casstevens, T. M., Lu, F., Harriman, J., Elshire, R. J., Sun, Q., et al. (2014). TASSEL-GBS: A high capacity genotyping by sequencing analysis pipeline. PloS One 9. doi: 10.1371/journal.pone.0090346

PubMed Abstract | Crossref Full Text | Google Scholar

Glick, B. R. (2014). Bacteria with ACC deaminase can promote plant growth and help to feed the world. Microbiol. Res. 169, 30–39. doi: 10.1016/j.micres.2013.09.009

PubMed Abstract | Crossref Full Text | Google Scholar

Gorjanc, G., Bijma, P., Hickey, J. M. (2015). Reliability of pedigree-based and genomic evaluations in selected populations. Genet. Sel. Evol. 47. doi: 10.1186/s12711-015-0145-1

PubMed Abstract | Crossref Full Text | Google Scholar

Gorton, H. L., Williams, W. E., Assmann, S. M. (1993). Circadian rhythms in stomatal responsiveness to red and blue light. Plant Physiol. 103, 399–406. doi: 10.1104/pp.103.2.399

PubMed Abstract | Crossref Full Text | Google Scholar

Han, G., Lu, C., Guo, J., Qiao, Z., Sui, N., Qiu, N., et al. (2020). C2H2 zinc finger proteins: master regulators of abiotic stress responses in plants. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.00115

PubMed Abstract | Crossref Full Text | Google Scholar

Han, G., Qiao, Z., Li, Y., Yang, Z., Wang, C., Zhang, Y., et al. (2022). RING zinc finger proteins in plant abiotic stress tolerance. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.877011

PubMed Abstract | Crossref Full Text | Google Scholar

Heidari, P., Mazloomi, F., Nussbaumer, T., Barcaccia, G. (2020). Insights into the SAM Synthetase Gene Family and Its Roles in Tomato Seedlings under Abiotic Stresses and Hormone Treatments. Plants 9, 586. doi: 10.3390/plants9050586

PubMed Abstract | Crossref Full Text | Google Scholar

Heslot, N., Rutkoski, J., Poland, J., Jannink, J. L., Sorrells, M. E. (2013). Impact of marker ascertainment bias on genomic selection accuracy and estimates of genetic diversity. PloS One 8. doi: 10.1371/journal.pone.0074612

PubMed Abstract | Crossref Full Text | Google Scholar

Hsiao, T. C. (1973). Plant responses to water stress. Annu. Rev. Plant Physiol. 24, 519–570. doi: 10.1146/annurev.pp.24.060173.002511

Crossref Full Text | Google Scholar

Hsu, P. Y., Devisetty, U. K., Harmer, S. L. (2013). Accurate timekeeping is controlled by a cycling activator in Arabidopsis. Elife 2013, 1–20. doi: 10.7554/eLife.00473

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, Z., Zhang, N., Qin, Z., Li, J., Yang, N., Chen, Y., et al. (2024). Differential response of MYB transcription factor gene transcripts to circadian rhythm in tea plants (Camellia sinensis). Int. J. Mol. Sci. 25. doi: 10.3390/ijms25010657

PubMed Abstract | Crossref Full Text | Google Scholar

Ingvarsson, P. K., Street, N. R. (2011). Association genetics of complex traits in plants. New Phytol. 189, 909–922. doi: 10.1111/j.1469-8137.2010.03593.x

PubMed Abstract | Crossref Full Text | Google Scholar

IPCC, Core Writing Team (2023). “Climate change 2023: synthesis report, summary for policymakers,” in Contribution of working groups I, II and III to the sixth assessment report of the intergovernmental panel on climate change. Eds. Lee, H., Romero, J. (Geneva, Switzerland: IPC). doi: 10.59327/IPCC/AR6-9789291691647.001

Crossref Full Text | Google Scholar

Ishiga, Y., Ishiga, T., Uppalapati, S. R., Mysore, K. S. (2013). Jasmonate ZIM-domain (JAZ) protein regulates host and nonhost pathogen-induced cell death in tomato and nicotiana benthamiana. PloS One 8. doi: 10.1371/journal.pone.0075728

PubMed Abstract | Crossref Full Text | Google Scholar

Jacobsen, S. E., Olszewski, N. E. (1993). Mutations at the SPINDLY locus of Arabidopsis alter gibberellin signal transduction. Plant Cell 5, 887–896. doi: 10.1105/tpc.5.8.887

PubMed Abstract | Crossref Full Text | Google Scholar

Kamioka, M., Takao, S., Suzuki, T., Taki, K., Higashiyam, T., Kinoshita, T., et al. (2016). Direct repression of evening genes by CIRCADIAN CLOCK-ASSOCIATED1 in the Arabidopsis circadian clock. Plant Cell 28, 696–711. doi: 10.1105/tpc.15.00737

PubMed Abstract | Crossref Full Text | Google Scholar

Kang, Y., Sakiroglu, M., Krom, N., Stanton-Geddes, J., Wang, M., Lee, Y. C., et al. (2015). Genome-wide association of drought-related and biomass traits with HapMap SNPs in Medicago truncatula. Plant Cell Environ. 38, 1997–2011. doi: 10.1111/pce.12520

PubMed Abstract | Crossref Full Text | Google Scholar

Khan, H. A., Mutus, B. (2014). Protein disulfide isomerase a multifunctional protein with multiple physiological roles. Front. Chem. 2. doi: 10.3389/fchem.2014.00070

PubMed Abstract | Crossref Full Text | Google Scholar

Larcher, W. (2003). Physiological plant ecology: ecophysiology and stress physiology of functional groups (Berlin: Springer Nature).

Google Scholar

Larson, E. R., van Zelm, E., Roux, C., Marion-Poll, A., Blatt, M. R. (2017). Clathrin heavy chain subunits coordinate endo- and exocytic traffic and affect stomatal movement. Plant Physiol. 175, 708–720. doi: 10.1104/pp.17.00970

PubMed Abstract | Crossref Full Text | Google Scholar

Lee, J. H., Kim, W. T. (2011). Regulation of abiotic stress signal transduction by E3 ubiquitin ligases in arabidopsis. Mol. Cells 31, 201–208. doi: 10.1007/s10059-011-0031-9

PubMed Abstract | Crossref Full Text | Google Scholar

Li, H., Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. doi: 10.1093/bioinformatics/btp324

PubMed Abstract | Crossref Full Text | Google Scholar

Li, X., Zhang, Q., Yang, X., Han, J., Zhu, Z. (2019). OsANN3, a calcium-dependent lipid binding annexin is a positive regulator of ABA-dependent stress tolerance in rice. Plant Sci. 284, 212–220. doi: 10.1016/j.plantsci.2019.04.019

PubMed Abstract | Crossref Full Text | Google Scholar

Linden, K. J., Callis, J. (2020). The ubiquitin system affects agronomic plant traits. J. Biol. Chem. 295, 13940–13955. doi: 10.1074/jbc.REV120.011303

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, X., Huang, M., Fan, B., Buckler, E. S., Zhang, Z. (2016a). Iterative usage of fixed and random effect models for powerful and efficient genomeWide association studies. PloS Genet. 12. doi: 10.1371/journal.pgen.1005767

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, F., Sun, T., Wang, L., Su, W., Gao, S., Su, Y., et al. (2017). Plant jasmonate ZIM domain genes: Shedding light on structure and expression patterns of JAZ gene family in sugarcane. BMC Genomics 18, 1–17. doi: 10.1186/s12864-017-4142-3

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, Y. H., Wang, X. T., Shi, Y. S., Huang, Y. Q., Song, Y. C., Wang, T. Y., et al. (2009). Expression and characterization of a protein disulfide isomerases in maize (Zea Mays L.). Chin. J. Biochem. Mol. Biol. 25, 229–234.

Google Scholar

Liu, J. M., Zhao, J. Y., Lu, P. P., Chen, M., Guo, C. H., Xu, Z. S., et al. (2016b). The E-subgroup pentatricopeptide repeat protein family in Arabidopsis thaliana and confirmation of the responsiveness PPR96 to abiotic stresses. Front. Plant Sci. 7. doi: 10.3389/fpls.2016.01825

PubMed Abstract | Crossref Full Text | Google Scholar

Lu, F., Romay, M. C., Glaubitz, J. C., Bradbury, P. J., Elshire, R. J., Wang, T., et al. (2015). High-resolution genetic mapping of maize pan-genome sequence anchors. Nat. Commun. 6. doi: 10.1038/ncomms7914

PubMed Abstract | Crossref Full Text | Google Scholar

MaChado, I. P., DoVale, J. C., Sabadin, F., Fritsche-Neto, R. (2023). On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1164555

PubMed Abstract | Crossref Full Text | Google Scholar

Marschall, T., Marz, M., Abeel, T., Dijkstra, L., Dutilh, B. E., Ghaffaari, A., et al. (2018). Computational pan-genomics: Status, promises and challenges. Brief. Bioinform. 19, 118–135. doi: 10.1093/bib/bbw089

PubMed Abstract | Crossref Full Text | Google Scholar

Melo, A. T. O., Bartaula, R., Hale, I. (2016). GBS-SNP-CROP: A reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data. BMC Bioinf. 17, 1–15. doi: 10.1186/s12859-016-0879-y

PubMed Abstract | Crossref Full Text | Google Scholar

Mohanty, P., Chatterjee, K. S., Das, R. (2021). NEDD8 deamidation inhibits cullin RING ligase dynamics. Front. Immunol. 12. doi: 10.3389/fimmu.2021.695331

PubMed Abstract | Crossref Full Text | Google Scholar

Morosini, J. S., Mendonça, L., de, F., Lyra, D. H., Galli, G., Vidotti, M. S., et al. (2017). Association mapping for traits related to nitrogen use efficiency in tropical maize lines under field conditions. Plant Soil 421, 453–463. doi: 10.1007/s11104-017-3479-3

Crossref Full Text | Google Scholar

Myles, S., Peiffer, J., Brown, P. J., Ersoz, E. S., Zhang, Z., Costich, D. E., et al. (2009). Association mapping: Critical considerations shift from genotyping to experimental design. Plant Cell 21, 2194–2202. doi: 10.1105/tpc.109.068437

PubMed Abstract | Crossref Full Text | Google Scholar

Negro, S. S., Millet, E. J., Madur, D., Bauland, C., Combes, V., Welcker, C., et al. (2019). Genotyping-by-sequencing and SNP-arrays are complementary for detecting quantitative trait loci by tagging different haplotypes in association studies. BMC Plant Biol. 19, 1–22. doi: 10.1186/s12870-019-1926-4

PubMed Abstract | Crossref Full Text | Google Scholar

Niwa, Y., Yamashino, T., Mizuno, T. (2009). The Circadian Clock Regulates the Photoperiodic Response of Hypocotyl Elongation through a Coincidence Mechanism in Arabidopsis thaliana. Plant Cell Physiol. 50, 838–854. doi: 10.1093/pcp/pcp028

PubMed Abstract | Crossref Full Text | Google Scholar

Pan, Z.-Q., Kentsis, A., Dias, D. C., Yamoah, K., Wu, K. (2004). Nedd8 on cullin: building an expressway to protein destruction. Oncogene 23, 1985–1997. doi: 10.1038/sj.onc.1207414

PubMed Abstract | Crossref Full Text | Google Scholar

Peterson, R. A. (2021). Estimating normalization transformations with bestNormalize. The R Journal. 13:1, 310–329. doi: 10.32614/RJ-2021-041

Crossref Full Text | Google Scholar

Poland, J. A., Brown, P. J., Sorrells, M. E., Jannink, J. L. (2012). Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. PloS One 7. doi: 10.1371/journal.pone.0032253

PubMed Abstract | Crossref Full Text | Google Scholar

Rabut, G., Peter, M. (2008). Function and regulation of protein neddylation. “Protein modifications: beyond the usual suspects” review series. EMBO Rep. 9, 969–976. doi: 10.1038/embor.2008.183

PubMed Abstract | Crossref Full Text | Google Scholar

Rahman, M. S., Fernando, S., Ross, B., Wu, J., Qin, W. (2018). Endoglucanase (EG) activity assays chapter 13. Methods Mol. Biol. 169–183. doi: 10.1007/978-1-4939-7877-9

Crossref Full Text | Google Scholar

Rasheed, A., Hao, Y., Xia, X., Khan, A., Xu, Y., Varshney, R. K., et al. (2017). Crop breeding chips and genotyping platforms: progress, challenges, and perspectives. Mol. Plant 10, 1047–1064. doi: 10.1016/j.molp.2017.06.008

PubMed Abstract | Crossref Full Text | Google Scholar

Raza, A., Razzaq, A., Mehmood, S. S., Zou, X., Zhang, X., Lv, Y., et al. (2019). Impact of climate change on crops adaptation and strategies to tackle its outcome: A review. Plants 8. doi: 10.3390/plants8020034

PubMed Abstract | Crossref Full Text | Google Scholar

Rehman, M., Saeed, M. S., Fan, X., Salam, A., Munir, R., Yasin, M. U., et al. (2023). The multifaceted role of jasmonic acid in plant stress mitigation: an overview. Plants 12. doi: 10.3390/plants12233982

PubMed Abstract | Crossref Full Text | Google Scholar

Rosado, A., Schapire, A. L., Bressan, R. A., Harfouche, A. L., Hasegawa, P. M., Valpuesta, V., et al. (2006). The arabidopsis tetratricopeptide repeat-containing protein TTL1 is required for osmotic stress responses and abscisic acid sensitivity. Plant Physiol. 142, 1113–1126. doi: 10.1104/pp.106.085191

PubMed Abstract | Crossref Full Text | Google Scholar

Sabadin, F., Carvalho, H. F., Galli, G., Fritsche-Neto, R. (2022). Population-tailored mock genome enables genomic studies in species without a reference genome. Mol. Genet. Genomics 297, 33–46. doi: 10.1007/s00438-021-01831-9

PubMed Abstract | Crossref Full Text | Google Scholar

Sabiel, S. A. I., Abdelmula, A. A., Bashir, E., Khan Baloch, S., I Sabiel, S. A., Abdelmula, A. A., et al. (2013). Genetic Variation of Plant Height and Stem Diameter Traits in Maize (Zea mays L.) under Drought Stress at Different Growth Stages. J. Nat. Sci. Res. 4, 116–122.

Google Scholar

Schaffer, R., Ramsay, N., Samach, A., Corden, S., Putterill, J., Carré, I. A., et al. (1998). The late elongated hypocotyl mutation of Arabidopsis disrupts circadian rhythms and the photoperiodic control of flowering. Cell 93, 1219–1229. doi: 10.1016/S0092-8674(00)81465-8

PubMed Abstract | Crossref Full Text | Google Scholar

Schnable, P. S., Ware, D., Fulton, R. S., Stein, J. C., Wei, F., Pasternak, S., et al. (2009). The B73 maize genome: Complexity, diversity, and dynamics. Sci. (80-.). 326, 1112–1115. doi: 10.1126/science.1178534

PubMed Abstract | Crossref Full Text | Google Scholar

Segura, V., Vilhjálmsson, B. J., Platt, A., Korte, A., Seren, Ü., Long, Q., et al. (2012). An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations. Nat. Genet. 44, 825–830. doi: 10.3390/app10207326. Plant-growth-promoting bacteria mitigating soil salinity stress in plants. Appl. Sci. 10, 1–20.

PubMed Abstract | Crossref Full Text | Google Scholar

Silverstone, A. L., Tseng, T.-S., Swain, S. M., Dill, A., Jeong, S. Y., Olszewski, N. E., et al. (2007). Functional analysis of SPINDLY in gibberellin signaling in arabidopsis. Plant Physiol. 143, 987–1000. doi: 10.1104/pp.106.091025

PubMed Abstract | Crossref Full Text | Google Scholar

Sutter, J. U., Sieben, C., Hartel, A., Eisenach, C., Thiel, G., Blatt, M. R. (2007). Abscisic acid triggers the endocytosis of the arabidopsis KAT1 K+ Channel and its recycling to the plasma membrane. Curr. Biol. 17, 1396–1402. doi: 10.1016/j.cub.2007.07.020

PubMed Abstract | Crossref Full Text | Google Scholar

Taiz, L., Zeiger, E., Moller, I. M., Murphy, A. (2015). Plant physiology and development. 6th Edn (Sunderland, Massachusetts, USA: Sinauer Associates Incorporated).

Google Scholar

Tanimoto, M., Tremblay, R., Colasanti, J. (2008). Altered gravitropic response, amyloplast sedimentation and circumnutation in the Arabidopsis shoot gravitropism 5 mutant are associated with reduced starch levels. Plant Mol. Biol. 67, 57–69. doi: 10.1007/s11103-008-9301-0

PubMed Abstract | Crossref Full Text | Google Scholar

Tanz, S. K., Kilian, J., Johnsson, C., Apel, K., Small, I., Harter, K., et al. (2012). The SCO2 protein disulphide isomerase is required for thylakoid biogenesis and interacts with LCHB1 chlorophyll a/b binding proteins which affects chlorophyll biosynthesis in Arabidopsis seedlings. Plant J. 69, 743–754. doi: 10.1111/j.1365-313X.2011.04833.x

PubMed Abstract | Crossref Full Text | Google Scholar

Thomson, M. J. (2014). High-throughput SNP genotyping to accelerate crop improvement. Plant Breed. Biotechnol. 2, 195–212. doi: 10.9787/pbb.2014.2.3.195

Crossref Full Text | Google Scholar

Turner, N. C., Jones, M. M. (1980). “Turgor maintenance by osmotic adjustment : A Review and evaluation,” in Adaptation of plants to water and high temperatures stress. Eds. Turner, N. C., Kramer, P. J. (New York: Willey & Sons), 87–103.

Google Scholar

Uno, Y., Furihata, T., Abe, H., Yoshida, R., Shinozaki, K., Yamaguchi-Shinozaki, K. (2000). Arabidopsis basic leucine zipper transcription factors involved in an abscisic acid-dependent signal transduction pathway under drought and high-salinity conditions. Proc. Natl. Acad. Sci. U. S. A. 97, 11632–11637. doi: 10.1073/pnas.190309197

PubMed Abstract | Crossref Full Text | Google Scholar

VanRaden, P. M. (2008). Efficient methods to compute genomic predictions. J. Dairy Sci. 91, 4414–4423. doi: 10.3168/jds.2007-0980

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, X., Wu, F., Xie, Q., Wang, H., Wang, Y., Yue, Y., et al. (2012). SKIP is a component of the spliceosome linking alternative splicing and the circadian clock in Arabidopsis. Plant Cell 24, 3278–3295. doi: 10.1105/tpc.112.100081

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, N., Yuan, Y., Wang, H., Yu, D., Liu, Y., Zhang, A., et al. (2020). Applications of genotyping-by-sequencing (GBS) in maize genetics and breeding. Sci. Rep. 10, 1–12. doi: 10.1038/s41598-020-73321-8

PubMed Abstract | Crossref Full Text | Google Scholar

Wickham, H. (2011). ggplot2. WIREs Comput. Stat. 3, 180–185. doi: 10.1002/wics.147

Crossref Full Text | Google Scholar

Williams, E., Piepho, H. P., Whitaker, D. (2011). Augmented p-rep designs. Biometrical J. 53, 19–27. doi: 10.1002/bimj.201000102

PubMed Abstract | Crossref Full Text | Google Scholar

Wu, Y. S., Yang, C. Y. (2019). Ethylene-mediated signaling confers thermotolerance and regulates transcript levels of heat shock factors in rice seedlings under heat stress. Bot. Stud. 60. doi: 10.1186/s40529-019-0272-z

PubMed Abstract | Crossref Full Text | Google Scholar

Xu, Y., Li, P., Zou, C., Lu, Y., Xie, C., Zhang, X., et al. (2017). Enhancing genetic gain in the era of molecular breeding. J. Exp. Bot. 68, 2641–2666. doi: 10.1093/jxb/erx135

PubMed Abstract | Crossref Full Text | Google Scholar

Yamaguchi, S. (2008). Gibberellin metabolism and its regulation. Annu. Rev. Plant Biol. 59, 225–251. doi: 10.1146/annurev.arplant.59.032607.092804

PubMed Abstract | Crossref Full Text | Google Scholar

Yamauchi, Y., Fukaki, H., Fujisawa, H., Tasaka, M. (1997). Mutations in the SGR4, SGR5 and SGR6 loci of arabidopsis thaliana alter the shoot gravitropism. Plant Cell Physiol. 38, 530–535. doi: 10.1093/oxfordjournals.pcp.a029201

PubMed Abstract | Crossref Full Text | Google Scholar

Yang, L., Wu, L., Chang, W., Li, Z., Miao, M., Li, Y., et al. (2018). Overexpression of the maize E3 ubiquitin ligase gene ZmAIRP4 enhances drought stress tolerance in Arabidopsis. Plant Physiol. Biochem. 123, 34–42. doi: 10.1016/j.plaphy.2017.11.017

PubMed Abstract | Crossref Full Text | Google Scholar

Yassue, R. M., Carvalho, H. F., Gevartosky, R., Sabadin, F., Souza, P. H., Bonatelli, M. L., et al. (2021). On the genetic architecture in a public tropical maize panel of the symbiosis between corn and plant growth-promoting bacteria aiming to improve plant resilience. Mol. Breed. 41. doi: 10.1007/s11032-021-01257-6

PubMed Abstract | Crossref Full Text | Google Scholar

Yassue, R. M., Galli, G., Chun-Peng, J. C., Fritsche-Neto, R., Gota, M. (2023). Genome-wide association analysis of hyperspectral reflectance data to dissect the genetic architecture of growth-related traits in maize under plant growth-promoting bacteria inoculation. Plant Direct 7. doi: 10.1002/pld3.492

PubMed Abstract | Crossref Full Text | Google Scholar

Yin, L. (2020). CMplot: circle manhattan plot. C. Circ. Manhattan Plot.

Google Scholar

Yoshida, T., Ohama, N., Nakajima, J., Kidokoro, S., Mizoi, J., Nakashima, K., et al. (2011). Arabidopsis HsfA1 transcription factors function as the main positive regulators in heat shock-responsive gene expression. Mol. Genet. Genomics 286, 321–332. doi: 10.1007/s00438-011-0647-7

PubMed Abstract | Crossref Full Text | Google Scholar

Yu, J. G., Lee, G. H., Park, Y. D. (2012). Physiological role of endogenous S-adenosyl-L-methionine synthetase in Chinese cabbage. Hortic. Environ. Biotechnol. 53, 247–255. doi: 10.1007/s13580-012-0021-7

Crossref Full Text | Google Scholar

Zeng, M. H., Liu, S. H., Yang, M. X., Zhang, Y. J., Liang, J. Y., Wan, X. R., et al. (2013). Characterization of a gene encoding clathrin heavy chain in maize up-regulated by salicylic acid, abscisic acid and high boron supply. Int. J. Mol. Sci. 14, 15179–15198. doi: 10.3390/ijms140715179

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, X., Ren, Z., Luo, B., Zhong, H., Ma, P., Zhang, H., et al. (2022). Genetic architecture of maize yield traits dissected by QTL mapping and GWAS in maize. Crop J. 10, 436–446. doi: 10.1016/j.cj.2021.07.008

Crossref Full Text | Google Scholar

Zheng, X., Levine, D., Shen, J., Gogarten, S. M., Laurie, C., Weir, B. S. (2012). A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–3328. doi: 10.1093/bioinformatics/bts606

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: SNP-array, genotyping by sequencing, simulated genome, GWAS, genotyping platforms, candidate genes, genes with similar functions

Citation: de Pontes FCF, Machado IP, Silveira MVdS, Lobo ALA, Sabadin F, Fritsche-Neto R and DoVale JC (2025) Combining genotyping approaches improves resolution for association mapping: a case study in tropical maize under water stress conditions. Front. Plant Sci. 15:1442008. doi: 10.3389/fpls.2024.1442008

Received: 01 June 2024; Accepted: 31 December 2024;
Published: 23 January 2025.

Edited by:

Parimalan Rangan, Indian Council of Agricultural Research (ICAR), India

Reviewed by:

Raul Pirona, National Research Council (CNR), Italy
Enéas Ricardo Konzen, Federal University of Rio Grande do Sul, Brazil

Copyright © 2025 de Pontes, Machado, Silveira, Lobo, Sabadin, Fritsche-Neto and DoVale. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fernanda Carla Ferreira de Pontes, ZmVybmFuZGFjZnBvbnRlc0BhbHUudWZjLmJy

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.