Identification of Ear Morphology Genes in Maize (Zea mays L.) Using Selective Sweeps and Association Mapping

Li, Ting; Qu, Jianzhou; Tian, Xiaokang; Lao, Yonghui; Wei, Ningning; Wang, Yahui; Hao, Yinchuan; Zhang, Xinghua; Xue, Jiquan; Xu, Shutu

doi:10.3389/fgene.2020.00747

ORIGINAL RESEARCH article

Front. Genet., 20 July 2020

Sec. Evolutionary and Population Genetics

Volume 11 - 2020 | https://doi.org/10.3389/fgene.2020.00747

This article is part of the Research TopicGenomics-Enabled Crop GeneticsView all 17 articles

Identification of Ear Morphology Genes in Maize (Zea mays L.) Using Selective Sweeps and Association Mapping

Ting Li^1,2†

Jianzhou Qu^1,2†

Xiaokang Tian^1,2

Yonghui Lao^1,2

Ningning Wei^1,2

Yahui Wang^1,2

Yinchuan Hao^1,2

Xinghua Zhang^1,2

Jiquan Xue^1,2*

Shutu Xu^1,2*

¹The Key Laboratory of Biology and Genetics Improvement of Maize in Arid Areas of the Northwest Region, Ministry of Agriculture, College of Agronomy, Northwest A&F University, Xianyang, China
²The Maize Engineering Technology Research Centre of Shaanxi Province, Yangling, China

The performance of maize hybrids largely depend on two parental inbred lines. Improving inbred lines using artificial selection is a key task in breeding programs. However, it is important to elucidate the effects of this selection on inbred lines. Altogether, 208 inbred lines from two maize heterosis groups, named Shaan A and Shaan B, were sequenced by the genotype-by-sequencing to detect genomic changes under selection pressures. In addition, we completed genome-wide association analysis in 121 inbred lines to identify candidate genes for ear morphology related traits. In a genome-wide selection scan, the inbred lines from Shaan A and Shaan B groups showed obvious population divergences and different selective signals distributed in 337 regions harboring 772 genes. Meanwhile, functional enrichment analysis showed those selected genes are mainly involved in regulating cell development. Interestingly, some ear morphology related traits showed significant differentiation between the inbred lines from the two heterosis groups. The genome-wide association analysis of ear morphology related traits showed that four associated genes were co-localized in the selected regions with high linkage disequilibrium. Our spatiotemporal pattern and gene interaction network results for the four genes further contribute to our understanding of the mechanisms behind ear and fruit length development. This study provides a novel insight into digging a candidate gene for complex traits using breeding materials. Our findings in relation to ear morphology will help accelerate future maize improvement.

Introduction

Maize (Zea mays L. ssp. mays), one of the most widely grown crops in the world, plays an essential role in global food security. It has been suggested that maize was domesticated from teosinte (Zea mays L. ssp. Parviglumis) about 10,000 years ago in the Balsas River Basin of southwestern Mexico (Schnable et al., 2009; Xiao et al., 2017). Originally, the teosinte was a wild plant with approximately 5 to 12 kernels in the ear and long tassel branches (Doebley et al., 1995; Wang et al., 1999). The number of kernels in modern maize varieties is far larger than that of teosinte and the long branches have disappeared. There are sharp distinctions between modern maize and teosinte in terms of plant morphology structure.

The morphological architecture of maize underwent a striking transformation mainly owing to selection by early agriculturalists and the local environment. This domestication stage continued for a long time. The original landraces appeared during this long-term domestication process (Yamasaki et al., 2005). In recent years, to increase production and quality levels, modern breeders have made great improvements to maize materials. Recent short-term artificial improvements of maize have resulted in the improvement of combining ability, including specific combining abilities and others (Souza et al., 2010; Viana et al., 2013). Besides phenotypic change, selection during the domestication and improvement processes also brought about reduced genetic diversity in selected genes (Yamasaki et al., 2005). In maize, diversity analyses showed a sequence diversity decrease in the promoter region of Teosinte branched1 (tb1) which affects long branches (Wang et al., 1999). Analysis of the genomic history of North American maize found that genetic separation and linkage disequilibrium increased with maize improvement (Van Heerwaarden et al., 2012). Selection transformed the genome and the shape of maize during domestication and improvement (Shi and Lai, 2015). Such changes in the genome and phenotype during the selection process provide an opportunity to study the influence of selected genes on agronomic traits.

A number of studies have explored the genetic variants of selected traits that arose during the selection process in various species. In soybeans, more than 800 differentiated regions, selected during the domestication process, were observed in the genome (Han et al., 2016). In chrysanthemum, about 550 genomic regions underwent selection amongst the different chrysanthemum types (Chong et al., 2017). In rice, a total of 200 regions with selective signatures were detected in two major indica subpopulations. A large number of genes with known functions in these selected regions are associated with crucial agronomic traits, particularly grain yield (Xie et al., 2015). In maize, there are 484 selected features arising from domestication and 695 from improvement (Hufford et al., 2012). In fact, the selected traits in different breeding populations are diverse during maize improvement. Thus, identifying the genetic variants underlying agronomic traits in different populations, which were developed during crop improvement, will help us understand more about selection effects and, in turn, lead to further crop improvements.

With the advent of new technologies and the exploitation of diverse analysis approaches, it is now possible to identify important genetic variants related to selected traits. Selective sweeps have been used to dissect selected regions in the above-mentioned studies. In general, the selection of a target trait is always accompanied by other agronomic traits due to the genetic hitch-hiking effect (Xu, 2018). The genome-wide association study (GWAS) has also been widely used to identify loci linked to target traits (Li et al., 2013; Chen et al., 2018). GWAS has a higher resolution to obtain casual genes, but some will be false positive. Hence, combining selective sweeps and GWAS is an efficient way to identify selected candidates in relation to specific traits. Furthermore, gene co-expression networks can be set up using gene expression data, which help associate genes of unknown function with biological processes and prioritize candidate select/target genes or discern transcriptional regulatory programs. This process has been carried out on data from multiple plants and has illuminated many key events during plant development (Sarkar et al., 2014; Huang et al., 2017; Wisecaver et al., 2017; Yu et al., 2017). Each approach has its unique advantages and disadvantages. Therefore, combining multiple analysis methods is likely to be an effective way of identifying the genetic mechanisms of selected traits.

Traits relating to ear architecture such as ear length (EL), fruit length (FL), setting rate (SR), and barren tip length (BTL) are essential for the improvement of grain yield during the breeding process. Two heterotic groups (Shaan A group and Shaan B group), with high heterosis between them, were established during 10-year breeding programs. Usually, inbreds from the Shaan A group act as the female population and those from the Shaan B group act as male. These sex differences have been utilized during selection to increase combining ability. In this study, we found that some traits arising from lines developed during long-term breeding selection from Shaan A and Shaan B groups showed significant differences, particularly in relation to EL, FL, BTL, and SR. This provides a good resource for the study of genetic variants in EL, FL, BTL, and SR. Here, we identified massive selective regions in the inbred lines from Shaan A and Shaan B groups, and we dissected the genetic mechanisms of EL, FL, BTL, and SR using this breeding population. In addition, we developed global gene co-expression networks and spatiotemporal-specific processes of genes in selected regions and genes governing significant agronomic traits. Our research revealed some candidate genes associated with ear development, which provides a valuable data resource for plant genetics research and breeding.

Materials and Methods

Plant Materials and SNP Genotyping

A total of 208 maize inbred lines (AM208) were collected including 54 inbred lines from the Shaan A group (A54) and 154 inbred lines from the Shaan B group (B154) (Supplementary Table S1). The Shaan A and Shaan B groups are derived from one basic population which was constructed using several excellent varieties. Then, the elite inbred Ye478 (Reid group) and HuangZaoSi (Tang SPT group) from China were used to pull the basic population into Shaan A group and Shaan B group, which were then improved over a period of approximately 10 years. Within the two heterosis groups, Shaan A and Shaan B play the role as female and male, respectively. The materials in the two groups were selected based on the number of harvest ears, grain weight per ear and seed rate (seed weight/ear weight) by planting in high density, low nitrogen and low irrigation conditions. New inbreds can be filtered by conducting a combining ability test which crosses more than two elite inbreds (including Zheng58 and Chang7-2). Two hundred and eight inbred lines were sampled at the 3-leaf stage and the DNA was extracted using a modified CTAB method (Murray and Thompson, 1980). The genotypes were determined using tGBS technology (Dara2bio; LLC, Ames, IA, United States) (Li et al., 2018). Overall, 48,432 SNPs were retained. Later, the 48,432 SNPs were inputted into Beagle version 4.1 analysis software (Browning and Browning, 2007, 2016) and filtered by a minor allele frequency (MAF) cutoff point of 5% using PLINK version 1.90 software (Purcell et al., 2007). A set of 32,306 high-quality SNPs (MAF ≥ 0.05) was retained for further analysis. In addition, 121 lines were selected from the AM208 population for construction of an association population (AM121), of which 26 lines (A26) belong to A54 and 95 (B95) belong to B154. Genotype data from the association population were filtered from the AM208 population genotype file resulting in 32,306 SNPs which were further screened using a MAF (≥0.05). Ultimately, 32,051 high-quality SNPs were applied to the GWAS.

Population Structure and LD Analyses

For the AM208, an UPGMA tree with 1000 bootstraps was constructed using MEGA 7.0 software and a set of 32,306 high-quality SNPs (Kumar et al., 2016). Principal component analysis (PCA) was performed using the GCTA tool on the high-quality SNPs (Yang et al., 2011). The output of the GCTA tool was inputted into R software so as to graphically display the PCA results. Additionally, A54 and B154 were separated to compute the linkage disequilibrium (LD) decay distance of each group. To avoid the effect of sample size, 54 inbreds (B54 re-sample) were selected from B154 100 times at random. The LD decay distance was calculated using PopLDdecay software (Zhang et al., 2019). For AM121, a kinship matrix was created and a PCA was performed using TASSEL software version 5.0 (Bradbury et al., 2007) and 32,051 SNPs (MAF ≥ 0.05).

Screening for Selective Regions and Genome-Wide Associations (GWAS)

Selective regions typically include two features, high differentiation and low diversity (Doebley et al., 2006). Therefore, the nucleotide diversity (π) ratios (A54/B154), the genetics differentiation (Fst) and the Tajima’s D statistic were calculated using a sliding-windows approach (100 kb non-overlapping sliding window) for A54 and B154 (Schmutz et al., 2014). We identified the low diversiry regions in A54 by the bottom 10% πA54/πB154 value and in B154 by the top 10% πA54/πB154 value. When a window is located on both the top 10% of the pool’s empirical distribution for Fst and the low diversity regions in A54 or B154, the window is considered to be a selective region in either the Shaan A or the Shaan B group.

Phenotypic data and 32,051 SNPs from AM121 were used to uncover the genetic architecture of the target traits using GWAS and a linear mixed model (P + K) in TASSEL v.5.0 software. The threshold was set to 1 × 10^–3 based on analysis results. When the associated regions of the significant SNPs relating to the target trait overlapped with the selected regions, these associated regions were considered to be candidate regions and subsequent analysis was performed. In these regions, all genes were identified using the MaizeGDB database¹. To compare selected genes with those from published data, all v4 selected gene IDs were converted to v3 gene IDs.

Field Experiment Design and Phenotypic Data Collection

For the association population, AM121 individuals were planted in two different fields, which were distinguished using the codes E1 and E2, in Yangling in Shaanxi Province, China in 2017. The experiment consisted of two replicates, each with two rows. Each row was 5 m long and 0.6 m wide. The planting density was 67,500 plants/ha. When the maize was mature, according to single ear weight, ten ears were selected in each plot in the first replicate to measure ear weight (EW, g), and five ears were selected to measure ear row number (ERN), ear diameter (ED, cm), kernel number per row (KNR), ear length (EL, cm), fruit length (FL, cm), barren tip length (BTL, cm) and the compute setting rate (SR, the ratio of FL to EL). The mean value per material for each parameter was calculated. Descriptive statistics, ANOVAs, and Pearson correlation analyses were conducted using SPSS v.22 software (IBM crop. Armonk, NY, United States). Broad-sense heritability (H²) was calculated as follows:

H = \frac{σ_{g}}{σ_{g} + \frac{σ_{e}}{k}},

where σ_g² is the genetic variance, σ_ç² represents error variance, and k is the number of environments (Hallauer and Miranda, 1981).

Gene Co-expression Network Analysis

A total of 78 maize B73 RNA-seq datasets from multiple tissues (seed, endosperm, embryo, leaf, ear, tassel, silk, cob, root, shoot, pollen, anther, and SAM) and developmental stages, reported and available in a public database², were used to construct gene co-expression networks (Chen et al., 2014; Li et al., 2014). Gene co-expression network analysis was performed using the R package WGCNA version 1.63 (Langfelder and Horvath, 2008). In addition, β was optimized to six to achieve a scale-free topology. Hierarchical clustering was used to identify gene modules with a dynamic tree-cutting algorithm based on the dissimilarity of gene connectivity. A NetworkAnalyzer plugin available in Cytoscape was used to calculate relevant network parameters, such as the degree of connection (Assenov et al., 2008). Modules were visualized using Cytoscape version 3.6.1.

Functional Annotation of Genes

To obtain more complete annotation information, all protein sequences of maize were mapped to Swiss-Prot/UniProt, Pfam, InterPro, Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) databases by eggnog-mapper. For a given gene set, the R package clusterprofiler was used to visualize GO terms (Yu et al., 2012). A GO term was considered significantly enriched if the adjusted p-value was lower than 0.05. The pathway maps were obtained from the KEGG database (Kanehisa et al., 2017). The pathway enrichment analysis was performed using KOBAS version 2.0, and a Benjamini and Hochberg adjusted p-value of 0.05 was used as the cut-off criterion (Xie et al., 2011).

Expression and Statistical Analysis

To avoid an infinite value, gene values expressed as zero were replaced with a value of 0.01. All of the samples were normalized using log₂ (expression values + 0.01). Hierarchical clustering analysis was performed using the R package pheatmap³ and Pearson’s correlation co-efficient as the distance measure. Additionally, a Venn diagram was drawn using the VennDiagram package in R (Chen and Boutros, 2011).

Results

Genetic Divergence Between Inbred Lines From Shaan A and Shaan B Groups

In order to enlarge the germplasm resource of maize and screen elite inbred lines, the two heterosis groups (Shaan A group and Shaan B group) were built and improved. A series of inbred lines from both groups were selected for breeding and genetic study during improvement. To reveal genetic divergence between inbred lines from the Shaan A and Shaan B groups at the genomic level, principal component analysis (PCA) and an UPGMA tree were conducted using 32,306 high-quality SNPs with a MAF greater than 0.05. In the PCA, most of the inbred lines from the Shaan A group were separated from the Shaan B group lines using the first two or three eigenvectors (Figures 1A,B). Inbreds from Shaan B group were dispersed. In the UPGMA tree, except for several inbreds, inbred lines from the Shaan A group also clustered together, as did inbred lines from the Shaan B group (Figure 1C). This illustrated that Shaan A group and Shaan B group have occurred divergence in the genome, but some inbreds from the Shaan A and Shaan B groups are more closely related. Therefore, continuous genetic improvement in the future is needed to achieve further population divergence. Furthermore, in terms of the LD decay distance (r² reached 0.2), AM208 was approximately 10 kb, and A54 had a longer LD decay distance even when we compared it to results of the same number of inbred lines when based on random sampling from B154. This indicates that B154 has a higher genetic diversity than A54 (Figure 1D).

FIGURE 1

Figure 1. Population characteristic of AM208. (A) Scatter plot for PC1 and PC2. (B) Scatter plot for PC1 and PC3. (C) Phylogenetic tree of AM208 was constructed with 32,306 SNPs. (D) Linkage disequilibrium (LD) decay of AM208, A54, and B154.

To identify the selective regions, the Fst and π ratio values between A54 and B154 were calculated using a 100kb step length. A54 had a lower nucleotide diversity (median, πA54/πB154 = 0.854) than B154. In addition, 91 selected regions (9.1 Mb) with both low diversity and high differentiation were discovered in A54, which included 199 protein-coding genes. However, in B154, a total of 246 (24.6 Mb) selected regions were identified, which contained 573 protein-coding genes (Figures 2A,B and Supplementary Table S2). The number of genomic regions with selective sweep signals in B154 was approximately 2.7 times that of A54. Moreover, selective regions in A54 focus on chromosome 3 and 7 (48, 52.75%), but are generally distributed on 10 chromosomes in B154. In addition, of these selective regions, 15 in A54 and 38 in B154 have no genes.

FIGURE 2

Figure 2. Genomic regions through long-term artificial selection in AM208 and Manhattan plot of EL and FL in Chromosome 1. (A) Distribution of genetics differentiation (Fst) values. (B) Distribution of nucleotide diversity (π) ratio (πA54/πB154). These values are calculated in 100kb sliding windows. Red lines represent the 90% tails of the empirical distribution of each stastic. (C) Region plot of EL in E1. (D) Region plot of EL in E2. (E) Region plot of FL in E1. (F) Region plot of FL in E2.

Overview of the Functional Diversity of Genes and the Co-expression Network in Selected Regions

To investigate the functional genes located in selected regions, GO analysis was performed. There were distinctive differences in the greatly enriched GO terms (p < 0.05) of selected genes between A54 and B154. Significantly, genes in the selected regions for B154 showed greater diversity in biological functions than those in A54, such as in hormone synthesis, cell development, and partial secondary metabolites (Supplementary Table S3). In addition, the selective regions located on specific chromosomes and the biological functional diversity of genes in different groups indicate that there is a chasm-like split in some inbred line phenotypes from the Shaan A and Shaan B groups.

To gain a deeper understanding of the effect of genes in the selected regions, a weighted gene co-expression network analysis (WGCNA) was performed using gene expression data from multiple tissue types and developmental stages, which is available from a public database (see text footnote 2). A total of 23 WGCNA modules were identified in the analysis (Figure 3A). Of these, the green module was the most distinct (Figures 3B,C). Notably, genes in the selected regions of A54 and B154 were mainly distributed in 18 modules except for genes not included in the WGCNA (Figure 3D). To make it easier to follow the relationships among genes in selected regions and other unselected genes, 199 genes from A54 and 573 genes from B154, in selected regions, were screened as potential co-expression genes based on a weight value higher than 0.6. A co-expression network was then constructed (Figure 4). In total, 1086 genes were found to interact with selected genes from A54 and B154. In addition, 452 genes interacted with selected genes from A54 and 1315 genes interacted with selected genes from B154 (Supplementary Table S4). GO enrichment analysis indicated that the common interactive genes from A54 and B154 were mainly enriched during tissue development, regulation from vegetative to reproductive states, and pollen germination. Furthermore, remarkable functional differences were noted between specific genes in the selected regions of A54 and B154 (Supplementary Figure S1 and Supplementary Table S5). These results not only show new non-described gene associations and allow the placement in a functional context of some unknown non-assigned genes based on their interactions with known gene families, but also show that improvement for maize may has further accelerated the polarization of A54 and B154 in changing certain maize characteristics (such as quality and resistance) to satisfy breeding requirements.

FIGURE 3

Figure 3. Gene modules identified by weighted gene co-expression network analysis (WGCNA). (A) Gene dendrogram obtained by clustering the dissimilarity based on consensus Topological Overlap with the corresponding module colors indicated by the color row. Each colored row represents a color-coded module which contains a group of highly connected genes. A total of 23 modules were identified. (B) Heatmap plot of topological overlap in the gene network. In the heatmap, each row and column corresponds to a gene, light color denotes low topological overlap, and progressively darker red denotes higher topological overlap. Darker squares along the diagonal correspond to modules. The gene dendrogram and module assignment are shown along the left and top. (C) A multi-dimensional scaling plot of genes indicates that the green module is the most distinct. (D) Bar plot of mean gene significance across modules, and the star tagged module represents where the genes of selected regions are distributed.

FIGURE 4

Figure 4. Co-expression network of genes in selected regions of A54 and B154. Graphical view of the co-expression network where the nodes correspond to genes and the edges to co-expression links. Different colored edges represent interaction modules of different selection genes. (A) Co-expression network of selected genes of A54, and selected genes marked in red, interactive genes were marked in green. (B) Co-expression network of selected genes of B154, and selected genes marked in red, interactive genes were marked in blue.

Genomic Association Analysis for Ear Morphology Components

Following functional enrichment analysis of the interactive genes, we found that these genes were mainly enriched during tissue development processes. During inbred selection, breeders give full attention to morphology and genetic diversity changes due to their importance for hybridization. Herein, we chose eight ear related traits containing ear length (EL), fruit length (FL), ear weight (EW), ear row number (ERN), ear diameter (ED), kernel number per row (KNR), barren tip length (BTL) and setting rate (SR) in order to observe phenotypic differences between 26 inbred lines (A26) from the Shaan A group and 95 inbred lines (B95) from the Shaan B group. Except for ERN and ED, there were significant differences in EL, FL, EW, SR, BTL, and KNR between A26 and B95 in the two environments (Figure 5). In addition, EL was significant positively correlated with FL (r = 0.95). However, SR and BTL were significant negatively correlated (Supplementary Figure S2). These results indicate that ear related traits were significantly altered in the Shaan A and Shaan B groups during population improvement process.

FIGURE 5

Figure 5. The comparison of eight traits between A26 and B95 in two environments. (A) Ear length (EL, cm). (B) Fruit length (FL, cm). (C) Barren tip length (BTL, cm). (D) Setting rate (SR). (E) Ear diameter (ED, cm). (F) Ear row number (ERN). (G) Kernel number per row (KNR). (H) Ear weight (EW, g). *, **, and *** indicate significant level at P < 0.05, P < 0.01 and P < 0.001, respectively. ns represents no significant difference.

In the AM121 population, the broad-sense heritability (H²) of the eight ear traits were higher than 60%, ranging from 65.33% for BTL to 79.09% for EW (Table 1). Considering the observed phenotypic differences between A26 and B95 and the correlation coefficient between traits, EL, FL, BTL and SR were chosen to help conduct the GWAS with 32,051 high-quality SNPs, analyzed using a linear mixed model (P + K). As shown in Supplementary Figure S3, the probability of a false positive result has been reduced in this study. In total, 84, 49, 47, and 50 significant SNPs were identified for EL, FL, SR, and BTL, respectively (Figure 6 and Supplementary Table S6). Among these SNPs, 5, 4, 3, and 0 SNPs in relation to EL, FL, BTL and SR were co-localized in two locations, respectively. Furthermore, we noticed that the majority of significant SNPs were co-localized between EL and FL, as were BTL and SR. Especially, three co-localized SNPs associated with EL and FL located at chromosome 1 were observed with the same p-value due to the high LD (r² = 1.0) (Figures 2C–F). In relation to these three significant SNPs, two different haplotypes (GGG, AAA) were identified. Haplotype 2 (AAA) had a longer EL and FL. Approximately 95.79% (91) of B95 was associated with Haplotype 1 (GGG) and 38.46% (10) of A26 was associated with Haplotype 2 (AAA) (Supplementary Table S7). In addition, none of co-localized significant SNPs were identified in relation to BTL and SR in both environments. The reason for this may be that BTL and SR have a lower heritability than EL and FL. According to the LD decay distance reported in previous research, the upstream and downstream 150 kb of the significant SNPs are regarded as associated regions (Li et al., 2018). Associated regions including the same genes were considered as one associated region for each trait. Finally, 404, 182, 207, 154 protein-coding genes (reference maize genome v4) distributed in these associated regions were found to be associated with EL, FL, BTL, and SR, respectively (Supplementary Table S6).

TABLE 1

Table 1. The basic statistics of ear length (EL), fruit length (FL), barren tip length (BTL) and setting rate (SR), ear weight (EW), ear diameter (ED), ear row number (ERN), and kernel number per row (KNR) of AM121 in two environments.

FIGURE 6

Figure 6. Result of genome-wide association study. Manhattan plots of EL in E1 (A) and in E2 (E), FL (B) and in E2 (F), BTL (C) and in E2 (G) and SR in E1 (D) and in E2 (H).

Gene Co-location: Combining Selective Sweeps and Association Analysis

By integrating selective sweeps and GWAS, we identified 5 regions related to EL or FL and another 5 associated regions related to BTL or SR overlapping with the selected regions (Table 2). There was a difference between the size of the selected window and the associated region. Therefore, the 10 associated regions were designated as candidate regions so as to avoid missing relevant genes. Interestingly, among these regions, the candidate region (26,738,827–27,038,887 bp), located on chromosome 1, was from the three co-localized SNPs related to EL and FL (above-mentioned) (Figures 2C–F). According to information from the reference genome sequence (www.maizegdb.org_v4), four protein-coding genes were found to be distributed in this 300 kb candidate region, which contains Zm00001d028216, Zm00001d028217, Zm00001d028218, and Zm00001d028219.

TABLE 2

Table 2. Co-localized regions detected by combining selective sweeps and GWAS.

Of the four candidate genes, Zm00001d028216 was annotated as indeterminate floral apex1 and synonyms with a C2C2-YABBY transcription factor, which has been found to be involved in regulating the determinacy of the floral meristem, spikelet pair meristems and spikelet meristems (Laudenciachingcuanco and Hake, 2002). The second gene, Zm00001d028217, was annotated as the developmental protein SEPALLATA 2 and descripted as a MADS transcription factor-MADS14. It has a wide range of functions, such as regulating flowering time, vegetative development and fruit ripening, especially in the meristem, and is related to floral organ identity (Cacharr et al., 1999; Heuer et al., 2001; Setter et al., 2001; Danilevskaya et al., 2008; Thompson and Hake, 2009; Zhang et al., 2012). The other two genes may have essential functions in defining the boundary of secondary walls and are described as follows: Zm00001d028218 encodes a methionine-tRNA ligase, which catalyzes a reversible chemical reaction [from ATP, L-methionine and tRNA (Met) to AMP, diphosphate and L-methionyl-tRNA (Met)] in cytosol; and Zm00001d028219 encodes a microtubule-associated protein (MAP70-2), which is closely related to MAP70-1 (Calder, 2010; Korolev et al., 2010; Oda and Fukuda, 2012). These results further indicate that the four genes we observed in our study are likely to play roles in the regulation of EL and FL, particularly in relation to the complex genetic mechanisms of EL and FL.

Temporal and Spatial Expression Patterns and a Co-expression Network of Key Genes Associated With EL and FL

To explore and characterize the expression patterns of four target genes of EL and FL in a complex tissue, public RNA-seq datasets were used to analyze temporal and spatial expression variations of these genes in a large and diverse group of maize B73 tissues. Interestingly, Zm00001d028216, Zm00001d028217, and Zm00001d028219 were expressed in specific tissues or developmental stages. However, the expression of Zm00001d028218 was not remarkably different between any developmental stages, yet its expression level was higher than the three other genes in almost all tissues (Figure 7). Zm00001d028216 and Zm00001d028217 exhibited synchronous expression patterns in some specific developmental stages or tissues. For example, they were highly expressed in the early developmental stages of endosperm, seed, mid-ear, tassel (5.7 mm), silk, and pericarp, and especially in the cob. In particular, both Zm00001d028216 and Zm00001d028217 showed higher expression levels in the specific developmental stages of parts of the ear, silk, and ovule (Figure 7). These expression results indicate that Zm00001d028216 and Zm00001d028217 are likely to be key regulators during the determination of the meristem and the later differentiation and development of the ear. The synchronization and similarity in the expression characteristics of these genes also suggest that they coordinate and regulate the development of certain tissues, but the hub genes between them remain unknown.

FIGURE 7

Figure 7. Temporal and spatial expression patterns of regulation genes of EL and FL. Different expression modules in same tissue represent different expression data sources, and the scale bar shows the normalized expression values corresponded to different modules. Additionally, different colored bars represent four different genes.

To gain further insight into the relationship between these genes, and the influence of external genes, genes that potentially interact with them were extracted from the co-expression network to construct sub-networks (based on a weight value greater than 0.2). The most important finding of the co-expression network was that some hub genes were involved in sub-networks that had Zm00001d028216 and Zm00001d028217 at the core (Figure 8). These hub genes were composed of 28 protein-coding genes, and included three transcription factors. They were mainly involved in cell differentiation, cellular developmental processes, and anatomical structure morphogenesis (Supplementary Table S8). Nevertheless, the other two genes constituted a relatively independent network, and they were functionally enriched in a number of critical developmental processes. This indicates that EL and FL are affected by diverse biological processes, and multilevel genes may be involved in communicating and adjusting the function balance among these hub genes.

FIGURE 8

Figure 8. Regulatory co-expression network of key genes related to EL and FL. Weighted gene co-expression gene network of multiple maize tissues, visualized using an organic layout in cytoscape. The nodes in the network were obtained by coloring each gene by its specific expression profile along maize tissues development. Different genes and their interactive genes were marked with different color. Of which, blue nodes represent weight value more than 0.5, orange nodes represent common interactive genes of Zm00001d028216 and Zm00001d028217.

Discussion

During crop improvement, phenotypic characteristic developments are dependent on market demand and breeder selection. Micro-evolution during the breeding process not only increases genetic divergence, but also changes genetic diversity. In this study, 32,306 high-quality SNPs were used to identify the effect of artificial improvement across the genome, and our analysis suggests that there has been genetic divergence between the Shaan A and B groups in AM208. These results indicate that artificial selection was effective in causing population divergence between Shaan A and Shaan B heterotic groups. This is consistent with previous findings that selection in breeding programs makes significant contributions to genetic divergence between heterotic groups (Liu et al., 2003). Moreover, B154 is more dispersed and has a shorter LD decay distance than A54. Also, the median π ratio (πA54/πB154 is equal to 0.854) further supports these differences. Our results also indicate that B154 contains a richer genetic diversity than A54. Additionally, it is particularly interesting that B154, which has a richer genetic diversity, has more selection regions. Our analysis provides new and important information on the breeding history genomics of Shaan A and Shaan B groups.

Furthermore, a subset of 337 regions including 772 genes in the genome that are related to artificial selection were identified. Within the 772 selected genes, approximately 24 candidate genes including Zm00001d028217 have been reported in a previous study to be associated with modern breeding selection (Jiao et al., 2012). In addition, 20 including Zm00001d028216 and 22 candidate genes overlapped with candidate genes related to maize domestication and improvement, respectively (Hufford et al., 2012; Supplementary Table S9). These combined results indicate that these genes are of high value for crop improvement. A large proportion of identified genes have not been reported because they are from different populations, are related to different focus traits from different breeders, or are identified using different genotyping technology. We found that there are distinctive differences in gene numbers in selected regions between A54 and B154. Enrichment analysis revealed that these genes are involved in regulating different essential metabolic processes during maize development (due to the different usage of their materials during the breeding process). Moreover, the co-expression networks captured important biological modules of selected genes between A54 and B154. These modules showed that these genes were not likely to be independently regulating these metabolic processes, and were also not the only key factors in causing the divergence in some target traits. Additionally, the selected genes and the interactive genes were only a little enriched in response to abiotic stress. This is likely to be because these inbreds were selected under the same conditions: low irrigation rates, low nitrogen levels, and high plant densities. The interactive genes with selected genes were enriched in the regulation of the timing of meristematic phase transition and transition from the vegetative to reproductive phase, as well as pollen germination. These are important targets for a female and male plant, implying that genotype and phenotype difference may be generate gradually during improvement of heterosis groups. Taken together, our results indicate that artificial selection alters consciously or unconsciously the alleles’ frequencies of target traits during improvement process.

In the maize breeding process, construction of heterosis groups is considered to be the most important element. Hybrid vigor remarkably contributes to the performance of hybrids compared to their parents (Birchler et al., 2003). Therefore, parents’ inbred lines may have obvious complementation in the trait and genome. EL, FL, BTL, and SR are vital to increasing yield. In this study, A54 performed longer EL and FL than B154. However, B154 showed higher SR (equivalent to shorter BTL) than A54. The combination crossing from these two groups may produce hybrids with long ears and high SR, such as the SD650 (KA105/KB024) and SD636 (KA103/KB043) hybrids that have been widely grown in Shaanxi province. In this study, four candidate genes related to EL and FL were identified by combining GWAS and a selective sweep. Among the four genes, Zm00001d028216 and Zm00001d028217 also were identified as selected genes in maize domestication and improvement processes (Hufford et al., 2012; Jiao et al., 2012). Moreover, Zm00001d028217 belongs to the MADS family and Zm00001d028216 belongs to the YABBY family, these two transcription factors play important roles in regulating spikelet development, floral induction, and inflorescence development (Cacharr et al., 1999; Laudenciachingcuanco and Hake, 2002; Danilevskaya et al., 2008; Thompson and Hake, 2009). Alternatively, Zm00001d028216 and Zm00001d028217 showed higher expression levels at the development stage and are known to regulate other genes involved in cell differentiation, cellular developmental processes, and anatomical structure morphogenesis. Therefore, it is meaningful and interesting to carry out further validation work on these four genes to understand the underlying molecular biology mechanism. In addition, short BTL (higher SR) is an desirable trait for high-yield breeding, however, genetic basis of BTL and SR remain poorly understand (Li et al., 2020). In the present study, only five associated regions related to SR and BTL overlapped with the selected regions. Moreover, many significant SNPs associated with SR and BTL were identified through association analysis. Therefore, this study has helped to uncover the genetic mechanism of SR and BTL, information which can be used to aid genetic improvements. Nonetheless, some associated regions related to SR and BTL were detected, and more research is needed to identify the functional genes, particularly in the overlapped region.

Genetic analysis for complex traits generally requires genetic populations with a diverse phenotype. This may lead to a situation where researchers collect diverse materials from multiple breeding programs and geographical areas but do not focus on studying breeding materials from a single origin. We used 208 inbred lines selected from the Shaan A and Shaan B groups that were derived from a common ancestral pool but used different testers. As breeding populations, Shaan A and Shaan B groups integrate a diverse germplasm and allele frequency which have been subjected to noteworthy changes due to long-term artificial selection. More importantly, as homozygotes, inbred lines from Shaan A and Shaan B groups can be used as ideal test materials for genetic studies and marker-assisted breeding. In future, genetic studies that use breeding populations will greatly progress breeding applications. Our study used these inbreds from a breeding population to investigate genetic divergence and we identified the selective regions related to traits. This not only provides insights into the effects of artificial selection across the genome for crop improvement but also conveys essential information on some important agronomic traits.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation, to any qualified researcher.

Author Contributions

SX and JX conceived and designed the experiments. XZ, YH, and XT contributed to field management. YW, YL, and NW contributed to the collection of phenotypic data and DNA extraction. TL and JQ analyzed the data and wrote the manuscript. SX contributed to revising the manuscripts. All authors read and approved the final manuscript.

Funding

This work was supported by the Innovation Project of National Key R&D Program of China (Grant No. 2018YFD0100200), the Young Scientists Fund of the National Natural Science Foundation of China (Grant No. 31701438), and the Shaanxi Province Key Research and Development Program of China (2017ZDCXL-NY-02-04). The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We are grateful to Prof. Jianming Yu and Prof. Qin Yang for revising the manuscript.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.00747/full#supplementary-material

FIGURE S1 | GO analysis of interactive genes with selected genes in A54 and B154. Histogram showing the significantly functional distribution of interactive genes with selected genes in A54 and B154, respectively. The scale bar represents the significant levels.

FIGURE S2 | Data distribution and correlation analysis of eight traits. ^∗, ^∗∗, and ^∗∗∗ indicate a significant correlation at P < 0.05, P < 0.01, and P < 0.001 respectively.

FIGURE S3 | QQ-plots of eight ear related traits in two locations. QQ-plot of EL in E1 (A) and E2 (E). QQ-plot of FL in E1 (B) and E2 (F). QQ-plot of BTL in E1 (C) and E2 (G). QQ-plot of SR in E1 (D) and E2 (H).

TABLE S1 | The material information of AM208.

TABLE S2 | The list of selective regions and genes in A54 and B154, respectively.

TABLE S3 | GO enrichment analysis of selected genes from Shaan A and B groups.

TABLE S4 | Interaction between selected genes and other genes.

TABLE S5 | GO enrichment analysis of interactive genes of selected genes.

TABLE S6 | Results of genome-wide association study for EL, FL, BTL, and SR in two environments.

TABLE S7 | Haplotype effect of three significant SNPs related to EL and FL.

TABLE S8 | Functional enrichment analysis of key genes regulating EL and FL.

TABLE S9 | Common selected genes in maize domestication and improvement processes compared to previous studies.

Footnotes

References

Assenov, Y., Ramírez, F., Schelhorn, S. E., Lengauer, T., and Albrecht, M. (2008). Computing topological parameters of biological networks. Bioinformatics 24, 282–284. doi: 10.1093/bioinformatics/btm554

PubMed Abstract | CrossRef Full Text | Google Scholar

Birchler, J. A., Auger, D. L., and Riddle, N. C. (2003). In search of the molecular basis of heterosis. Plant Cell 15, 2236–2239. doi: 10.1105/tpc.151030

PubMed Abstract | CrossRef Full Text | Google Scholar

Bradbury, P. J., Zhang, Z., Kroon, D. E., Casstevens, T. M., Ramdoss, Y., and Buckler, E. S. (2007). TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635. doi: 10.1093/bioinformatics/btm308

PubMed Abstract | CrossRef Full Text | Google Scholar

Browning, B., and Browning, S. (2016). Genotype imputation with millions of reference samples. Am. J. Hum. Genet. 98, 116–126. doi: 10.1016/j.ajhg.2015.11.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Browning, S. R., and Browning, B. L. (2007). Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet. 81, 1084–1097. doi: 10.1086/521987

PubMed Abstract | CrossRef Full Text | Google Scholar

Cacharr, N. J., Saedler, H., and Theissen, G. (1999). Expression of MADS box genes ZMM8 and ZMM14 during inflorescence development of Zea mays discriminates between the upper and the lower floret of each spikelet. Dev. Genes Evol. 209, 411–420. doi: 10.1007/s004270050271

PubMed Abstract | CrossRef Full Text | Google Scholar

Calder, G. (2010). The microtubule-associated protein AtMAP70-5 regulates secondary wall patterning in Arabidopsis wood cells. Curr. Biol. 20, 744–749. doi: 10.1016/j.cub.2010.02.057

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, H., and Boutros, P. C. (2011). VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics 12:35. doi: 10.1186/1471-2105-12-35

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J., Zeng, B., Zhang, M., Xie, S., Wang, G., Hauck, A., et al. (2014). Dynamic transcriptome landscape of maize embryo and endosperm development. Plant Physiol. 166, 252–264. doi: 10.1104/pp.114.240689

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Q., Han, Y., Liu, H., Wang, X., Sun, J., Zhao, B., et al. (2018). Genome-wide association analyses reveal the importance of alternative splicing in diversifying gene function and regulating phenotypic variation in maize. Plant Cell 30, 1404–1423. doi: 10.1105/tpc.18.00109

PubMed Abstract | CrossRef Full Text | Google Scholar

Chong, X., Fei, Z., Wu, Y., Yang, X., and Chen, F. (2017). A SNP-enabled assessment of genetic diversity, evolutionary relationships and the identification of candidate genes in chrysanthemum. Genome Biol. Evol. 8, 3661–3671.

Google Scholar

Danilevskaya, O. N., Meng, X., Selinger, D. A., Deschamps, S., Hermon, P., Vansant, G., et al. (2008). Involvement of the MADS-box gene ZMM4 in floral induction and inflorescence development in maize. Plant Physiol. 147, 2054–2069. doi: 10.1104/pp.107.115261

PubMed Abstract | CrossRef Full Text | Google Scholar

Doebley, J., Stec, A., and Gustus, C. (1995). teosinte branched1 and the origin of maize: evidence for epistasis and the evolution of dominance. Genetics 141, 333–346.

Google Scholar

Doebley, J. F., Gaut, B. S., and Smith, B. D. (2006). The molecular genetics of crop domestication. Cell 127, 1309–1321. doi: 10.1016/j.cell.2006.12.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Hallauer, A. R., and Miranda, F. J. B. (1981). Quantitative genetics in maize breeding. Q. Rev. Biol. 6, 124–126.

Google Scholar

Han, Y., Zhao, X., Liu, D., Li, Y., Lightfoot, D. A., Yang, Z., et al. (2016). Domestication footprints anchor genomic regions of agronomic importance in soybeans. New Phytol. 209:871. doi: 10.1111/nph.13626

PubMed Abstract | CrossRef Full Text | Google Scholar

Heuer, S., Hansen, S., Bantin, J., Brettschneider, R., Kranz, E., Lörz, H., et al. (2001). The maize MADS box gene ZmMADS3 affects node number and spikelet development and is co-expressed with ZmMADS1 during flower development, in egg cells, and early embryogenesis. Plant Physiol. 127, 33–45. doi: 10.1104/pp.127.1.33

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, J., Vendramin, A. S., Shi, L., and Mcginnis, K. (2017). Construction and optimization of large gene co-expression network in maize using RNA-Seq data. Plant Physiol. 175, 568–583. doi: 10.1104/pp.17.00825

PubMed Abstract | CrossRef Full Text | Google Scholar

Hufford, M. B., Xu, X., Heerwaarden, J. V., Pyhäjärvi, T., Chia, J. M., Cartwright, R. A., et al. (2012). Comparative population genomics of maize domestication and improvement. Nat. Genet. 44, 808–811.

Google Scholar

Jiao, Y., Zhao, H., Ren, L., Song, W., Zeng, B., Guo, J., et al. (2012). Genome-wide genetic changes during modern breeding of maize. Nat. Genet. 46, 812–815. doi: 10.1038/ng.2312

PubMed Abstract | CrossRef Full Text | Google Scholar

Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y., and Morishima, K. (2017). KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–D361.

Google Scholar

Korolev, A. V., Chan, J., Naldrett, M. J., Doonan, J. H., and Lloyd, C. W. (2010). Identification of a novel family of 70&emsp;kDa microtubule-associated proteins in Arabidopsis cells. Plant J. 42, 547–555. doi: 10.1111/j.1365-313x.2005.02393.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, S., Stecher, G., and Tamura, K. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874. doi: 10.1093/molbev/msw054

PubMed Abstract | CrossRef Full Text | Google Scholar

Langfelder, P., and Horvath, S. (2008). WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9:559. doi: 10.1186/1471-2105-9-559

PubMed Abstract | CrossRef Full Text | Google Scholar

Laudenciachingcuanco, D., and Hake, S. (2002). The indeterminate floral apex1 gene regulates meristem determinacy and identity in the maize inflorescence. Development 129, 2629–2638.

Google Scholar

Li, G., Wang, D., Yang, R., Logan, K., Chen, H., Zhang, S., et al. (2014). Temporal patterns of gene expression in developing maize endosperm identified through transcriptome sequencing. Proc. Natl. Acad. Sci. U.S.A. 111, 7582–7587. doi: 10.1073/pnas.1406383111

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, H., Peng, Z., Yang, X., Wang, W., Fu, J., Wang, J., et al. (2013). Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels. Nat. Genet. 45, 43–50. doi: 10.1038/ng.2484

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, T., Qu, J., Wang, Y., Chang, L., He, K., Guo, D., et al. (2018). Genetic characterization of inbred lines from Shaan A and B groups for identifying loci associated with maize grain yield. BMC Genet. 19:63. doi: 10.1186/s12863-018-0669-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Z., Liu, P., Zhang, X., Zhang, Y., Ma, L., Liu, M., et al. (2020). Genome-wide association studies and QTL mapping uncover the genetic architecture of ear tip-barrenness in maize. Physiol. Plant. [Epub ahead of print]. doi: 10.1111/ppl.13087

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, K., Goodman, M., Muse, S., Smith, J. S., Buckler, E., and Doebley, J. (2003). Genetic structure and diversity among maize inbred lines as inferred from DNA microsatellites. Genetics 165, 2117–2128.

Google Scholar

Murray, M. G., and Thompson, W. F. (1980). Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 8, 4321–4325.

Google Scholar

Oda, Y., and Fukuda, H. (2012). Secondary cell wall patterning during xylem differentiation. Curr. Opin. Plant Biol. 15, 38–44. doi: 10.1016/j.pbi.2011.10.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi: 10.1086/519795

PubMed Abstract | CrossRef Full Text | Google Scholar

Sarkar, N. K., Kim, Y. K., and Grover, A. (2014). Coexpression network analysis associated with call of rice seedlings for encountering heat stress. Plant Mol. Biol. 84, 125–143. doi: 10.1007/s11103-013-0123-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmutz, J., Mcclean, P. E., Mamidi, S., Wu, G. A., Cannon, S. B., Grimwood, J., et al. (2014). A reference genome for common bean and genome-wide analysis of dual domestications. Nat. Genet. 46, 707–713.

Google Scholar

Schnable, P. S., Ware, D., Fulton, R. S., Stein, J. C., Wei, F., Pasternak, S., et al. (2009). The B73 maize genome: complexity, diversity, and dynamics. Science 326, 1112–1115. doi: 10.1126/science.1178534

PubMed Abstract | CrossRef Full Text | Google Scholar

Setter, T. L., Flannigan, B. A., and Melkonian, J. (2001). Loss of kernel set due to water deficit and shade in maize. Crop Science 41, 1530–1540. doi: 10.2135/cropsci2001.4151530x

CrossRef Full Text | Google Scholar

Shi, J., and Lai, J. (2015). Patterns of genomic changes with crop domestication and breeding. Curr. Opin. Plant Biol. 24, 47–53. doi: 10.1016/j.pbi.2015.01.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Souza, C. L. D., Barrios, S. C. L., and Moro, G. V. (2010). Performance of maize single-crosses developed from populations improved by a modified reciprocal recurrent selection. Sci. Agric. 67, 198–205. doi: 10.1590/s0103-90162010000200011

CrossRef Full Text | Google Scholar

Thompson, B. E., and Hake, S. (2009). Bearded-ear encodes a MADS box transcription factor critical for maize floral development. Plant Cell 21:2578. doi: 10.1105/tpc.109.067751

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Heerwaarden, J., Hufford, M. B., and Ross-Ibarra, J. (2012). Historical genomics of North American maize. Proc. Natl. Acad. Sci. U.S.A. 109, 12420–12425. doi: 10.1073/pnas.1209275109

PubMed Abstract | CrossRef Full Text | Google Scholar

Viana, J. M. S., DeLima, R. O., Mundim, G. B., Cond, A. B. T., and Vilarinho, A. A. (2013). Relative efficiency of the genotypic value and combining ability effects on reciprocal recurrent selection. Theor. Appl. Genet. 126, 889–899. doi: 10.1007/s00122-012-2023-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, R., Stec, A., Hey, J., Lukens, L., and Doebley, J. (1999). The limits of selection during maize domestication. Nature 398, 236–239. doi: 10.1038/18435

PubMed Abstract | CrossRef Full Text | Google Scholar

Wisecaver, J. H., Borowsky, A. T., Tzin, V., Jander, G., Kliebenstein, D. J., and Rokas, A. (2017). A global co-expression network approach for connecting genes to specialized metabolic pathways in plants. Plant Cell 29, 944–959. doi: 10.1105/tpc.17.00009

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiao, Y., Liu, H., Wu, L., Warburton, M., and Yan, J. (2017). Genome-wide association studies in maize: praise and stargaze. Mol. Plant 10, 359–374. doi: 10.1016/j.molp.2016.12.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie, C., Mao, X., Huang, J., Ding, Y., Wu, J., Dong, S., et al. (2011). KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 39, 316–322.

Google Scholar

Xie, W., Wang, G., Yuan, M., Yao, W., and Zhang, Q. (2015). Breeding signatures of rice improvement revealed by a genomic variation map from a large germplasm collection. Proc. Natl. Acad. Sci. U.S.A. 112:E5411.

Google Scholar

Xu, G. (2018). Genetic Basis of Artificial Selection Response in High-Oil Maize. Beijing: China Agricultural University.

Google Scholar

Yamasaki, M., Tenaillon, M. I., Bi, I. V., Schroeder, S. G., Sanchez-Villeda, H., Doebley, J. F., et al. (2005). A large-scale screen for artificial selection in maize identifies candidate agronomic loci for domestication and crop improvement. Plant Cell 17:2859. doi: 10.1105/tpc.105.037242

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, J., Lee, S. H., Goddard, M. E., and Visscher, P. M. (2011). GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82. doi: 10.1016/j.ajhg.2010.11.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G., Wang, L.-G., Han, Y., and He, Q. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287. doi: 10.1089/omi.2011.0118

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, H., Jiao, B., and Liang, C. (2017). Systematic analysis Of RNA-Seq-based gene co-expression across multiple plants. bioRxiv [Preprint]. doi: 10.1101/139923

CrossRef Full Text | Google Scholar

Zhang, C., Dong, S.-S., Xu, J.-Y., He, W.-M., and Yang, T.-L. (2019). PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 35, 1786–1788. doi: 10.1093/bioinformatics/bty875

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Z., Li, H., Zhang, D., Liu, Y., Jing, F., Shi, Y., et al. (2012). Characterization and expression analysis of six MADS-box genes in maize (Zea mays L.). J. Plant Physiol. 169, 797–806. doi: 10.1016/j.jplph.2011.12.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: population characteristics, selective sweeps, genome-wide association study, co-expression, ear morphology

Citation: Li T, Qu J, Tian X, Lao Y, Wei N, Wang Y, Hao Y, Zhang X, Xue J and Xu S (2020) Identification of Ear Morphology Genes in Maize (Zea mays L.) Using Selective Sweeps and Association Mapping. Front. Genet. 11:747. doi: 10.3389/fgene.2020.00747

Received: 06 March 2020; Accepted: 23 June 2020;
Published: 20 July 2020.

Edited by:

Yin Li, Rutgers, The State University of New Jersey, United States

Reviewed by:

Chenxu Liu, China Agricultural University, China
Rohit Kumar, Clemson University, United States

Copyright © 2020 Li, Qu, Tian, Lao, Wei, Wang, Hao, Zhang, Xue and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jiquan Xue, eGpxMjkzNEAxNjMuY29t; Shutu Xu, c2h1dHV4dUBud2FmdS5lZHUuY24=

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.