- Corporación Colombiana de Investigación Agropecuaria, Corpoica, Centro de Investigación Tibaitatá, Mosquera, Colombia
Association mapping has been proposed as an efficient approach to assist plant breeding programs to investigate the genetic basis of agronomic traits. In this study, we evaluated 18 traits related to yield, (FWP, NF, FWI, and FWII), fruit size-shape (FP, FA, MW, WMH, MH, HMW, DI, FSI, FSII, OVO, OBO), and fruit quality (FIR, CF, and SST), in a diverse collection of 100 accessions of Physalis peruviana including wild, landraces, and anther culture derived lines. We identified seven accessions with suitable traits: fruit weight per plant (FWP) > 7,000 g/plant and cracked fruits (CF) < 4%, to be used as parents in cape gooseberry breeding program. In addition, the accessions were also characterized using Genotyping By Sequencing (GBS). We discovered 27,982 and 36,142 informative SNP markers based on the alignment against the two cape gooseberry references transcriptomes. Besides, 30,344 SNPs were identified based on alignment to the tomato reference genome. Genetic structure analysis showed that the population could be divided into two or three sub-groups, corresponding to landraces-anther culture and wild accessions for K = 2 and wild, landraces, and anther culture plants for K = 3. Association analysis was carried out using a Mixed Linear Model (MLM) and 34 SNP markers were significantly associated. These results reveal the basis of the genetic control of important agronomic traits and may facilitate marker-based breeding in P. peruviana.
Introduction
Physalis peruviana L is also known as cape gooseberry, golden berry, ground cherry, rasbhari, and winter cherry in different parts of the world. It is an exotic fruit that belongs to the Solanaceae family and is well-known for its nutritional value (high contents of vitamins A, C, and B), micronutrient content (phosphorus, calcium, and iron), and antioxidant, anti-inflammatory, and anti-hepatotoxic activities (Wu et al., 2006; Ramadan, 2011; Ramadan et al., 2015). P. peruviana fruits are desirable for confections, dried-fruit snacks, and fresh consumption. Colombia is the world's top producer of this fruit, followed by South Africa (Bonilla et al., 2009). It is the second most exported fruit from Colombia, trailing only the banana. In 2014, 13,260 tons were harvested, mainly from the departments Boyacá, Antioquia, and Cundinamarca. Exports to the Netherlands, Germany, Belgium, and Canada accounted for 5,852 tons with sales of 30 million dollars (Agronet, 2016). Despite great promise, production yield decreased from 13.76 t/ha in 2010 to 9.81 t/ha in 2014 (Agronet, 2016) partly as a result of the vascular wilt disease caused by Fusarium oxysporum (Cotes et al., 2012). Additionally, 20–45% of harvested fruits are discarded because of cracking problems (Fischer, 2005) and 15% of the fruit production does not satisfy the standards of size and quality required for export (Valdenegro et al., 2013), significantly reducing the volume of exportable fruit.
There is a desire to develop varieties with high fruit yield and quality, especially those with resistance to cracking and that meet the standards of the size required for the market. Given the success in related species such as tomato (Solanum lycopersicum L.), where crack-resistant material has been developed (Matas et al., 2004), it seems likely that similar improvement may be expected for the cape gooseberry. Developing resources and increasing genetic knowledge on fruit quality and yield characteristics in the cape gooseberry will accelerate the time of development of new varieties, facilitating the identity of the cultivar, the evaluation of genetic diversity, the selection of parents, and the confirmation of hybrids with the use of Marker Assisted Selection (MAS) (Chhetri et al., 2017; Favoretto et al., 2017).
The identification of SNP markers responsible for natural phenotypic variation may be detected with Association Mapping (AM) (Soto-cerda and Cloutier, 2012; Xu et al., 2017). AM is a strategy based on Linkage Disequilibrium (LD) that ultimately seeks to identify specific functional variants linked to phenotypic differences in a particular trait. The polymorphisms in the DNA sequence responsible for phenotypic change can be detected and then introgressed into crop germplasm (Flint-Garcia et al., 2003; Oraguzie et al., 2003; Abe et al., 2012; Xu et al., 2017). AM uses unstructured populations for trait mapping based on the strength of high-throughput genotyping and phenotypic characterization. Genotyping By Sequencing (GBS) has allowed high-throughput identification of molecular markers in the rose (Heo et al., 2017), apple (Norelli et al., 2017), pepper (Taranto et al., 2016), and pigeonpea (Saxena et al., 2017) at low costs (Voss-fels and Snowdon, 2016). The success of GBS in maize (Elshire et al., 2011), potato (Uitdewilligen et al., 2013), sesame (Uncu et al., 2016), and wheat (Poland et al., 2012; Kobayashi et al., 2016) suggest a role for the technique in non-model specialty crops. This technique is based on the reduction of genome complexity through methylation-sensitive restriction enzymes (Elshire et al., 2011), making it possible to search for polymorphisms in species with large genomes, high diversity, or without a reference genome (Poland and Rife, 2012). The utility of GBS in the cape gooseberry, a species without a reference genome, has previously been demonstrated in the identification of candidate genes associated with the resistance response to F. oxysporum (Osorio-Guarín et al., 2016), using tomato (S. lycopersicum) as a reference genome for the SNP calling process.
The phylogeny reconstruction carried out by Garzón-Martínez et al. (2012) demonstrated a close relation between P. peruviana and S. lycopersicum. The tomato is a diploid species with a haploid set of 12 chromosomes and a small genome (950 Mb), encoding ~35,000 genes that are sequestered mainly in the adjacent euchromatic region (Barone et al., 2008). In contrast, the cape gooseberry has a chromosomic complement of 2n = 4x = 48 and a large genome size ranging from 1410.77 to 1985.34 Mb (Liberato et al., 2015). Despite considerable differences in genome size, the comparative analysis conducted in Solanaceae family by Wang et al. (2008) revealed high-degree sequence synteny in chromosomal regions with small-scale differences between species, as a result of nucleotide substitutions, insertions, deletions, tandem duplications of individual genes, inversions, and transpositions. Therefore, tomato genes could be conserved in the cape gooseberry.
GBS is a useful approach for analyze the genetic diversity, population structure, and offers an ultimate MAS tool to accelerate plant breeding. This is the first time that GBS methodology was implemented in cape gooseberry for mapping SNP markers of fruit quality traits. The objectives of this study were to: (1) evaluate 18 phenotypic traits of 100 cape gooseberry accessions from the Corporación Colombiana de Investigación Agropecuaria—Corpoica germplasm collection; (2) examine the level of genetic diversity and the population structure within the cape gooseberry collection; and (3) identify candidate genes/SNPs significantly associated with yield, fruit size, and quality.
Materials and Methods
Plant Material and Experimental Design
Cape gooseberry accessions were selected from the germplasm collection maintained at the Corporación Colombiana de Investigación Agropecuaria—Corpoica. The collection consisted of 100 accessions based on 77 accessions reported by Osorio-Guarín et al. (2016) and 23 new accessions derived from anther culture germplasm that included doubled haploids and haploids accessions (Table S1).
The plantlets were propagated clonally by in vitro subculturing of node cuttings at 4–6 weeks using Murashige and Skoog (MS) medium supplemented with 0.1 mg L−1 GA3. The plantlets were acclimated in a chamber with 80% of relative humidity. After acclimation, those plantlets were cultivated using a triple lattice experimental design at the Corpoica Tibaitatá research center (4°40′55.5″N, 74°12′12.4″). The accessions were grown in three replicates using a row-to plant and plant to plant spacing of 2 m. Three plants per accession were grown per replication. The Kenyan and Colombian ecotypes were used as reference accessions (Peña et al., 2011; Criollo et al., 2014).
Phenotyping
A total of 18 traits evaluated were distributed into three categories: yield, fruit size-shape, and fruit quality (Tables 1, 2). For the yield category: Fruits Weight per Plant (FWP, weight of fruits during all harvests), Plant Fruit Number (NF, the number of fruits during all harvests), and Fruit Weight with and without calyx traits (FWI-FWII, mean weight of 10 fruits in each harvest) were evaluated in eight harvests.
For fruit size-shape category, nine fruits from each accession per replicate were evaluated and cut longitudinally through the center, placed cut-side down on a Hewlett Packard® C9866A and digitalized at 200 dots per inch. The Tomato Analyzer software v3.0 (Rodríguez et al., 2010) was used to measure: Fruit Perimeter (FP), Fruit Area (FA), Width at Mid Height (WMH, the width measured at ½ of the fruit's height), Maximum Width (MW, the maximum horizontal distance of the fruit), Height at Mid Width (HMW, the height measured at ½ of the fruit's width), and Maximum Height (MH, the maximum vertical distance of the fruit). For fruit shape, the following traits were measured: Distal end Indentation Area (DI, distal end indentation area relative to total fruit area), the Fruit Shape Index External I (FSI, the ratio of the maximum height to the maximum width), the Fruit Shape Index external II (FSII, the ratio of height mid-width to width mid-height), and the Asymmetry As Ovoid (OVO) when the area of the fruit is higher above mid-height than below it or Obovoid (OBO) when the area of the fruit is greater below mid-height than above it.
For fruit quality category, measurements were assessed in eight harvests except for firmness which was evaluated in six harvests in the population. The fruit was collected at the optimal harvest point corresponding to maturity stage 3 as determined by the color according to the NTC 4580 Standard (Instituto Colombiano de Normas Técnicas-Icontec, 1999). The percentage of Cracked Fruits (CF) per plant was determined based on the average of cracked fruits in all harvests and the Firmness (FIR) was measured using Chatillon TDC200 Digital Force Tester. Soluble Solids Concentration (SST) was measured as °Brix with a hand-held refractometer ATAGO PAL1 on a minimum of 3 mature fruits per accession per replicate.
Statistical Analysis of Phenotypic Data
Analysis of variance (ANOVA) was performed using the General Linear Model (GLM) procedure in SAS software (SAS Institute, Cary, NC) to determine the existence of significant differences between accessions for the quantitative traits evaluated. The model tested was Y = G + Rep + Error, with all factors considered fixed. Principal Component Analysis (PCA) and the Cluster Analysis (CA) by the Ward method (semi-partial R2 = 0.10) were also conducted with SAS software. Correlations among traits, were detected using Pearson's correlation coefficient (r) at P = 0.05. Broad-sense heritability () of all traits was calculated using the formula as described by Allard (1960) as follow: [(σ2G)/(σ2P)] × 100, where: σ2G = Genotypic variance; σ2P = Phenotypic variance.
Genotyping and SNP Markers Calling
Genomic DNA of 77 accessions was previously isolated by Osorio-Guarín et al. (2016). The DNA of the 23 new accessions was isolated from 100 mg of leaf tissue collected from in vitro grown plants. Tissue was macerated in liquid nitrogen using a mortar and pestle. DNA extraction was performed using the DNeasy Plant Mini Kit (Qiagen, Valencia, USA) according to the manufacturer's protocol. Total DNA was quantified by NanoDrop 1000 spectrophotometer (NanoDrop Technologies, Wilmington, DE, USA), and the quality was checked through restriction enzyme digestion with HindIII enzyme and visualized by electrophoresis using 2% agarose gels. The GBS libraries were constructed using the restriction enzyme ApekI (GCWGC) and sequenced twice with the Illumina HiSeq (Illumina Inc. San Diego, CA) next-generation sequencing platform at the Cornell Genomic Diversity Facility.
SNP calling was performed using the Tassel-GBS pipeline v5.0 (Bradbury et al., 2007). A filtered HapMap was created with the following parameters: minimum minor allele frequency (mnMAF) of 0.05, minimum locus coverage (mnLCov) of 0.8, minimum taxon coverage (mnTCov) of 0.3, and minimum site coverage (mnSCov) of 0.7.
Genetic Diversity and Population Structure
Standard measures of diversity including Expected Heterozygosity (HE), Observed Heterozygosity (HO), and Polymorphic Information Content (PIC) were calculated by PowerMarker v3.2 (Liu and Muse, 2005) using filtered SNP markers. The alignment was realized with Bowtie2 (Langmead and Salzberg, 2012) against the two cape gooseberry references transcriptomes (leaf and root NCBI Bioproject: PRJNA67621) and the tomato reference genome version SL2.40.
Population structure analysis was carried out based on software Admixture v1.3.0 (Alexander and Novembre, 2009) in an unsupervised mode. This program estimates individual admixture proportions from multi-locus SNP data using a maximum-likelihood method. It employs a similar statistical model as the program Structure (Pritchard et al., 2000) but uses fast numerical optimization algorithm to achieve greater speed. This computational efficiency provides an advantage mainly when using very large numbers of markers and individuals (Liu et al., 2013). The number of populations (K) was set from 1 to 10 and the K optimum was selected based on the cross-validation error compared to other K values. The Q estimates (Q matrix) of the K optimum was used to the association mapping and was plotted in R software (R Team, 2014). To corroborate the population structure, the Neighbor-Joining algorithm was used for cluster analyses based on the Nei's genetic distance. Three-dimensional scatter plot was carried out with the results of the Principal Component Analysis (PCA) which was performed using Tassel v5.0 based on N x SNP matrix (Bradbury et al., 2007).
Association Analysis
Linkage disequilibrium for each marker pair was calculated using r2 parameter with sliding windows size of 50 sites (bp) through the software Tassel v5.0 (Bradbury et al., 2007). Associations between molecular markers and phenotypic data were computed using the Genome Association and Prediction Integrated Tool—GAPIT (Lipka et al., 2012) based on the Mixed Linear Model (MLM) that controls the population structure and genetic relatedness among the individuals by incorporating the Q and K matrices. The kinship coefficients (K matrix) between individuals were estimated according to the method of Loiselle (Loiselle et al., 1995). For the association analysis, the non-normal dataset of phenotypic traits was transformed with Box-Cox transformation procedure using the software Statistica v12.0 (Statsoft Inc., Tulsa, USA). The p-values were adjusted with multiple testing, according to Benjamini and Hochberg (1995), to control the False Discovery Rate (FDR). The amount of phenotypic variation explained by each marker was estimated by r2. Associations were considered significant when p ≤ 0.0001 or LOD scores greater than 4.0. Finally, the biological function of the associated markers was identified in JBrowse environment from Sol Genomics Network (SGN) (Fernandez-Pozo et al., 2015) using tomato genome version SL2.40 and the ITAG annotation version 2.4.
Results
Phenotyping
The traits related to yield, fruit size-shape, and fruit quality categories showed a high coefficient of variation (CV), suggesting both phenotypic variations in the germplasm and representativeness of the gene pool. The mean squares values from the ANOVA of the 18 quantitative traits for the 100 accessions showed highly significant differences (p ≤ 0.0001) for all of the studied characteristics (Table 1). In contrast, there were no significant differences between the replications for the DI, FSI, FSII, OVO, OBO, CF, and FIR traits. The variability for the more important traits measured as %CV ranged from 3.47 to 10.54%, and the traits DI, NF, FWP, OBO, and CF exhibited higher levels of variation with 42.65, 46.18, 47.45, 71.34, and 88.66%, respectively (Table 2).
The yield traits presented moderate to high variation and ranged from 9.63 (FWI) to 47.45 (FWP). The fruit weight per plant (FWP) ranged between 22.44 and 8304.51 g per plant, the number of fruits (NF) ranged between 81.94 to 1471.79 fruits per plant, and the weight of fruits with and without calyx ranged between 0.15 to 0.26 and 9.24 to 10.69 g per fruit, respectively (Table 2). The accessions 09U033_1 and 09U277_5 showed a FWP greater than 7,500 g/plant, higher than the Kenyan and Colombian ecotypes, considered as references (Table S2). For the FWI and FWII traits, the haploid accessions 12U398_1, 09U294_6, 12U366_1, 14U447_1, 14U425_1, and 09U295_4 represented outlier values since these accessions exhibited a value less than 0.26 g, as compared to a mean value of 4.73 g for the entire population. In contrast, the accessions 09U134_3, 14U426_1, and 14U426_2 exhibited a value greater than 7.8 g, the first one corresponded to an accession that originated from Nepal and the last two correspond to Kenyan accessions. The accessions that showed the highest NF were 09U033_1 and 09U277_5, both of which exhibited more than 1,400 fruits, while the accessions 14U449_1 and 12U398_1 showed less than 105 fruits. The Pearson correlation coefficients (r) among the yield traits showed a positive value for FWP, with NF (r = 0.75), FWI (r = 0.59), and FWII (r = 0.58) having significant p-values > 0.0001 (Table S3).
The fruit size characteristics showed moderate variation ranging between 4.50% (MH) and 9.75% (FA) (Table 2). The WMH and HMW showed similar values to the WH and MH, respectively, suggesting that the traits did not provide additional relevant information. The accessions 09U134_3 and 09U282_3 showed extreme values for these traits, exhibiting a width and height greater than 2.5 and 2.2 cm, respectively (Table S2). In contrast, the haploid accessions presented the lowest values, less than 1.0 and 1.3 cm for width and height, respectively. The Pearson coefficient showed a high positive correlation for fruit perimeter (FP) with FA, WMH, MW, HMW, and MH (r ≥ 0.96). Similarly, the FP showed high positive correlations with FWI-FWII (r = 0.90) and a moderate positive correlation with FWP (r = 0.49).
The fruit shape traits showed moderate variation and most of the traits ranged between 3.17% (FSI) and 9.51% (OVO), but the DI and OBO showed 42.65 and 71.34% CV (Table 2). The FSI and FSII ranged from 0.68 to 1.06, indicating that the fruits showed a shape from round to elongated. Only the fruits of the haploid accessions exhibited a strong indentation area, exceeding 0.13 cm2, while accession 14U449_1 showed a low indentation area of 0.04 cm2 (Table S2). The obovoid asymmetry was found in 6% of the accessions, corresponding to haploid accessions with FSI-FSII < 0.8; while the ovoid asymmetry was found in 94% of the accession with FSI-FSII > 0.8. An exception was accession 14U449_1 with ovoid asymmetry demonstrated by FSI-FSII < 0.8 characteristics of an oblate shape. The Pearson coefficient showed a high negative correlation of OVO with DI and OBO (r ≥ 0.88) and moderate positive correlation with FSII (r = 0.53). Likewise, OVO showed high positive correlation with FWI-FWII (r = 0.77) and the fruit size traits: FP, FA, WMH, MW, HMW, and MH (r ≥ 0.82); moderate positive correlation with FIR (r = 0.67) and high negative correlation with CF (r = −0.71) (Table S3).
For the fruit quality traits, the percentage of cracked fruits (CF) showed the highest variation and ranged from 0 to 59.27% (Table 2). The accessions 09U026_1, 09U187_4, 09U131_3, 14U426_1, and 09U280_3 were uniform and exhibited values less than 0.2%, while accessions derived from anther culture exhibited high variation, generally greater than 37% (Table S2). The haploid accessions exhibited low firmness with less than 1.2 lb-f, an unsurprising result since this type of fruit has no seeds and no flesh at all. The accessions 09U026_1, 09U187_4, 09U131_3, 14U426_1, and 09U280_3 showed low CF and firmness greater than 1.5 lb-f. The trait SST showed the lowest variation and ranged from 13.13 to 17.27 °Brix. The accessions 12U398_1, 09U130_2, 14U420_1, 12U350_1, and 12U347_1 showed more than 16 °Brix and most of them were generated by anther culture technology, except accession 09U130_2. In contrast, the wild accession 09U193_1 showed only 13.1 °Brix. The FIR was negatively correlated with CF (r = −0.70), and had a moderate positive correlation with the yield traits: FWP, FWI, and FWII (r = 0.54), size of fruit traits: FP, FA, WMH, MW, HMW, and MH (r ≥ 0.57) and a positive correlated with OVO (r = 0.67) (Table S3). The broad-sense heritability () of the traits ranged from 58.70% for NF to 99.50% for DI.
Based on the PCA, the first four principal components had eigenvalues >1 and contributed 89.87% of the total cumulative variability among the different accessions. The first principal component (PC1), was represented mainly by the fruit weight and size traits (FWI, FWII, FP, FA, MW, WMH, MH, and HMW); fruit cracking and firmness explained ~63% of the observed variation and could be useful for selection schemes in cape gooseberry breeding. The PC2, represented by the number of fruits (NF) and fruit shape index I (FSI), explained 12.6% of the observed variation; whereas, PC3, primarily the fruit yield per plant (FWP) and fruit shape index I (FSII), explained 9% of the observed variation. The soluble solids concentration contributed strongly to PC4 and explained 5.3% (Table 3).
The cluster analysis grouped 100 cape gooseberry accessions into four groups, as shown in Figure 1. Group-I was comprised of 6 accessions, followed by 18, 59, and 17 accessions respectively in group-II, III, and group-V. The accessions in group-I presented small fruits (mean 0.80 g), low fruit firmness (mean 0.97 lb-f), and high fruit cracking (mean 36.88%), as compared to all other groups and it was represented by haploid accessions. Additionally, this group presented distal indentation and obovoid asymmetry. The second group (II) presented the largest fruits (mean 6.69 g), moderate fruit firmness and cracking (mean 1.57 lb-f and 9.15%, respectively). The fruit of these accessions had an ovoid shape and were collected mainly from Boyacá, Cundinamarca, and Nariño in Colombia and from international repositories in Denmark, France, Nepal, and Ecuador. The third group (III) consisted of 59 landraces from Antioquia, Boyacá, Caldas, Cundinamarca, Nariño, and Valle and was divided into two sub-groups. The first sub-group of accessions was derived from anther culture or collected from South Africa. These accessions presented intermediate-high fruit weight (mean 6.29 g), moderate fruit firmness, low fruit cracking (mean 1.75 lb-f and 2.55%, respectively) and an ovoid fruit shape. The second sub-group had seven accessions from Cundinamarca, Nariño, Boyacá, and Valle in Colombia, showed the lowest cracking percentage (<4%) with a production higher than 7,000 g/plant and are likely to be useful in cape gooseberry breeding programs. The fourth group (IV) presented an intermediate fruit weight (mean 4.61 g), moderate fruit firmness (mean 1.69 lb-f), low fruit cracking (mean 5.73%) and an ovoid fruit shape. These accessions are from Antioquia, Boyacá, Cundinamarca, Nariño, Norte de Santander, and Valle.
Figure 1. Cluster dendrogram of cape gooseberry collection using 18 phenotypic traits based on Ward method.
Genotyping and Population Structure
A total of 225,161,229 reads were obtained with an average sequence length of 101 bp and phred quality score > 26. After filtering, we excluded accession 09U039-1 because of low-quality data. We identified 27,982, 36,142, and 30,344 SNPs for cape gooseberry leaf and root transcriptomes and the tomato genome had <4.4% of missing data (Table 4). The heterozygosity observed was found to be high with HO = 0.725, and the average of HE was 0.44, determined according to Nei (1973). The PIC, an estimate of the relative informativeness of each genetic marker, averaged 0.342.
The genetic structure of the entire population was assessed using the Admixture software and PCA. The results are presented for analysis using the set of 30,344 polymorphic SNP markers identified with the tomato genome as a reference because of the similar results of genetic diversity of this set when compared with the cape gooseberry transcriptomes. Additionally, with this strategy, one better understands the function of the associated markers because many tomato coding genes are well-reported and annotated with their biological functions. The optimal K of the population, inferred according to the cross-validation error, indicated that K = 2 and K = 3 can be the best number of sub-populations (Figure S1). For K = 2 (Figure 2A), the accessions were sub-divided into wild and a second sub-population that included landraces and anther culture accessions. The wild sub-population consisted of 33 accessions from the Colombian departments Antioquia, Boyacá, Nariño, Valle del Cauca, and international repositories in Denmark, Ecuador, France, Nepal, and South Africa. The second sub-population consisted of 66 accessions, most of which were landraces from the Colombian departments Antioquia, Boyacá, Caldas, Cundinamarca, Nariño, and Norte de Santander. Additionally, the accessions obtained with the in vitro anther culture clustered in this group. When the number of sub-populations increased from two to three (K = 3) (Figure 2B), the population was sub-divided into wild, landraces, and anther culture accessions. The wild group consisted of the same 33 accessions identified in K = 2. The landrace group consisted of 42 accessions from Antioquia, Boyacá, Caldas, Cundinamarca, Nariño, and Norte de Santander, while the anther culture accessions consisted of 24 accessions mostly from Boyacá and Cundinamarca.
Figure 2. Population structure of cape gooseberry collection based on 30,344 SNPs markers. Inferred population structure of the cape gooseberry collection using the tomato SNPs matrix. Bar plot for K = 2 (A) and K = 3 (B) grouped by state of cultivation and the bar length represent the membership probability of each accessions belonging to different sub-populations. (C) Scatterplot of Principal Component Analysis scores of components PC1, PC2 and PC3 based on 30,344 SNP markers. (D) NJ-based dendrogram with cape gooseberry SNPs clustered into four sub-populations. Colors correspond to each sub-population which consisted of: mostly wild accessions (I-green), mostly AC accessions (II-red), mix of CA accessions and landraces (III-violet), and only landraces (IV-red).
PCA was used to corroborate the sub-populations of the collection obtained by the Admixture software analysis. A three-dimensional scatter plot involving 99 accessions showed that the first three PCA axes accounted for 8.2, 2.3, and 2.0% of the genetic variation among populations, respectively. These results confirmed the separation of the accessions into three sub-populations: mostly wild, landraces and the accessions of anther culture (Figure 2C). The anther culture accessions were grouped in the PCA analysis probably because of the presence of homozygous loci that differentiate them from the landraces.
The NJ-based dendrogram showed that some accessions concurred with the geographical origin of the accessions (Figure 2D). The first group (in green) mostly contained wild accessions (47) from Nariño, Antioquia, Valle del Cauca, Boyacá, Cundinamarca, and Norte de Santander in Colombia, and the accessions from the international repositories. The second group (in red) contained 21 accessions, including the anther culture accessions and some landraces, such as the Colombian ecotype (14U424_2) and the accessions 09U292_2, 09U292_3, 09U293_2 09U295_1, and 09U296_1 from the Boyacá Department. The third group (violet) encompassed 23 accessions with a mixture of landraces from Boyacá, Cundinamarca, Antioquia, and some anther culture accessions. The fourth group (in blue) contained eight accessions with only landraces from Boyacá, Nariño, and Antioquia of Colombia, the main producing areas in the country.
Based on the FST values for the whole collection, we did not find a high level of differentiation between the wild and cultivated sub-populations (FST = 0.028). When wild, landraces, and anther culture sub-populations were compared based on pairwise FST values, the anther culture and wild populations were more distinct (0.044), followed by wild and landraces (0.032) while anther culture and landraces showed 0.031. The landraces maintained similar alleles in the wild population and the anther culture were derived from the landraces and maintained genetic similarity.
Association Analyses
The association analysis was carried out for 93 accessions because accession 09U039-1 presented low quality in the sequence data (mnSCov < 0.7). Besides, the accessions 12U398_1, 09U294_6, 12U366_1, 14U447_1, 14U425_1, and 09U295_4 presented a gametic chromosome number (n = 24 chromosomes) and the ploidy is positively correlated with fruit size according to Chevalier et al. (2014). On the other hand, the AM was carried out in 10 of the 18 traits because some traits were associated with the same SNP markers. For this reason, we grouped the traits as follows: FWI and FWII were combined in Fruit Weight (FW), WMH and MW were combined in Fruit Width (FWD), and HMW and MH were combined in Fruit Height (FH) (Table 5). In addition, the traits of fruit shape category (DI, FSI, FSII, OVO, and OBO) were not used for the association analysis because the genotypes were very homogenous for this character (93% presented ovoid fruit shape).
Table 5. Association statistics of most significantly associated loci with cape gooseberry traits using K = 2 and K = 3 sub-populations inferred by Admixture.
Analysis of LD decay was not carried out because the reference genome was unavailable for the species. The LD was estimated using the squared correlation (r2) from pairs of all SNP markers without the LD filter being specified and using the tomato genome as a reference. A total of 38,884 pairs of markers showed a significant LD value with an average of 0.008 and, from these, 13,184 pairs of markers showed an r2 ≤ 0.01.
The CP and FH traits that displayed non-normal distribution were transformed with the Box-Cox transformation procedure to improve sensitivity and to avoid false positives in small sample sizes, according to Goh and Yap (2009). According to the kinship analysis based on the Loiselle logarithm, the accessions were unrelated. Using a significance threshold of –log10(p) ≥ 4.0, after the FDR correction, we did not identify any significant association. Considering this aspect, we reduced the threshold parameter to LOD Score = 4.0, supported by the Q-Q plots that evidenced the association of the SNP-trait with lower but still significant p-values (p ≤ 1.0E-04) before the FDR correction (Figure 3, Figure S2). The significant associations detected by the MLM were visualized in a Manhattan plot (Figure 4, Figure S3).
Figure 3. Quantile-quantile (Q-Q) plot for fruit weight and fruit firmness. Q-Q plots showing the ratio of the observed p-values (blue dots) compared to the expected p-value distribution (red line) for (A) fruit weight and (B) fruit firmness.
Figure 4. Manhattan plots showing significant associations for fruit weight and fruit firmness. Chromosome number are displayed along the X-axis and the negative log10 of the association p-value for each SNP on the Y-axis. Higher negative log10 indicates stronger association with the trait. (A) fruit weight and (B) fruit firmness.
Using the Q-matrix for both K = 2 and K = 3, we found 34 unique SNPs, which mapped to 21 distinct tomato genes with p-values ≤ 1.0E-04 (Table 5). The largest association numbers were detected for the FW, FA, and FWD with 7, 10, and 12 associated markers, respectively. For yield traits, 10 SNP markers were located inside nine genes and one SNP (S01_149166) was located nearby of Solyc01g005190.1 gene. These markers explained between 17.0 and 23.6% of the phenotypic variation and were located on chromosomes 1, 2, 3, 6, 11, and 12 in the tomato. For the fruit size traits, 19 SNP markers were identified that explained between 16.8 and 21.3% of the phenotypic variation and were located inside 14 genes on chromosomes 2, 3, 4, 5, 6, 9, 10, 11, and 12. Finally, a total of 10 SNP markers located in four genes were identified as significantly associated with fruit quality. These markers explained between 12.5 to 19.9% of the phenotypic variation and were located on chromosomes 1, 6, 7, 8, and 11 (Table 5).
Some phenotype/genotype associations were related to multiple traits. For K = 2, four markers: S02_44121109 (Solyc02g079590.2), S03_70268245 (Solyc03g123410.1), S11_1524907 (Solyc11g007040.1), and S12_6422882 (Solyc12g017230.1) showed an association with both fruit weight and fruit size. Marker S03_52616353 (Solyc03g082690.2) was associated with three characteristics: fruit weight per plant (FWP), fruit weight and fruit size (as area, perimeter, height, and width). For K = 3, the SNP markers S03_70268245, S06_2049586 (Solyc06g008160.2), and S12_6422882 showed association with the fruit weight and fruit size, as did marker S03_52616353, but with no significant association with the FWP.
Discussion
Highly significant differences (p ≤ 0.0001) were observed between the accessions for all of the studied traits (Table 1). These results agree with the conclusions of Herrera et al. (2011) who reported significant differences for yield and the average weight of fruit with calyx (p ≤ 0.0001). Earlier studies failed to identify significant differences in the number of fruits per plant although this observation was probably due to differences in the number of accessions evaluated. The fruit shape traits (DI, FSI, FSII, OVO, and OBO) showed no significant differences between the replications, probably suggesting that fruit shape was not greatly influenced by environmental factors. Similar results have been reported by Liu et al. (2017) in the FSI of the tomato. The CF and FIR also showed no significant differences between replications, but significant differences between accessions, suggesting a genetic effect contribution to these traits, as has been demonstrated in the tomato (Mustafa et al., 2017). The development of cracking-resistant varieties can be an effective solution for the cape gooseberry.
The coefficient of variation for the major traits ranged from 3.47 to 10.54% for traits related to fruit size and weight. The high variability observed in the FWP and NF is similar to the findings of Herrera Moreno et al. (2012) and could be explained by the relationship between yield and other variables such as length and the number of internodes in productive shoots. Fruit cracking was the most variable trait between accessions, possibly due to differences in the shape and arrangement of sub-epidermis cells of the fruits of the different accessions, as reported for the sweet cherry (Demirsoy and Demirsoy, 2004), or cuticular membrane thickness, as reported for the tomato (Matas et al., 2004).
The more important correlations were CF-FWP, CF-FWI-FWII, and CF-OVO, which showed a strong negative relationship between the cracking and size, weight, and asymmetry of the fruits. This observation suggests that the phenomenon of fruit cracking may not result from quick filling. Not all of the accessions showed the same percentage of fruit cracking as has been reported by Herrera et al. (2011) who reported differences in the percentage of cracking in 54 accessions of cape gooseberry. Our results supported the hypothesis that cracking in the cape gooseberry involves a genetic component and genetic variation that may permit breeding progress as suggested by Cooman et al. (2005).
According to Singh (2001), the heritability of FWP and NF were medium and moderately high, respectively, and the other traits were very high. This high heritability indicates a small contribution of environmental factors to the phenotype. High to medium estimates of broad sense heritability have also been reported by Leiva-Brondo et al. (2001) for yield, fruit weight, fruit shape (length/width), soluble solids content, titratable acidity, and ascorbic acid content.
The PCA clustering for the first four principal components explained 89.87% of total variance. The weight and size traits and fruit firmness and cracking percentage showed the highest contribution. Similar results were found in the earlier analysis of Herrera Moreno et al. (2012) who reported that the first four PCs explained 70.19% of the variance and were related to the physical aspects of fruits, such as weight, volume, and diameter. Similar results were found by Morillo et al. (2011) who reported that the first three PCs explained 81.75% of the variance and were represented by measures of fruit size. In this study, we identified numerous wild and cultivated accessions with desirable horticultural characteristics, such as high yield and fruit quality, highlighting seven accessions with less than 4% cracking fruit percentage and high yield that have been included in breeding programs for developing varieties from recurrent selection schemes.
The accessions 12U398_1, 09U294_6, 12U366_1, 14U447_1, 14U425_1, and 09U295_4 showed the lowest FW, haploid accessions with n = 24 chromosomes and ploidy that can affect the fruit size (Chevalier et al., 2014), explaining these results. In contrast, the accessions that have shown the highest values are in foreign and previous studies, reporting that African accessions produce bigger fruits than Colombian ones (Fischer et al., 2007). The accessions 09U033_1 and 09U277_5 showed the highest FWP and NF, indicating that the FWP was influenced by the number of fruit and not by fruit weight.
We reported the identification of 27,982 SNPs in the cape gooseberry using the software Tassel v5.0. Enhanced SNP discovery, SNP quality, and production steps and some optimization of parameters improved the SNP detection over the 1,739 SNPs previously reported (Osorio-Guarín et al., 2016). Based on the SNPs identified, the mean expected heterozygosity value was lower than the observed heterozygosity, indicating an excess of heterozygotes probably because of high rates of cross-pollination of the species, around 54% (Lagos et al., 2008). Our results are similar to the study published by Berdugo et al. (2015) who reported values of HE = 0.44, HO = 0.73, and PIC = 0.35 for a collection of parents, intra, and interspecific hybrids for P. peruviana and P. floridana evaluated with COSII and IRGs markers. Similarly, Garzón-martínez et al. (2015), using 47 P. peruviana accessions analyzed with SNP markers, found a mean value of HE = 0.41 and PIC = 0.32. However, Garzón-martínez et al. (2015) reported a value of HO = 0.59 and Osorio-Guarín et al. (2016) reported diversity values of HE = 0.665, HO = 0.431, and PIC = 0.344 using 100 accessions analyzed with 1,739 SNP markers. The discrepancies could be due to differences in the population studied and the use of a low number of SNP markers compared with this study.
Again, the variability in the collection and markers used likely explains the slight differences compared with this study. In general, the collection showed a low genetic differentiation, possibly because of allogamy of the species as reported by Silvertown and Charlesworth (2009). Low differentiation, high genetic variation, and an excess of heterozygotes are characteristics of outbreeding populations. Our results are similar to those reported by Garzón-martínez et al. (2015) who found FST (0.038) values for P. peruviana and related taxa populations using SNP markers. Likewise, Chacón et al. (2016) found a low genetic differentiation (FST = 0.058) between cultivated and non-cultivated populations using SSR markers in 345 cape gooseberry accessions. In contrast, Osorio-Guarín et al. (2016) found a high FST value (0.3507) when analyzing 100 accessions with SNP markers. This strong discrepancy may be explained by the differences in accessions used and the number of loci analyzed.
The overall level of detected LD was low, which indicates high recombination, as expected in allogamous and partially allogamous species (Rafalski and Morgante, 2004). Estimates of cross-pollination in P. peruviana exceed 52%, which would support rapid LD decay (Rafalski, 2010). A total of 34 marker-trait associations were identified. Many associations with the FW, FA, and FWD were found but only two associations for the FWP, NF, and SST. FWP is a quantitatively inherited trait and it tends to correlate with the number of fruits produced by each plant and fruit weight. For yield, the tomato gene Solyc03g082690.2, associated with FWP and FW, is related to the U-box domain-containing protein involved in cellular processes, including cell cycle regulation, vesicle-mediated protein transport, protein folding, and protein degradation (Azevedo et al., 2001). The gene Solyc01g005190.1, associated with the FW, is related to the zinc fingers protein involved in early fruit development, as reported in the tomato (Aiese Cigliano et al., 2013) and nicotiana fruit (Wu et al., 2014).
The associations identified for the fruit size traits were distributed over nine chromosomes (no associations detected on chromosomes 1, 7, and 8 chromosomes). The gene Solyc04g012040.2, associated with the FWD and FA, is annotated as a 26S proteasome, involved in protein degradation and the balancing of cell expansion with cell proliferation rates as has been reported by Kurepa et al. (2009) in Arabidopsis. The gene Solyc09g018790.2, associated with both FWD and FA, is annotated as a Gamma hydroxybutyrate dehydrogenase-like protein, which is involved in multiple physiological responses and plays an important role during early fruit development in the tomato (Fait et al., 2008; Takayama and Ezura, 2015). Furthermore, the gene Solyc11g006940.1, coding a pentatricopeptide repeat protein, is involved in plant growth and development (Sharma and Pandey, 2015).
In general, fruits with an ovoid shape are more desirable and a higher FWP and lower CF. FIR and CF are important for increasing the shelf-life of this fruits and preventing the early appearance of fungal and bacterial diseases that alter the organoleptic characteristics. For fruit cracking, we found an association with the gene Solyc06g073100.2 which is annotated as a lipase and esterase enzyme involved in the deposition of the cutin polyester in the tomato fruit cuticle (Girard et al., 2012). In contrast, the fruit firmness was associated with the gene Solyc07g043610.2, which is annotated as an auxin response factor involved in the control of the ripening process and fruit firmness in the tomato (Hao et al., 2015; Breitel et al., 2016). For the SST, the gene Solyc06g071080.2 is related to the proton-dependent oligopeptide transport family protein involved as a nitrate transporter (Tsay et al., 2007). It is tempting to think of these associations as causal though further research would be needed to establish such a relationship.
Co-localized associations for the FWP, FW, FA, FP, FH, and FWD were identified. Such co-localization might be related to the pleiotropic effects of the genes or result from genetic linkage. Similar results have been reported in the tomato for fruit traits such as soluble solids and sugar content, titratable acidity, fruit weight, and locule number (Xu et al., 2013), in rice for flowering and yield (Zhao et al., 2011) and in the cape gooseberry for response to Fusarium oxysporum (Osorio-Guarín et al., 2016). Based on the high synteny reported between members of the Solanaceae family (Wang et al., 2008), we assume that the function of some genes found in the tomato should be conserved in the cape gooseberry. The detected associated markers could be then recommended for fruit yield and size improvement in cape gooseberry breeding programs after functional confirmation.
Conclusions
The association mapping population used in this study presented high phenotypic and genetic variability that can be exploited in plant breeding programs. The results allowed for the identification of promising material for breeding programs with a high FWP and low CF. Fruit cracking and lack of firmness might be related to genetic events since the results showed clear differences among the accessions. This could be useful for exhaustive studies on the heritability and genetic architecture of these traits in breeding. Our findings suggested that using SNP markers and the mixed linear model were suitable for detecting significant associations and allowed for the detection of 34 associations for the main cape gooseberry fruit traits. The important correlation of the FH and FWD on FWP and the co-location of one associated SNP suggest that fruit size SNPs can have a strong effect on the yield of the cape gooseberry. Furthermore, novel SNP markers for yield (FWP, NF, and FW), fruit size (FP, FA, FWD, and FH), and fruit quality (FIR, CF, and SST) were found, and it should be noted that this study is an important contribution to the knowledge on the genetic basis of some traits in the cape gooseberry.
Author Contributions
VN conceived the study. FG-A and JO-G analyzed the data. FG-A prepared the manuscript and JO-G edited. All authors read and approved the final version of the manuscript.
Funding
This work was supported by the Colombian Ministry of Agriculture Agreement TV16.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors would like to acknowledge Felipe Acuña, Efrain Acuña, and Sara Torres for their help in planting and fruit harvest for the analyses. We would like to thank Jorge Arguelles to support the phenotypic data analysis and David Francis to read the manuscript and give suggestions.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2018.00362/full#supplementary-material
References
Abe, A., Kosugi, S., Yoshida, K., Natsume, S., Takagi, H., Kanzaki, H., et al. (2012). Genome sequencing reveals agronomically important loci in rice using MutMap. Nat. Biotech. 30, 174–178. doi: 10.1038/nbt.2095
Agronet (2016). Reportes Estadísticos. Available online at: http://www.agronet.gov.co/estadistica/Paginas/default.aspx
Aiese Cigliano, R., Sanseverino, W., Cremona, G., Ercolano, M. R., Conicella, C., and Consiglio, F. M. (2013). Genome-wide analysis of histone modifiers in tomato: gaining an insight into their developmental roles. BMC Genomics 14:1. doi: 10.1186/1471-2164-14-57
Alexander, D. H., and Novembre, J. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664. doi: 10.1101/gr.094052.109
Azevedo, C., Santos-Rosa, M. J., and Shirasu, K. (2001). The U-box protein family in plants. Trends Plant Sci. 6, 354–358. doi: 10.1016/S1360-1385(01)01960-4
Barone, A., Chiusano, M. L., Ercolano, M. R., Giuliano, G., Grandillo, S., and Frusciante, L. (2008). Structural and functional genomics of tomato. Int. J. Plant Genomics 2008:820274. doi: 10.1155/2008/820274
Benjamini, Y., and Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. 57, 289–300.
Berdugo, J., Rodríguez, F., González, C., and Barrero, L. (2015). Variabilidad genética de parentales y poblaciones F1 inter e intraespecíficas de Physalis peruviana L. y P. floridana Rydb. Rev. Bras. Frutic. 37, 179–192. doi: 10.1590/0100-2945-002/14
Bonilla, M. H., Arias, P. A., Landínez, L. M., Moreno, J. M., Cardozo, F., and Suárez, M. S. (2009). Agenda Prospectiva de Investigación y Desarrollo Tecnológico Para la Cadena productiva de la Uchuva en Fresco Para Exportación en Colombia. Minist. Agric. y Desarro. Rural. Proy. Transic. La Agric. - Univ. Nac. Colomb. Corporación Colomb. Investig. Agropecu. – Corpoica- Bogotá., 151.
Bradbury, P. J., Zhang, Z., Kroon, D. E., Casstevens, T. M., Ramdoss, Y., and Buckler, E. S. (2007). TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635. doi: 10.1093/bioinformatics/btm308
Breitel, D. A., Chappell-Maor, L., Meir, S., Panizel, I., Puig, C. P., Hao, Y., et al. (2016). AUXIN RESPONSE FACTOR 2 intersects hormonal signals in the regulation of tomato fruit ripening. PLoS Genet. 12:e1005903. doi: 10.1371/journal.pgen.1005903
Chacón, M., Sánchez, P., and Barrero, L. S. (2016). Genetic structure of a Colombian cape gooseberry (Physalis peruviana L.) collection by means of microsatellite markers Estructura genética de la colección colombiana de uchuva (Physalis peruviana L.) por medio de microsatélites. Agron. Colomb. 34, 5–16. doi: 10.15446/agron.colomb.v34n1.52960
Chevalier, C., Bourdon, M., Pirrello, J., Cheniclet, C., Gévaudant, F., and Frangne, N. (2014). Endoreduplication and fruit growth in tomato: evidence in favour of the karyoplasmic ratio theory. Exp. Bot. 65, 2731–2746. doi: 10.1093/jxb/ert366
Chhetri, M., Bariana, H., Wong, D., Sohail, Y., Hayden, M., and Bansal, U. (2017). Development of robust molecular markers for marker-assisted selection of leaf rust resistance gene Lr23 in common and durum wheat breeding programs. Mol. Breed. 37:21. doi: 10.1007/s11032-017-0628-6
Cooman, A., Torres, T., and Fischer, G. (2005). Determinación de las causas del rajado del fruto de uchuva (Physalis peruviana L.) bajo cubierta II. Efecto de la oferta de calcio, boro y cobre. Agron. Colomb. 23, 74–82. Available online at: https://revistas.unal.edu.co/index.php/agrocol/article/view/19919
Cotes, A. M., Jiménez, P., Rodríguez, M. X., Díaz, A., Zapata, J., Gomez, M., et al. (2012). Estrategias de Control Biológico de Fusarium Oxysporum en el Cultivo de Uchuva (Physalis peruviana). Bogotá: Corpoica.
Criollo, H., Lagos, T., Fischer, G., Mora, L., and Zamudio, L. (2014). Comportamiento de tres genotipos de uchuva (Physalis peruviana L.) bajo diferentes sistemas de poda. Rev. Colomb. Cienc. Hortíc. 8, 34–43. doi: 10.17584/rcch.2014v8i1.2798
Demirsoy, L., and Demirsoy, H. (2004). The epidermal characteristics of fruit skin of some sweet cherry cultivars in relation to fruit cracking. Pak. J. Bot. 36, 725–731. Available online at: http://www.pakbs.org/pjbot/PDFs/36(4)/PJB36(4)725.pdf
Elshire, R. J., Glaubitz, J. C., Sun, Q., Poland, J. A., Kawamoto, K., Buckler, E. S., et al. (2011). A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6:e19379. doi: 10.1371/journal.pone.0019379
Fait, A., Fromm, H., Walter, D., Galili, G., and Fernie, A. R. (2008). Highway or byway: the metabolic role of the GABA shunt in plants. Trends Plant Sci. 13, 14–19. doi: 10.1016/j.tplants.2007.10.005
Favoretto, P., da Silva, C. C., Tavares, A. G., Giatti, G., Moraes, P. F., Lobato, M. T. V., et al. (2017). Assisted-selection of naturally caffeine-free coffee cultivars—characterization of SNPs from a methyltransferase gene. Mol. Breed. 37:31. doi: 10.1007/s11032-017-0636-6
Fernandez-Pozo, N., Menda, N., Edwards, J. D., Saha, S., Tecle, I. Y., Strickler, S. R., et al. (2015). The Sol Genomics Network (SGN)-from genotype to phenotype to breeding. Nucleic Acids Res. 43, D1036–D1041. doi: 10.1093/nar/gku1195
Fischer, G. (2005). “El problema del rajado del fruto de la uchuva y su posible control,” in Avances en Cultivo, Poscosecha y Exportación de la Uchuva Physalis peruviana L. en Colombia, eds G. Fischer, D. Miranda, W. Piedrahita, and J. Romero (Bogotá: Universidad Nacional de Colombia), 329–338.
Fischer, G., Ebert, G., and Lüdders, P. (2007). Production, seeds and carbohydrate contents of cape gooseberry (Physalis peruviana L.) fruits grown at two contrasting Colombian altitudes. J. Appl. Bot. Food Qual. 81, 29–35.
Flint-Garcia, S. A., Thornsberry, J. M., and Buckler, E. S. IV. (2003). Structure of linkage disequilibrium in plants. Annu. Rev. Plant Biol. 54, 357–374. doi: 10.1146/annurev.arplant.54.031902.134907
Garzón-martínez, G. A., Osorio-guarín, J. A., Delgadillo-durán, P., Mayorga, F., Enciso-rodríguez, F. E., Landsman, D., et al. (2015). Genetic diversity and population structure in Physalis peruviana and related taxa based on InDels and SNPs derived from COSII and IRG markers. Plant Gene 4, 29–37. doi: 10.1016/j.plgene.2015.09.003
Garzón-Martínez, G. A., Zhu, Z. I., Landsman, D., Barrero, L. S., and Mariño-Ramírez, L. (2012). The Physalis peruviana leaf transcriptome: assembly, annotation and gene model prediction. BMC Genomics 13:151. doi: 10.1186/1471-2164-13-151
Girard, A. L., Mounet, F., Lemaire-chamley, M., Elmorjani, K., Vivancos, J., Runavot, J., et al. (2012). Tomato GDSL1 is required for cutin deposition in the fruit cuticle. Plant Cell 24, 3106–3121. doi: 10.1105/tpc.112.101055
Goh, L., and Yap, V. B. (2009). Effects of normalization on quantitative traits in association test. BMC Bioinformatics 10:415. doi: 10.1186/1471-2105-10-415
Hao, Y., Hu, G., Breitel, D., Liu, M., Mila, I., Frasse, P., et al. (2015). Auxin response factor SlARF2 is an essential component of the regulatory mechanism controlling fruit ripening in tomato. PLoS Genet. 11:e1005649. doi: 10.1371/journal.pgen.1005649
Heo, M.-S., Han, K., Kwon, J.-K., and Kang, B.-C. (2017). Development of SNP markers using genotyping-by-sequencing for cultivar identification in rose (Rosa hybrida). Hortic. Environ. Biotechnol. 58, 292–302. doi: 10.1007/s13580-017-0268-0
Herrera Moreno, A. M., Fischer, G., and Chacón Sánchez, M. I. (2012). Agronomical evaluation of cape gooseberries (Physalis peruviana L.) from central and north-eastern Colombia (Evaluación agronómica de materiales de uchuva (Physalis peruviana L.)). Agron. Colomb. 30, 15–24. Available online at: https://revistas.unal.edu.co/index.php/agrocol/article/view/22440
Herrera, A. M., Ortiz, J. D., Fischer, G., and Chacón, M. (2011). Behavior in yield and quality of 54 cape gooseberry (Physalis peruviana L.) accessions from north-eastern Colombia (Comportamiento en producción y calidad de 54 accesiones de uchuva (Physalis peruviana L.) provenientes del nor-oriente colombiano). Agron. Colomb. 29, 189–196. Available online at: https://revistas.unal.edu.co/index.php/agrocol/article/view/29027
Instituto Colombiano de Normas Técnicas-Icontec (1999). Frutas frescas: Uchuva. Especificaciones. Norma Técnica Colombiana NTC 4580. Bogotá.
Kobayashi, F., Tanaka, T., Kanamori, H., Wu, J., Katayose, Y., and Handa, H. (2016). Characterization of a mini core collection of Japanese wheat varieties using single-nucleotide polymorphisms generated by genotyping-by-sequencing. Breed. Sci. 66, 213–225. doi: 10.1270/jsbbs.66.213
Kurepa, J., Wang, S., Li, Y., Zaitlin, D., Pierce, A. J., and Smalle, J. A. (2009). Loss of 26S Proteasome function leads to increased cell size and decreased cell number in Arabidopsis. Plant Physiol. 150, 178–189. doi: 10.1104/pp.109.135970
Lagos, T., Vallejo Cabrera, F. A., Criollo Escobar, H., and Muñoz Flórez, J. E. (2008). Biología reproductiva de la uchuva. Acta Agron. 57, 81–87. Available online at: https://revistas.unal.edu.co/index.php/acta_agronomica/article/view/1346
Langmead, B., and Salzberg, S. (2012). Fast gapped-read alignment with Bowtie 2. Nat. methods 9, 357–359. doi: 10.1038/nmeth.1923
Leiva-Brondo, M., Prohens, J., and Nuez, F. (2001). Genetic analyses indicate superiority of performance of cape gooseberry (Physalis peruviana L.) hybrids. J. New Seeds 3, 1–35. doi: 10.1300/J153v03n03_04
Liberato, S., Sánchez-Betancourt, E., Argüelles, J., González, C., Núñez, V., and Barrero, L. S. (2015). Cytogenetic of Physalis peruviana L., and Physalis floridana Rydb. Genotypes with differential response to Fusarium oxysporum. Corpoica Cienc. Tecnol. Agropecu. 15, 51–61. doi: 10.21930/rcta.vol15_num1_art:396
Lipka, A. E., Tian, F., Wang, Q., Peiffer, J., Li, M., Bradbury, P. J., et al. (2012). GAPIT: genome association and prediction integrated tool. Bioinformatics 28, 2397–2399. doi: 10.1093/bioinformatics/bts444
Liu, K., and Muse, S. V. (2005). PowerMaker: an integrated analysis environment for genetic maker analysis. Bioinformatics 21, 2128–2129. doi: 10.1093/bioinformatics/bti282
Liu, X., Geng, X., Zhang, H., Shen, H., and Yang, W. (2017). Association and genetic identification of loci for four fruit traits in tomato using InDel markers. Front. Plant Sci. 8:1269. doi: 10.3389/fpls.2017.01269
Liu, Y., Nyunoya, T., Leng, S., Belinsky, S. A., Tesfaigzi, Y., and Bruse, S. (2013). Softwares and methods for estimating genetic ancestry in human populations. Hum. Genomics 7:1. doi: 10.1186/1479-7364-7-1
Loiselle, B. A., Sork, V. L., Nason, J., and Graham, C. (1995). Spatial genetic structure of a tropical understory shrub, Psychotria officinalis (Rubiaceae). Am. J. Bot. 82, 1420–1425. doi: 10.1002/j.1537-2197.1995.tb12679.x
Matas, A. J., Cobb, E. D., Paolillo, D. J., and Niklas, K. J. (2004). Crack resistance in cherry tomato fruit correlates with cuticular membrane thickness. Hortscience 39, 1354–1358. Available online at: http://hortsci.ashspublications.org/content/39/6/1354.abstract
Morillo, A., Villota Cerón, D., Lagos Burbano, T., and Ordóñez Jurado, H. (2011). Caracterización Morfológica y Molecular de 18 Introducciones de Uchuva Physalis peruviana L. de la Colección de la Universidad de Nariño. Rev. Fac. Nal. Agr. Medellín 64, 6043–6053. Available online at: http://www.scielo.org.co/pdf/rfnam/v64n2/v64n2a02.pdf
Mustafa, M., Syukur, M., and Hadi Sutja, S. (2017). Inheritance of fruit cracking resistance in tomato (Solanum lycopersicum L.). Asian J. Agric. Res. 11, 10–17. doi: 10.3923/ajar.2017.10.17
Nei, M. (1973). Analysis of gene diversity in subdivided populations. Proc. Natl. Acad. Sci. U.S.A. 70, 3321–3323.
Norelli, J. L., Wisniewski, M., Fazio, G., Burchard, E., Gutierrez, B., Levin, E., et al. (2017). Genotyping-by-sequencing markers facilitate the identification of quantitative trait loci controlling resistance to Penicillium expansum in Malus sieversii. PLoS ONE 12:e0172949. doi: 10.1371/journal.pone.0172949
Oraguzie, N. C., Wilcox, P. L., Rikkerink, E. H. A., and De, H. N. (2003). “Linkage disequilibrium,” in Forest Research, eds N. Oraguzie, E. Rikkerink, S. Gardiner, and H. Nihal de Silva (New York, NY: Springer-Verlag), 11–39.
Osorio-Guarín, J. A., Enciso-Rodríguez, F. E., González, C., Fernández-Pozo, N., Mueller, L. A., and Barrero, L. S. (2016). Association analysis for disease resistance to Fusarium oxysporum in cape gooseberry (Physalis peruviana L.). BMC Genomics 17:248. doi: 10.1186/s12864-016-2568-7
Peña, J. F., Ayala, J. D., Fischer, G., Cháves, B., Cárdenas-Hernández, J. F., and Almanza, P. J. (2011). Relaciones semilla-fruto en tres ecotipos de uchuva (Physalis peruviana L.). Rev. Colomb. Cienc. Hortíc. 4, 43–54. doi: 10.17584/rcch.2010v4i1.1224
Poland, J. A., Brown, P. J., Sorrells, M. E., and Jannink, J. (2012). Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by- sequencing approach. PLoS ONE 7:e32253. doi: 10.1371/journal.pone.0032253
Poland, J., and Rife, T. (2012). Genotyping-by-sequencing for plant breeding and genetics. Plant Genome 5, 92–102. doi: 10.3835/plantgenome2012.05.0005
Pritchard, J. K., Stephens, M., and Donnelly, P. (2000). Inference of population structure using multilocus genotype data. Genetics 155, 945–959. Available online at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1461096/pdf/10835412.pdf
Rafalski, A., and Morgante, M. (2004). Corn and humans: recombination and linkage disequilibrium in two genomes of similar size. Trends Genet. 20, 103–111. doi: 10.1016/j.tig.2003.12.002
Rafalski, J. A. (2010). Association genetics in crop improvement. Curr. Opin. Plant Biol. 13, 174–180. doi: 10.1016/j.pbi.2009.12.004
Ramadan, M. F. (2011). Bioactive phytochemicals, nutritional value, and functional properties of cape gooseberry (Physalis peruviana L.): an overview. Food Res. Int. 44, 1830–1836. doi: 10.1016/j.foodres.2010.12.042
Ramadan, M. M., El-ghorab, A. H., and Ghanem, K. Z. (2015). Volatile compounds, antioxidants, and anticancer activities of Cape gooseberry fruit (Physalis peruviana L.): an in-vitro study. J. Arab Soc. Med. Res. 10, 56–64. doi: 10.4103/1687-4293.175556
Rodríguez, G. R., Moyseenko, J. B., Robbins, M. D., Huarachi Morejón, N., Francis, D. M., and van der Knaap, E. (2010). Tomato analyzer: a useful software application to collect accurate and detailed morphological and colorimetric data from two-dimensional objects. J. Vis. Exp. e1856. doi: 10.3791/1856
R Team (2014). R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing. Available online at: http://www.r-project.org
Saxena, R. K., Singh, V. K., Kale, S. M., Tathineni, R., Parupalli, S., Kumar, V., et al. (2017). Construction of genotyping-by-sequencing based high-density genetic maps and QTL mapping for fusarium wilt resistance in pigeonpea. Sci. Rep. 7, 1–11. doi: 10.1038/s41598-017-01537-2
Sharma, M., and Pandey, G. K. (2015). Expansion and function of repeat domain proteins during stress and development in plants. Front. Plant Sci. 6:1218. doi: 10.3389/fpls.2015.01218
Silvertown, J., and Charlesworth, D. (2009). Introduction to Plant Population Biology. Hoboken, NY: Wiley-Black.
Soto-cerda, B. J., and Cloutier, S. (2012). “Association mapping in plant genomes,” in Genetic Diversity in Plants, ed M. Caliskan (InTech), 29–55. Available online at: http://www.intechopen.com/books/genetic-diversity-in-plants/association-mapping-in-plant-genomes
Takayama, M., and Ezura, H. (2015). How and why does tomato accumulate a large amount of GABA in the fruit ? Front. Plant Sci. 6:612. doi: 10.3389/fpls.2015.00612
Taranto, F., D'Agostino, N., Greco, B., Cardi, T., and Tripodi, P. (2016). Genome-wide SNP discovery and population structure analysis in pepper (Capsicum annuum) using genotyping by sequencing. BMC Genomics 17:943. doi: 10.1186/s12864-016-3297-7
Tsay, Y. F., Chiu, C. C., Tsai, C. B., Ho, C. H., and Hsu, P. K. (2007). Nitrate transporters and peptide transporters. FEBS Lett. 581, 2290–2300. doi: 10.1016/j.febslet.2007.04.047
Uitdewilligen, J. G. A. M. L., Wolters, A. A., Bjorn, B. D., Borm, T. J. A., Visser, R. G. F., and van Eck, H. J. (2013). A next-generation sequencing method for genotyping- by-sequencing of highly heterozygous autotetraploid potato. PLoS ONE 8:e62355. doi: 10.1371/journal.pone.0062355
Uncu, A. O., Frary, A., Karlovsky, P., and Doganlar, S. (2016). High-throughput single nucleotide polymorphism (SNP) identification and mapping in the sesame (Sesamum indicum L.) genome with genotyping by sequencing (GBS) analysis. Mol. Breed. 36:173. doi: 10.1007/s11032-016-0604-6
Valdenegro, M., Almonacid, S., Henríquez, C., Lutz, M., Fuentes, L., and Simpson, R. (2013). The effects of drying processes on organoleptic characteristics and the health quality of food ingredients obtained from goldenberry fruits. Open Access Sci. Rep. 2, 1–7. doi: 10.4172/scientificreports
Voss-fels, K., and Snowdon, R. J. (2016). Understanding and utilizing crop genome diversity via high-resolution genotyping. Plant Biotechnol. J. 14, 1086–1094. doi: 10.1111/pbi.12456
Wang, Y., Diehl, A., Wu, F., Vrebalov, J., Giovannoni, J., Siepel, A., et al. (2008). Sequencing and comparative analysis of a conserved syntenic segment in the Solanaceae. Genetics 180, 391–408. doi: 10.1534/genetics.108.087981
Wu, S. J., Tsai, J. Y., Chang, S. P., Lin, D. L., Wang, S. S., Huang, S., et al. (2006). Supercritical carbon dioxide extract exhibits enhanced antioxidant and anti-inflammatory activities of Physalis peruviana. J. Ethnopharmacol. 108, 407–413. doi: 10.1016/j.jep.2006.05.027
Wu, W., Cheng, Z., Liu, M., Yang, X., and Qiu, D. (2014). C3HC4-Type RING Finger Protein NbZFP1 is involved in growth and fruit development in Nicotiana benthamiana. PLoS ONE 9:e99352. doi: 10.1371/journal.pone.0099352
Xu, J., Ranc, N., Muños, S., Rolland, S., Bouchet, J. P., Desplat, N., et al. (2013). Phenotypic diversity and association mapping for fruit quality traits in cultivated tomato and related species. Theor. Appl. Genet. 126, 567–581. doi: 10.1007/s00122-012-2002-8
Xu, Y., Li, P., Yang, Z., and Xu, C. (2017). Genetic mapping of quantitative trait loci in crops. Crop J. 5, 175–184. doi: 10.1016/j.cj.2016.06.003
Keywords: GWAS, fruit traits, Physalis peruviana, mixed linear model, SNP markers
Citation: García-Arias FL, Osorio-Guarín JA and Núñez Zarantes VM (2018) Association Study Reveals Novel Genes Related to Yield and Quality of Fruit in Cape Gooseberry (Physalis peruviana L.). Front. Plant Sci. 9:362. doi: 10.3389/fpls.2018.00362
Received: 22 November 2017; Accepted: 05 March 2018;
Published: 20 March 2018.
Edited by:
Jaime Prohens, Universitat Politècnica de València, SpainReviewed by:
Gustavo R. Rodríguez, Instituto de Investigaciones en Ciencias Agrarias de Rosario (IICAR - CONICET), ArgentinaYuepeng Han, Wuhan Botanical Garden of Chinese Academy of Sciences, China
Prashanth N. Suravajhala, Birla Institute of Scientific Research, India
Copyright © 2018 García-Arias, Osorio-Guarín and Núñez Zarantes. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Victor M. Núñez Zarantes, dm51bmV6QGNvcnBvaWNhLm9yZy5jbw==