Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 21 August 2023
Sec. Functional and Applied Plant Genomics

Genome-wide survey and characterization of microsatellites in cashew and design of a web-based microsatellite database: CMDB

Siddanna Savadi*Siddanna Savadi1*B. M. MuralidharaB. M. Muralidhara2V. VenkataravanappaV. Venkataravanappa2J. D. AdigaJ. D. Adiga1
  • 1ICAR- Directorate of Cashew Research (DCR), Puttur, Karnataka, India
  • 2ICAR-Indian Institute of Horticultural Research (IIHR), CHES, Madikeri, Karnataka, India

The cashew is an edible tree nut crop having a wide range of food and industrial applications. Despite great economic importance, the genome-wide characterization of microsatellites [simple sequence repeats (SSRs)] in cashew is lacking. In this study, we carried out the first comprehensive genome-wide microsatellites/SSRs characterization in cashew and developed polymorphic markers and a web-based microsatellite database. A total of 54526 SSRs were discovered in the cashew genome, with a mean frequency of 153 SSRs/Mb. Among the mined genome-wide SSRs (2-6 bp size motifs), the dinucleotide repeat motifs were dominant (68.98%) followed by the trinucleotides (24.56%). The Class I type of SSRs (≥20 bp) were 45.10%, while Class II repeat motifs (≥12–<20 bp) were 54.89% of the total genomic SSRs discovered here. Further, the AT-rich SSRs occurred more frequently in the cashew genome (84%) compared to the GC-rich SSRs. The validation of the in silico-mined genome-wide SSRs by PCR screening in cashew genotypes resulted in the development of 59 polymorphic SSR markers, and the polymorphism information content (PIC) of the polymorphic SSR markers ranged from 0.19 to 0.84. Further, a web-based database, “Cashew Microsatellite Database (CMDB),” was constructed to provide access to the genome-wide SSRs mined in this study as well as transcriptome-based SSRs from our previous study to the research community through a user-friendly searchable interface. Besides, CMDB provides information on experimentally validated SSRs. CMDB permits the retrieval of SSR markers information with the customized search options. Altogether, the genome-wide SSRs characterization, the polymorphic markers and CMDB database developed in this study would serve as valuable marker resources for DNA fingerprinting, germplasm characterization, genetic studies, and molecular breeding in cashew and related Anacardium species.

1 Introduction

The cashew (Anacardium occidentale L.) is an edible tree nut crop grown in more than 30 countries in the tropical and subtropical regions (Sharma et al., 2020). Cashew is an evergreen tree with one or more vegetative and reproductive flushings occurring in an annual cycle (Adiga et al., 2019). It is a diploid species (2n =42) with an andromonoecy breeding system (Aliyu and Awopetu, 2007; Savadi et al., 2020a). It has its origins in North-Eastern Brazil and it was spread to different parts of the world, mainly by the Portuguese during the 16th century as the soil conservation tree (Samal et al., 2003; dos Santos et al., 2019).

Over time, cashew has become an economically important horticulture crop in many developing countries earning huge foreign exchange. Presently, the global raw cashew nut production is over 3.8 million metric tons, and the value of the global cashew nut market is worth nearly US$ 6 billion (INC Statistical Yearbook, 2020–2021). Cashew nut kernels, cashew nut shell liquid (CNSL), kernel oil, and cashew apple (hypocarp or pseudofruit) are utilized in food and numerous other industries (Samal et al., 2003; Talbiersky et al., 2009; Das and Arora, 2017; Sharma et al., 2020). Kernels are used as a dessert nut and in confectioneries such as chocolates, cashew milk, cashew butter, etc. The kernel oil has excellent cooking quality and also widely used in the cosmetic industry due to its nourishing and nurturing effects on skin (Visalakshi et al., 2011). The cashew apple is also edible and is consumed raw or in processed form viz., jam, jelly, syrup, ready-to-serve juice (Akinwale, 2000; Sharma et al., 2020). The cashew apples used to prepare fermented alcoholic beverages (Das and Arora, 2017). CNSL, a reddish-brown oil present in a cashew nutshell, is a multipurpose byproduct of the cashew industry widely used in varnishes, lubricants, synthetic resins, molding compositions, and insulating coatings (Talbiersky et al., 2009; Furtado et al., 2019). Recently, anacardic acid, cardanol, and cardol, the major constituents of CNSL, have acquired great importance in the pharmaceutical industry as they have anticancer and many other great medicinal properties (Ashraf and Rathinasamy, 2018; Shi et al., 2019). Thus, the demand for cashew nuts and their products is increasing throughout the world.

Despite its great economic importance and demand, the genetic improvement of cashew using molecular breeding tools has lagged behind the other important fruit and nut crops due to the limited or lack of genetic and genomic investigations (Savadi et al., 2020a). Molecular markers are important molecular breeding tools extensively used in plant genetics and breeding (Collard and Mackill, 2008). Over the course of time, different types of molecular markers have been developed, which fall into mainly the dominant and co-dominant classes. Co-dominant markers such as Simple Sequence Repeats (SSRs)/Microsatellites and Single Nucleotide Polymorphisms (SNPs) are considered to be more informative and produce consistent results compared to the dominant markers such as RAPD, ISSR and/or AFLP and as a result, the dominant markers are becoming obsolete (Collard and Mackill, 2008; Grover and Sharma, 2016).

To date, in the majority of the genetic studies in cashew, the first generation or dominant markers, viz., RAPD, ISSR, and AFLP markers, have been used (Mneney et al., 2001; Dhanaraj et al., 2003; Archak et al., 2003a; Archak et al., 2003b; Archak et al., 2009; Thimmappaiah et al., 2009; Aliyu, 2012; Jena et al., 2016; Borges et al., 2018; dos Santos et al., 2019; da Costa Gomes et al., 2021) due to the limited availability of co-dominant markers viz., SSR, SNP and InDel (Insertion/Deletion) markers (Croxford et al., 2006; Savadi et al., 2022a; Savadi et al., 2023). Among the co-dominant markers, microsatellite or SSR markers have gained wide popularity and become markers of choice for genetic studies because of their multi-allelism, high abundance in the genome, ease of use, and amenability for high-throughput analysis (Taheri et al., 2018; Savadi et al., 2020b). Presently, only 21 genomic SSR (Croxford et al., 2006) and 36 genic SSR (Savadi et al., 2022a) markers are available in cashew, which is extremely low to represent the entire genome and does not meet the needs of comprehensive genetic research in cashew.

Previously, the development of microsatellite or SSR markers was an expensive and time-consuming process. But the rapid improvements in sequencing technologies and bioinformatics have reduced the cost and time required for the development of a large set of robust markers through genome sequencing and in silico mining of the genome sequences for potential markers (Luo et al., 2021; Wang et al., 2022; Savadi et al., 2023). To date, there have been no studies on the genome-wide characterization of microsatellites or SSRs in cashew. Further, the large set of SSRs mined from the genomic sequences cannot be efficiently utilized by the researchers without a user-friendly analytical search tool. In numerous crops such as Cucumis melo (CmMDb: Chaduvula et al., 2015), Sugar beet (SBMDb: Iquebal et al., 2015), Sesame (SisatBase: Dossa et al., 2017), and Anemone sp. (Martina et al., 2022), genome-wide SSRs have been discovered and web-based databases are designed for storage and easy accessibility of the large set of genome-wide SSRs to researchers. Recently, the first draft genome of cashew cv. Bhaskara (356 Mb size with 92% BUSCO value) was generated through hybrid assembly of Illumina mate-pair reads and Oxford Nanopore reads and reported by our group (Savadi et al., 2022b). The availability of draft genome sequence prompts the discovery of a large set of SSRs at the genome level and make them easily accessible to researchers.

In the present study, we carried out the comprehensive characterization of genome-wide microsatellites/SSRs for the first time in cashew and designed a user-friendly web-based microsatellite database for easy availability of genome-wide SSR information to cashew researchers. The possibility of in silico-discovered SSRs to detect polymorphism in cashew genotypes and cross-amplify in related Anacardium species was validated by PCR amplification and fragment separation using a subset of mined SSRs. Thus, microsatellites/SSRs resources generated here would be useful for accelerating genetic studies and crop improvement in cashew and related species.

2 Materials and methods

2.1 Plant material and DNA extraction

In this study, 32 cashew genotypes (Table 1) and two Anacardium species, A. microcarpum and A. othonianum, were used for the validation of polymorphism and cross species amplification of SSRs, respectively. Leaves were harvested from the field-grown plants, and genomic DNA was extracted following the Inglis et al. (2018) method. Initially, finely ground leaf tissues were pre-washed twice with the Sorbitol wash buffer [100 mM tris hydrochloride (Tris-HCl) pH 8.0, 0.35 M Sorbitol, 5 mM ethylenediaminetetraacetic acid (EDTA) pH 8.0, 1% (w/v) polyvinylpyrrolidone (PVP-40)] with 2-Mercaptoethanol (1% v/v) to remove the excessive phenolics. The prewashed samples were then used for DNA extraction with cetyl trimethylammonium bromide (CTAB) buffer [100 mM Tris-HCl pH 8.0, 3 M NaCl, 3% CTAB, 20 mM EDTA, and 1% (w/v) PVP-40]. The integrity of extracted DNA was checked by electrophoresis on a 1% agarose gel containing 0.1 mg/ml of ethidium bromide and quantified using a NanoPhotometer N60 (Implen, Munich, Germany).

TABLE 1
www.frontiersin.org

Table 1 List of genotypes used in this study with important characteristics.

2.2 Discovery, characterization, and validation of genome-wide SSRs

The whole-genome sequence (356 MB size, 92% BUSCO value) of cashew cv. Bhaskara generated by hybrid assembly of Illumina mate-pair reads and Oxford Nanopore reads at ICAR-DCR (Savadi et al., 2022b) and deposited in the National Centre for Biotechnology Information (NCBI) database (PRJNA766521) was used for the mining of genome-wide microsatellites/SSRs. The PolyMorphPredict software, which permits in silico mining of microsatellites and designs primers from genome and transcriptome data, was used to discover and characterize the composition and distributions of genome-wide SSRs (Das et al., 2019). PolyMorphPredict takes input sequence data in Fasta format, mines the SSRs with the MISA tool, and designs primers for the MISA mined SSRs using the Primer3 software with default parameters. The draft genome sequence was mined for SSRs with a minimum repeat number of 6, 5, 5, 4, 3 for di, tri, tetra, penta, and hexanucleotide SSRs, respectively, and a maximum difference of 100 bp between two SSRs. Further, a total of 100 primer pairs were synthesized from Eurofins Genomics, Bengaluru, India, for validation of in silico mined SSRs by polymerase chain reaction (PCR) amplification.

2.3 PCR amplification of SSRs

The annealing temperatures (Ta) of the synthesized SSR primers were optimized using gradient PCR. The PCR reaction was performed in 15 μl reaction mixtures containing 7.5 μl EmeraldAmp® GT PCR Master Mix (Takara Bio Inc., Japan), 20 pM each of the forward and reverse primers, and 100 ng of genomic DNA in the Veriti™ 96-Well Thermal Cycler (ThermoFisher Scientific USA) and volume makeup was done with Millipore water. The thermal profile conditions used for PCR amplification of SSRs included an initial denaturation step of 3 min at 95°C followed by 35 cycles of 40 s at 95°C, 40 s at primer-specific Ta, 45 s at 72°C and finally, 8 min at 72°C. The PCR products were resolved along with a 100 bp DNA ladder on 3.5% agarose gels containing 0.1 mg/ml of ethidium bromide in 1X TBE (Tris/borate/EDTA) buffer by electrophoresis at 70 V for 3 h. The gels were visualized by exposing them to UV light in the Gel Doc system (Alpha Imager, USA). The primer pairs producing clear and distinct PCR bands in the expected size range on the gel were considered positive amplifications. The primer pairs showing positive amplifications were evaluated for polymorphism detection in 32 cashew genotypes at standardized PCR conditions (Table 1), and the primer pairs displaying different-sized bands among the genotypes were considered polymorphic. Further, SSR primer pairs were also tested for cross-species amplification in A. microcarpum and A. othonianum by PCR screening, and the analysis of PCR products was the same as described above for cashew. The SSR markers producing specific PCR bands in the expected size range were considered cross-transferable to the related Anacardium species.

2.4 Data analysis

The PCR bands in the gel photos of each SSR primer were scored in the allelic format (band sizes in bp). After scoring the data, genetic diversity, heterozygosity, allele frequencies, allele number, genotype frequency, and polymorphic information content (PIC) values were calculated for each SSR marker using PowerMarker V3 (Liu and Muse, 2005). Dice index-based dissimilarity matrix was calculated and used for clustering analysis of cashew genotypes by the Neighbor-Joining (NJ) method with the DARwin software V6 (Perrier and Jacquemond-Collet, 2006).

2.5 Development of the Cashew Microsatellite Database

Cashew Microsatellite Database (CMDB) was designed and implemented as a three-tier architecture website using the tech tools Node js, react js, and MongoDB. Users can create a custom query using the multiple filters available on submission of the search button on the website. An HTTP request is made to the Node JS server where the request is processed, and data will be retrieved from MongoDB and sent back as an HTTP response to the client browser. The basic scheme of CMDB development involved the following steps: i) Genomic and Genic SSR datasets were experimented with, recorded, and consolidated; ii) entities and relationships among the entities in the rational database were created; iii) Database Normalization i.e., organization of data into tables in such a way that the results of using the database are always unambiguous and intended; iv) relationships between the tables were established according to rules designed both to protect the data and to make the database more flexible by eliminating redundancy and inconsistent dependency; and v) design and implementation of a three-tier architecture website using tech tools Node js, react js and MongoDB to make the SSR data accessible to the user.

3 Results

3.1 Composition and distribution of genome-wide microsatellite/SSRs

A total of 54,526 SSRs were mined from the 356 Mb draft genome sequence of A. occidentale, with mean marker density of 153 SSRs per Mb (Table 2). However, the primer pairs could be successfully designed for flanking sequences of 47,646 SSRs of the detected SSRs (Supplementary Table 1). Analysis of repeat motifs showed that 87.39% of mined SSRs were the perfect type of SSRs, i.e., repeat motifs are continuous without interruption by any nucleotide [e.g., (GC)20], while 12.61% were the imperfect or compound type of SSRs, i.e., SSRs with the stretches of repeat motifs interrupted by nucleotides that are not repeated [e.g., (AT)12GC(AT)8] (Table 2).

TABLE 2
www.frontiersin.org

Table 2 Summary statistics and characteristics of genome-wide SSRs in cashew genome.

Analysis of the distribution of the five classes of perfect SSRs in the cashew genome revealed that the dinucleotide repeat types were most abundant, comprising 68.98%, followed by the trinucleotide repeat motifs (24.57%), the compound SSR repeats (12.62%), the tetra-nucleotide repeat motifs (4.70%), the pentanucleotide repeat motifs (0.96%), and the hexanucleotide repeat motifs (0.80%) (Table 2).The frequency distribution of different repeat motifs in the draft genome of cashew is presented in Figure 1A. The nucleotide composition of the identified SSRs showed that 84% were composed of A and/or T nucleotides, while 16% were composed of G and/or C nucleotides. The most dominant repeat sequences were AT (23.54%), followed by TA (16.71%) and AAT (4%) (Figure 1B). The detailed frequency distribution of repeat motifs and the repeat numbers in the di- and tri-nucleotide SSRs, which are dominant in the genome-wide SSRs, is presented in Table 3. In the di-nucleotide SSRs, AT/AT repeat motifs were most abundant (65.79%) and CG/CG repeats were the least abundant repeat motifs (0.30), and the remaining two types, AC/GT and AG/CT, were 17.38% and 16.49%, respectively. In the trinucleotide SSRs, AAT/ATT repeat motifs were most abundant (55.37%), followed by AAG/CTT with a frequency of 22.46%, and ATC/ATG repeat motifs were 8.94%, ACC/GGT motifs were 3.81%, AAC/GTT motifs were 3.10%, AGC/CTG motifs were 2.31%, AGG/CCT motifs were 2.61%, and other motifs (ACG/CGT, ACT/AGT, and CGG/CGG) together were 1.39%.

FIGURE 1
www.frontiersin.org

Figure 1 Frequency distribution of different types of SSR repeats in the Cashew genome (A) Frequency of motif types by unit length (K-mers) (B) Frequency of repeat motifs by nucleotide composition.

TABLE 3
www.frontiersin.org

Table 3 Frequencies of different repeat motifs in di- and tri-nucleotide SSRs in cashew genome.

Based on the size of repeats motif, the mined SSRs were categorized into two classes, viz., Class I (hypervariable) SSRs, the SSRs with repeat motif size ≥20 bp, and Class II (variable) SSRs, the SSRs with repeat motif sizes of ≥12 and <20 bp. The frequency of Class I genomic SSRs was 45.10%, while that of Class II types was 54.89%. The Class I SSRs were dominated by the dinucleotide (46.71%) and the compound SSRs (24.66%), while the Class II SSRs were dominated by the dinucleotide (69.01%) and the tri-nucleotide SSRs (30.92%).

3.2 Polymorphism, genetic diversity, and transferability of genomic SSR markers

We validated the mined genomic SSR markers by synthesizing 100 primer pairs and testing them for PCR amplification in A. occidentale. All the tested SSR primer pairs were successfully amplified in A. occidentale, indicating 100% accuracy in primer design. Further, fifty nine of 100 primer pairs screened in 32 germplasm accessions showed polymorphism (Table 4). The 59 polymorphic markers detected 294 alleles in 32 accessions. The number of alleles per SSR locus varied from 2 to 15, with a mean of 4.98 alleles per SSR locus (Table 4). PIC values of the assayed SSRs ranged from 0.19 to 0.84, with a mean of 0.59 (Table 4). Polymorphic SSR markers were grouped into three classes based on the PIC values. Of the 59 SSR markers developed, 47 were highly polymorphic (PIC value ≥0.50), 9 were moderately polymorphic (PIC value between 0.25-0.50), and 3 markers were least polymorphic (PIC value <0.25) (Table 4). Further, 39 of the polymorphic SSRs were of di-nucleotide type, 8 were of trinucleotide type, and 12 were compound SSR repeats (3 perfect and 9 imperfect type compound SSRs). The wide range in amplicon size difference (~150–250 bp) was observed with the marker DCR SSR-22 amplifying a compound motif: (GT)6ct(GA)15 (Figure 2A).

TABLE 4
www.frontiersin.org

Table 4 Characteristics of the 59 novel polymorphic SSR markers developed in this study by scanning of the whole genome sequence of cashew for SSRs and validation.

FIGURE 2
www.frontiersin.org

Figure 2 Validation of genome-wide SSRs for amplification and polymorphism in 32 cashew genotypes: (A) Gel pictures showing polymorphism detection by the DCR SSR-22 and DCR SSR-38 markers in 32 cashew genotypes; (B) Neighbour-Joining dendrogram showing genetic relationships among 32 cashew accessions collected from different geographic regions. The dendrogram is constructed based on Nei’s (D) genetic distance coefficient.

The dendrogram analysis of 32 genotypes using the 59 polymorphic SSR markers classified the assayed genotypes into three major clusters (Figure 2B). The first cluster consisted of 16 genotypes; the second cluster consisted of 14 genotypes; and the third cluster consisted of two genotypes. Pairwise dissimilarity was a maximum of 0.85 between NRC-335 and NRC-265, while a minimum of 0.33 was observed between NRC-385 and NRC-386. NRC-385 and NRC-386 are the two genotypes originating from a common parent, as depicted in the dendrogram analysis. Further, the NRC-335 and NRC-265 are from two geographically distinct regions, i.e., the NRC-335 is from the West Coast region of India, while the NRC-265 is from the East Coast region of India.

So far, there are no species-specific genetic markers designed for less studied species in the Anacardium genus. Cross-species transferability of cashew SSRs can be an alternative source of molecular markers for less studied Anacardium species. Testing of cross-species PCR amplification of the newly designed SSR primers in the two Anacardium species, viz., A. microcarpum and A. othonianum, showed that 91% of the tested primers were successful in PCR amplification (Supplementary Table 2), suggesting a high rate of transferability of cashew genomic SSRs in the Anacardium genus.

3.3 CMDB: microsatellite database for cashew

The Cashew Microsatellite Database (CMDB) is an online relational database that stores the microsatellite repeats information mined from the recently sequenced cashew genome (Savadi et al., 2022b) and the shoot transcriptome (Savadi et al., 2022a), as well as the experimentally validated SSR markers. CMBD is available at https://www.cashewmicrosatellitesdatabase.in/. CMBD is an interactive database that has been implemented as a 3-tier application architecture where we have a Client tier, an Application or Server tier, and a Database tier, as shown in Figure 3A. CMDB has a user-friendly interface developed using React js and a server designed and implemented using Node js that connects to MongoDB, where all the genomic and genic SSR data is stored. Users can access this responsive website using any browser on a desktop or mobile device connected to the internet. User-need-based customized queries can be generated from the web interface and allow users to search the Cashew microsatellite database in MongoDB.

FIGURE 3
www.frontiersin.org

Figure 3 The interface and searching of the Cashew Microsatellite Database (CMDB) for SSRs: (A) The three-tier architecture of CMDB, (B) The database search page displaying different SSR search parameters; and (C) The database search results displaying the details of SSRs.

CMDB can be searched to extract genomic as well as genic microsatellites based on motif type (di, tri, tetra, penta, and hexa), repeat motif, copy number, repeat size, expected PCR product size and primer pair annealing temperatures (Ta) (Figure 3B). The microsatellites can be searched based on the choice of scaffolds/transcripts, where more than one scaffold/transcript can be selected using the dropdown option (Figure 3B) the results of the database search displays details of SSRs including the primer pairs for the displayed SSRs (Figure 3C). This is a novel approach and is helpful for breeders and biotechnologists to easily extract microsatellites based on their needs.

4 Discussion

Though the integration of molecular markers in breeding programs and genetic studies has substantially enhanced the speed and accuracy of crop improvement in important fruit and nut crops, molecular breeding and genetic analysis in cashew have lagged behind due to the scarcity of informative markers. Microsatellites or SSRs are highly informative markers and are widely used in genetic analyses and breeding of crops, including trees. In A. occidentale, very limited SSR markers are available for comprehensive genetic studies and molecular breeding (Croxford et al., 2006; Savadi et al., 2022a).

The NGS technologies allow rapid large-scale sequencings at a lesser cost, which permits discovery and development of SSR markers for less studied crops (Khodaeiaminjan et al., 2018; Taheri et al., 2018; Luo et al., 2021; Wang et al., 2022). The genome sequences generated using the NGS technology have been used to discover genome-wide SSRs and develop SSR markers in various tree species, such as pistachio (Ziya Motalebipour et al., 2016), hazelnut (Öztürk et al., 2018), avocado (Ge et al., 2019), fruit and forest species (Song et al., 2021), and Grevillea sp. (Dabral et al., 2021). Efficient utilization of the large set of SSRs mined through genome scanning is possible with a user-friendly web tool to search the SSR information from a database. So far, genome-wide SSRs discovery and development of a database for storage and retrieval of the discovered SSRs have not been reported in cashew. The present study, for the first time, reports the discovery and characterization of genome-wide SSRs and the development of a microsatellite database (CMDB) for cashew.

In this study, a total of 54526 SSRs were discovered from the cashew genome, with a mean density of 153 SSRs/Mb. The density of markers found in this study was higher than that reported in apple (40.8 SSRs/Mb, 485 Mb) (Zhang et al., 2021b), Chinese spring wheat (36.68 SSRs/Mb, 9.93 Gb) (Han et al., 2015), and Matthiola incana (23.25 SSRs/Mb, 1977.48 Mb) (Tan et al., 2023), while it was less than mango (418.17 SSRs/Mb, 253.6 Mb) (Ravishankar et al., 2015), Prunus mume (794 SSRs/Mb, 237 Mb) (Sun et al., 2013), and Pomegranate (527.97 SSRs/Mb, 296 Mb) (Patil et al., 2020), suggesting that generally, the density of SSRs decrease with an increase in the genome size. Further, the frequency of perfect SSRs (87.39%) was much higher than the frequency of imperfect SSRs (12.61%) in the discovered genome-wide SSRs and is consistent with similar results in eggplant (Portis et al., 2018), Anemone coronaria (Martina et al., 2022), and Aristotelia chilensis (Bastías et al., 2016). In the perfect SSRs, dinucleotide repeat types were most dominant (68.98%), followed by trinucleotide repeat motifs (24.56%), which are consistent with similar results in other plant species investigations (Parida et al., 2015; Vieira et al., 2016; Ziya Motalebipour et al., 2016; Wang et al., 2018).

The most common dinucleotide was AT/AT repeat motifs (65.79%), while CG/CG repeats were the least abundant (0.30). This result is in agreement with previous findings that AT-rich SSRs are predominant in dicots, viz., apple (Zhang et al., 2012a), sweet orange (Biswas et al., 2014), and Cucumis sativus (Cavagnaro et al., 2010), while GC-rich dinucleotide repeats are dominant in monocots (Sonah et al., 2011; Qin et al., 2015). These differences in the SSR nucleotide among the dicots and monocots could be partially explained based on the relative nucleotide composition of the genomes. The average GC content of dicot genomes (34.6%) is lower than that of monocot genomes (43.7%) (Cavagnaro et al., 2010), and it is observed that the frequency of AT and TA in the genomes increased with the evolution of the plant kingdom (Qin et al., 2013).

In this study, Class I type of SSRs (≥20 bp repeat motif) were 45.10%, while Class II types (≥12 and <20 bp repeat motif) were 54.89%. The frequencies of two classes of SSRs are in agreement with other studies in plants (Parida et al., 2015; Wang et al., 2018; Patil et al., 2021). Class I SSRs are observed to be highly polymorphic compared to Class II SSRs (Parida et al., 2015; Vieira et al., 2016; Patil et al., 2021) because shorter SSR sequences tend to have lower mutation rates (Vieira et al., 2016).

The validation of genome-wide SSRs was performed by the synthesis and screening of 100 randomly selected SSR primer pairs in cashew genotypes. Fifty nine of the 100 SSR primers screened in 32 germplasm accessions showed polymorphism. To date, 21 genomic and 36 genic SSRs have been reported in cashew (Croxford et al., 2006; Savadi et al., 2022a). Therefore, this study not only provides genome-wide SSR information but also experimentally validated polymorphic SSR markers for cashew. Further, SSR markers grouped based on the PIC values showed that 47 newly developed SSR markers were highly polymorphic (PIC value ≥0.5), 9 were moderately polymorphic (PIC value between 0.25 and 0.50), and 3 markers were least polymorphic (PIC value <0.25) according to the Botstein et al. (1980) classification of polymorphic markers. Furthermore, the grouping of polymorphic SSRs based on the five classes of perfect SSRs and compound SSRs showed that the dinucleotides were dominant. The higher tendency of dinucleotide SSRs to be polymorphic is consistent with other studies on genomic SSRs in apples (Silfverberg-Dilworth et al., 2006), peanuts (Zhao et al., 2012a), watermelon (Zhao et al., 2012b), and black pepper (Negi et al., 2022). However, in a previous study where a repeat-rich genomic library screening method was used to generate SSR markers, the compound SSRs were found to be more polymorphic in cashew (Croxford et al., 2006). This difference in the polymorphic SSRs motif size could be due to the biases caused by the repeat probe sequences (AC15, AG15, AAC8, AAG8, AAT8, ACC8, AGG8, ATC8, AAAC6, AAAG6, and ACAT6) used to screen the genomic libraries for SSRs. The genetic diversity analysis using the newly developed SSRs clustered the 32 genotypes into three major clusters. Further, the pairwise dissimilarity index revealed that the maximum distinction was observed between the accessions viz., NRC-335 and NRC-265 collected from different geographic regions, the West Coast and the East Coast of India, respectively, while the minimum pairwise dissimilarity index was observed between the two genotypes, viz., NRC-385 and NRC-386, which shared one of the parents, indicating that the newly developed genomic SSRs have high discriminating power and present a powerful molecular tool for investigating genetic diversity and genetic relationships in the cashew genotypes.

To our knowledge, there is no distinct set of microsatellites or SSR markers developed for other species in the Anacardium genus. It has been demonstrated that SSR markers have a high potential for cross-species amplification/transferability in related species of the same genus. Transferability of markers is considered a cost-effective approach for developing genetic markers for species lacking genomic resources (Ellis and Burke, 2007; Vieira et al., 2016). In this study, 91% of the SSR primer pairs showed cross-species amplifications in the two wild relatives of cashew, viz., A. microcarpum and A. othonianum. This transferability rate was comparable with the results observed in previous studies using genomic SSRs in cashew (Croxford et al., 2006; Soares et al., 2013), but lower than the transferability with the genic SSRs (Savadi et al., 2022a). In A. humile, 85% of the 14 tested cashew SSRs amplified (Soares et al., 2013); in A. microcarpum, A. pumilum, and A. nanum, 92% of the 12 cashew SSRs amplified (Croxford et al., 2006); and in A. microcarpum and A. othonianum, 100% of 54 transcriptome-based SSRs amplified (Savadi et al., 2022a). The relatively lower rate of transferability of genomic SSRs compared to genic SSRs could be attributed to the higher conservation of genic sequences compared to sequences from the anonymous regions of the genomes (Ellis and Burke, 2007; Jiang et al., 2020). Thus, we contemplate that the SSR markers developed in this study can be a potential marker repository for not only cashew but also the related Anacardium species and could be employed for macro-syntenic comparisons, germplasm characterizations, genetic mapping, molecular breeding involving interspecies hybridizations, etc.

With the mining of genome-wide microsatellites/SSRs information, there is a need to develop a user-friendly web tool for easy access and efficient utilization of the mined SSRs in genetic studies and crop improvement. Several databases of genome-wide SSRs have been designed in different crop plants (Arora et al., 2013; Dossa et al., 2017; Jasrotia et al., 2019; Martina et al., 2022). However, in cashew, genome-wide SSRs information is not available. In this study, genome-wide SSR information generated in this study as well as in our previous study from transcriptome data (Savadi et al., 2022a) was integrated into CMDB, which permits the extraction of information related to both genomic and genic SSRs. Further, CMDB also provides the experimentally validated SSR markers in the cashew. Most of the microsatellite databases developed in other crops (Arora et al., 2013; Dossa et al., 2017; Jasrotia et al., 2019; Martina et al., 2022) provide only the in silico mined SSRs information of either the genomic SSRs or transcriptomic SSRs but not combined information like the CMDB. Thus, CMDB is the first comprehensive microsatellite database for cashew, and it will be of great use to cashew researchers, particularly the breeders, to develop novel markers from in silico mined SSRs and to directly use the experimentally validated markers in the research programs.

5 Conclusion

The limited availability of microsatellite/SSR markers in cashew has hindered genetic studies and crop improvement. In the current study, we mined and characterized genome-wide SSRs in the cashew genome and developed a cashew microsatellite database (CMDB), a comprehensive repository of microsatellites, which provides accessibility to genome-wide and transcriptome based SSRs information as well as the experimentally validated SSR markers to researchers and breeders. The large set of genome-wide SSRs and their free public accessibility will permit the development of a large set of new SSR markers for cashew, which are currently very scarce. Besides, we developed 59 highly informative SSR markers that are the first set of genomic SSRs developed in cashew through in silico mining of the cashew genome. Thus, the knowledge of genome-wide SSRs distribution, the development of novel SSR markers, the cross-species transferable SSRs, and the comprehensive microsatellite database would significantly accelerate genetic studies and crop improvement in cashew and related Anacardium species.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

SS conceived the idea, carried out the in silico analysis and experiments and development of database and wrote the manuscript. BM contributed in the experimentation and development of database. VV contributed towards sampling and experimentation. JA contributed to manuscript writing and proof editing and development of database.

Acknowledgments

The authors acknowledge the financial support and encouragement of the Director, ICAR-Directorate of Cashew Research, Puttur, Karnataka, India, and the Indian Council of Agricultural Research (ICAR), New Delhi.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1242025/full#supplementary-material

References

Adiga, J. D., Muralidhara, B. M., Preethi, P., Savadi, S. (2019). Phenological growth stages of the cashew tree (Anacardium occidentale L.) according to the extended BBCH scale. Ann. Appl. Biol. 175, 246–252. doi: 10.1111/aab.12526

CrossRef Full Text | Google Scholar

Akinwale, T. O. (2000). Cashew apple juice: its use in fortifying the nutritional quality of some tropical fruits. Eur. Food Res. Technol. 211, 205–207. doi: 10.1007/s002170050024

CrossRef Full Text | Google Scholar

Aliyu, O. M. (2012). Genetic diversity of Nigerian cashew germplasm. Genet. Diversity Plants 498, 163–184.

Google Scholar

Aliyu, O. M., Awopetu, J. A. (2007). Chromosome studies in cashew (Anacardium occidentale L.). Afr. J. Biotechnol. 6, 131–136.

Google Scholar

Archak, S., Gaikwad, A. B., Gautam, D., Rao, E. B., Swamy, K. R. M., Karihaloo, J. L. (2003a). DNA fingerprinting of Indian cashew (Anacardium occidentale L.) varieties using RAPD and ISSR techniques. Euphytica 130, 397–404. doi: 10.1023/A:1023074617348

CrossRef Full Text | Google Scholar

Archak, S., Gaikwad, A. B., Gautam, D., Rao, E. V., Swamy, K. R., Karihaloo, J. L. (2003b). Comparative assessment of DNA fingerprinting techniques (RAPD ISSR and AFLP) for genetic analysis of cashew (Anacardium occidentale L.) accessions of India. Genome 46, 362–369. doi: 10.1139/g03-016

PubMed Abstract | CrossRef Full Text | Google Scholar

Archak, S., Gaikwad, A. B., Swamy, K. R. M., Karihaloo, J. L. (2009). Genetic analysis and historical perspective of cashew (Anacardium occidentale L.) introduction into India. Genome 52, 222–230. doi: 10.1139/G08-119

PubMed Abstract | CrossRef Full Text | Google Scholar

Arora, V., Iquebal, M. A., Rai, A., Kumar, D. (2013). PIPEMicroDB: microsatellite database and primer generation tool for pigeonpea genome. Database 2013, bas054. doi: 10.1093/database/bas054

PubMed Abstract | CrossRef Full Text | Google Scholar

Ashraf, M. S., Rathinasamy, K. (2018). Antibacterial and anticancer activity of the purified cashew nut shell liquid: implications in cancer chemotherapy and wound healing. Nat. Prod. Res. 32, 2856–2860. doi: 10.1080/14786419.2017.1380022

PubMed Abstract | CrossRef Full Text | Google Scholar

Bastías, A., Correa, F., Rojas, P., Almada, R., Munoz, C., Sagredo, B. (2016). Identification and characterization of microsatellite loci in maqui (Aristotelia Chilensis [Molina] Stunz) using next-generation sequencing (NGS). PloS One 11, e0159825. doi: 10.1371/journal.pone.0159825

PubMed Abstract | CrossRef Full Text | Google Scholar

Biswas, M. K., Xu, Q., Mayer, C., Deng, X. (2014). Genome wide characterization of short tandem repeat markers in sweet orange (Citrus sinensis). PloS One 9, e104182. doi: 10.1371/journal.pone.0104182

PubMed Abstract | CrossRef Full Text | Google Scholar

Borges, A. N. C., Lopes, A. C. A., Britto, F. B., Vasconcelos, L. F. L., Lima, P. S. C. (2018). Genetic diversity in a cajui (Anacardium spp.) germplasm bank as determined by ISSR markers. Genet. Mol. Res. 17, 1–14. doi: 10.4238/gmr18212

CrossRef Full Text | Google Scholar

Botstein, D., White, R. L., Skolnick, M., Davis, R. W. (1980). Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am. J. Hum. Genet. 32, 314.

PubMed Abstract | Google Scholar

Cavagnaro, P. F., Senalik, D. A., Yang, L., Simon, P. W., Harkins, T. T., Kodira, C. D., et al. (2010). Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.). BMC Genom. 11, 1–18. doi: 10.1186/1471-2164-11-569

CrossRef Full Text | Google Scholar

Chaduvula, P. K., Bonthala, V. S., Manjusha, V., Siddiq, E. A., Polumetla, A. K., Prasad, G. M. (2015). CmMDb: a versatile database for Cucumis melo microsatellite markers and other horticulture crop research. PloS One 10, e0118630. doi: 10.1371/journal.pone.0118630

PubMed Abstract | CrossRef Full Text | Google Scholar

Collard, B. C., Mackill, D. J. (2008). Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Philos. Trans. R. Soc Lond. B Biol. Sci. 363, 557–572. doi: 10.1098/rstb.2007.2170

PubMed Abstract | CrossRef Full Text | Google Scholar

Croxford, A. E., Robson, M., Wilkinson, M. J. (2006). Characterization and PCR multiplexing of polymorphic microsatellite loci in cashew (Anacardium occidentale L.) and their cross-species utilization. Mol. Ecol. Notes 6, 249–251. doi: 10.1111/j.1471-8286.2005.01208.x

CrossRef Full Text | Google Scholar

Dabral, A., Shamoon, A., Meena, R. K., Kant, R., Pandey, S., Ginwal, H. S., et al. (2021). Genome skimming-based simple sequence repeat (SSR) marker discovery and characterization in Grevillea robusta. Physiol. Mol. Biol. Plants 27, 1623–1638. doi: 10.1007/s12298-021-01035-w

CrossRef Full Text | Google Scholar

da Costa Gomes, M. F., Borges, A. N. C., Batista, G. S. S., de Andrade, L. G., Oliveira, M. E. A., de Almeida Lopes, Â.C., et al. (2021). Genetic diversity and structure in natural populations of Cajui from Brazilian Cerrado. Biosci. J. 37, e37080. doi: 10.14393/BJ-v37n0a2021-53974

CrossRef Full Text | Google Scholar

Das, R., Arora, V., Jaiswal, S., Iquebal, M. A., Angadi, U. B., Fatma, S., Kumar, D., et al. (2019). PolyMorphPredict: a universal web-tool for rapid polymorphic microsatellite marker discovery from whole genome and transcriptome data. Front. Plant Sci. 9, 1966.

PubMed Abstract | Google Scholar

Das, I., Arora, A. (2017). Post-harvest processing technology for cashew apple–A review. J. Food Eng. 194, 87–98. doi: 10.1016/j.jfoodeng.2016.09.011

CrossRef Full Text | Google Scholar

Dhanaraj, A. L., Rao, E. B., Swamy, K. R. M., Bhat, M. G., Prasad, D. T., Sondur, S. N. (2003). Using RAPDs to assess the diversity in Indian cashew (Anacardium occidentale L.) germplasm. J. Hortic. Sci. Biotechnol. 77, 41–47. doi: 10.1080/14620316.2002.11511454

CrossRef Full Text | Google Scholar

Dossa, K., Yu, J., Liao, B., Cisse, N., Zhang, X. (2017). Development of highly informative genome-wide single sequence repeat markers for breeding applications in sesame and construction of a web resource: SisatBase. Front. Plant Sci. 8, 1470. doi: 10.3389/fpls.2017.01470

PubMed Abstract | CrossRef Full Text | Google Scholar

dos Santos, J. O., Mayo, S. J., Bittencourt, C. B., de Andrade, I. M. (2019). Genetic diversity in wild populations of the restinga ecotype of the cashew (Anacardium occidentale) in coastal Piauí Brazil. Plant Syst. Evol. 305, 913–924. doi: 10.1007/s00606-019-01611-4

CrossRef Full Text | Google Scholar

Ellis, J. R., Burke, J. M. (2007). EST-SSRs as a resource for population genetic analyses. Heredity 99 (2), 125–132. doi: 10.1038/sj.hdy.6801001

PubMed Abstract | CrossRef Full Text | Google Scholar

Furtado, L. B., Nascimento, R. C., Seidl, P. R., Guimarães, M. J. O., Costa, L. M., Rocha, J. C., et al. (2019). Eco-friendly corrosion inhibitors based on Cashew nut shell liquid (CNSL) for acidizing fluids. J. Mol. Liq. 284, 393–404. doi: 10.1016/j.molliq.2019.02.083

CrossRef Full Text | Google Scholar

Ge, Y., Tan, L., Wu, B., Wang, T., Zhang, T., Chen, H., et al. (2019). Transcriptome sequencing of different avocado ecotypes: de novo transcriptome assembly annotation identification and validation of EST-SSR markers. Forests 10, 411. doi: 10.3390/f10050411

CrossRef Full Text | Google Scholar

Grover, A., Sharma, P. C. (2016). Development and use of molecular markers: past and present. Crit. Rev. Biotechnol. 36, 290–302. doi: 10.3109/07388551.2014.959891

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, B., Wang, C., Tang, Z., Ren, Y., Li, Y., Zhang, D., et al. (2015). Genome-wide analysis of microsatellite markers based on sequenced database in Chinese spring wheat (Triticum aestivum L.). PloS One 10, e0141540. doi: 10.1371/journal.pone.0141540

PubMed Abstract | CrossRef Full Text | Google Scholar

INC Statistical Yearbook (2020-2021) International Nut and Dried Fruit Council (INC). Cashew. Available at: https://www.nutfruit.org/.

Google Scholar

Inglis, P. W., Pappas, M. D. C. R., Resende, L. V., Grattapaglia, D. (2018). Fast and inexpensive protocols for consistent extraction of high quality DNA and RNA from challenging plant and fungal samples for high-throughput SNP genotyping and sequencing applications. PloS One 13, e0206085. doi: 10.1371/journal.pone.0206085

PubMed Abstract | CrossRef Full Text | Google Scholar

Iquebal, M. A., Jaiswal, S., Angadi, U. B., Sablok, G., Arora, V., Kumar, S., et al. (2015). SBMDb: first whole genome putative microsatellite DNA marker database of sugarbeet for bioenergy and industrial applications. Database 2015, bav111. doi: 10.1093/database/bav111

PubMed Abstract | CrossRef Full Text | Google Scholar

Jasrotia, R. S., Yadav, P. K., Iquebal, M. A., Bhatt, S. B., Arora, V., Angadi, U. B., et al. (2019). VigSatDB: Genome-wide microsatellite DNA marker database of three species of Vigna for germplasm characterization and improvement. Database 2019, baz055. doi: 10.1093/database/baz055

PubMed Abstract | CrossRef Full Text | Google Scholar

Jena, R. C., Samal, K. C., Pal, A., Das, B. K., Chand, P. K. (2016). Genetic diversity among some promising Indian local selections and hybrids of cashew nut based on morphometric and molecular markers. Int. J. Fruit Sci., 16 (1), 69–93.

Google Scholar

Jiang, Y., Xu, S., Wang, R., Zhou, J., Dou, J., Yin, Q., et al. (2020). Characterization, validation, and cross-species transferability of EST-SSR markers developed from Lycoris aurea and their application in genetic evaluation of Lycoris species. BMC Plant Biol. 20, 1–14. doi: 10.1186/s12870-020-02727-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Khodaeiaminjan, M., Kafkas, S., Motalebipour, E. Z., Coban, N. (2018). In silico polymorphic novel SSR marker development and the first SSR-based genetic linkage map in pistachio. Tree Genet. Genomes 14, 1–14. doi: 10.1007/s11295-018-1259-8

CrossRef Full Text | Google Scholar

Liu, K., Muse, S. V. (2005). PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 21, 2128–2129. doi: 10.1093/bioinformatics/bti282

PubMed Abstract | CrossRef Full Text | Google Scholar

Luo, L., Yang, Y., Zhao, H., Leng, P., Hu, Z., Wu, J., et al. (2021). Development of EST-SSR markers and association analysis of floral scent in tree peony. Sci. Hortic. 289, 110409. doi: 10.1016/j.scienta.2021.110409

CrossRef Full Text | Google Scholar

Martina, M., Acquadro, A., Barchi, L., Gulino, D., Brusco, F., Rabaglio, M., et al. (2022). Genome-wide survey and development of the first microsatellite markers database (AnCorDB) in Anemone coronaria L. Int. J. Mol. Sci. 23, 3126. doi: 10.3390/ijms23063126

PubMed Abstract | CrossRef Full Text | Google Scholar

Mneney, E., Mantell, S., Bennett, M. (2001). Use of random amplified polymorphic DNA (RAPD) markers to reveal genetic diversity within and between populations of cashew (Anacardium occidentale L.). J. Hortic. Sci. Biotechnol. 76, 375–383. doi: 10.1080/14620316.2001.11511380

CrossRef Full Text | Google Scholar

Negi, A., Singh, K., Jaiswal, S., Kokkat, J. G., Angadi, U. B., Iquebal, M. A., et al. (2022). Rapid genome-wide location-specific polymorphic SSR marker discovery in black pepper by GBS approach. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.846937

PubMed Abstract | CrossRef Full Text | Google Scholar

Öztürk, S. C., Göktay, M., Allmer, J., Doğanlar, S., Frary, A. (2018). Development of simple sequence repeat markers in hazelnut (Corylus avellana L.) by next-generation sequencing and discrimination of Turkish hazelnut cultivars. Plant Mol. Biol. 36, 800–811. doi: 10.1007/s11105-018-1120-0

CrossRef Full Text | Google Scholar

Parida, S. K., Verma, M., Yadav, S. K., Ambawat, S., Das, S., Garg, R., et al. (2015). Development of genome-wide informative simple sequence repeat markers for large-scale genotyping applications in chickpea and development of web resource. Front. Plant Sci. 6, 645. doi: 10.3389/fpls.2015.00645

PubMed Abstract | CrossRef Full Text | Google Scholar

Patil, P. G., Singh, N. V., Parashuram, S., Bohra, A., Mundewadikar, D. M., Sangnure, V. R., et al. (2020). Genome wide identification, characterization and validation of novel miRNA-based SSR markers in pomegranate (Punica granatum L.). Plant Mol. Biol. 26 (4), 683–696. doi: 10.1007/s12298-020-00790-6

CrossRef Full Text | Google Scholar

Patil, P. G., Singh, N. V., Bohra, A., Raghavendra, K. P., Mane, R., Mundewadikar, D. M., Sharma, J., et al. (2021). Comprehensive characterization and validation of chromosome-specific highly polymorphic SSR markers from pomegranate (Punica granatum L.) cv. Tunisia genome. Frontiers in Plant Science 12, 645055.

Google Scholar

Perrier, X., Jacquemond-Collet, J. P. (2006) DARwin software. Available at: http://darwin.cirad.fr/darwin/.

Google Scholar

Portis, E., Lanteri, S., Barchi, L., Portis, F., Valente, L., Toppino, L., et al. (2018). Comprehensive characterization of simple sequence repeats in eggplant (Solanum melongena L.) genome and construction of a web resource. Front. Plant Sci. 9, 401. doi: 10.3389/fpls.2018.00401

PubMed Abstract | CrossRef Full Text | Google Scholar

Qin, Y., Shi, G., Sun, Y. (2013). Evaluation of genetic diversity in Pampus argenteus using SSR markers. Genet. Mol. Res. 12, 5833–5841. doi: 10.4238/2013.November.22.10

PubMed Abstract | CrossRef Full Text | Google Scholar

Qin, Z., Wang, Y., Wang, Q., Li, A., Hou, F., Zhang, L. (2015). Evolution analysis of simple sequence repeats in plant genome. PloS One 10 (12), e0144108. doi: 10.1371/journal.pone.0144108

PubMed Abstract | CrossRef Full Text | Google Scholar

Ravishankar, K. V., Bommisetty, P., Bajpai, A., Srivastava, N., Mani, B. H., Vasugi, C., et al. (2015). Genetic diversity and population structure analysis of mango (Mangifera indica) cultivars assessed by microsatellite markers. Trees 29, 775–783. doi: 10.1007/s00468-015-1155-x

CrossRef Full Text | Google Scholar

Samal, S., Rout, G. R., Nayak, S., Nanda, R. M., Lenka, P. C., Das, P. (2003). Primer screening and optimization for RAPD analysis of cashew. Biol. Plant 46 (2), 301–304. doi: 10.1023/A:1022827416677

CrossRef Full Text | Google Scholar

Savadi, S., Muralidhara, B. M., Venkataravanappa, V., Adiga, J. D., Manjunatha, K., Patil, B. (2022a). De novo transcriptome assembly and its utility in development and characterization of the first set of genic SSR markers in cashew. Industrial Crops and Products 189, 115734. doi: 10.1016/j.scienta.2023.112233

CrossRef Full Text | Google Scholar

Savadi, S., Adiga, J. D., Muralidhara, B. M., Prasad, P., Manjunatha, K., Ashwitha, K., Manoj, K., et al. (2023). Discovery of genome-wide genetic variations and development of first set of InDel markers for genetics research in cashew. Sci. Hortic. 320, 112233. doi: 10.1016/j.scienta.2023.112233

CrossRef Full Text | Google Scholar

Savadi, S., Muralidhara, B. M., Godwin, J., Adiga, J. D., Mohana, G. S., Eradasappa, E., et al. (2022b). De novo assembly and characterization of the draft genome of the cashew (Anacardium occidentale L.). Sci. Rep. 12, 18187. doi: 10.1038/s41598-022-22600-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Savadi, S., Muralidhara, B. M., Preethi, P. (2020a). Advances in genomics of cashew tree: molecular tools and strategies for accelerated breeding. Tree Genet. Genomes 16, 1–15. doi: 10.1007/s11295-020-01453-z

CrossRef Full Text | Google Scholar

Savadi, S., Prasad, P., Sharma, K., Rathore, R., Bhardwaj, S. C., Gangwar, O. P., et al. (2020b). Development of novel transcriptome-based SSR markers in Puccinia triticina and their potential application in genetic diversity studies. Trop. Plant Pathol. 45, 499–510. doi: 10.1007/s40858-020-00347-8

CrossRef Full Text | Google Scholar

Sharma, P., Gaur, V. K., Sirohi, R., Larroche, C., Kim, S. H., Pandey, A. (2020). Valorization of cashew nut processing residues for industrial applications. Ind. Crops Prod. 152, 112550. doi: 10.1016/j.indcrop.2020.112550

CrossRef Full Text | Google Scholar

Shi, Y., Kamer, P. C., Cole-Hamilton, D. J. (2019). Synthesis of pharmaceutical drugs from cardanol derived from cashew nut shell liquid. Green Chem. 21, 1043–1053. doi: 10.1039/C8GC03823F

CrossRef Full Text | Google Scholar

Silfverberg-Dilworth, E., Matasci, C. L., Van de Weg, W. E., Van Kaauwen, M. P. W., Walser, M., Kodde, L. P., et al. (2006). Microsatellite markers spanning the apple (Malus x domestica Borkh.) genome. Tree Genet.Genomes 2, 202–224. doi: 10.1007/s11295-006-0045-1

CrossRef Full Text | Google Scholar

Soares, T. N., Sant’ana, L. D. L., Oliveira, L. K. D., Telles, M. P. D. C., Collevatti, R. G. (2013). Transferability and characterization of microsatellite loci in Anacardium humile A. St. Hil.(Anacardiaceae). Genet. Mol. Res. 12, 3146–3149. doi: 10.4238/2013.January.4.24

PubMed Abstract | CrossRef Full Text | Google Scholar

Sonah, H., Deshmukh, R. K., Sharma, A., Singh, V. P., Gupta, D. K., Gacche, R. N., et al. (2011). Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in Brachypodium. PloS One 6, e21298. doi: 10.1371/journal.pone.0021298

PubMed Abstract | CrossRef Full Text | Google Scholar

Song, X., Li, N., Guo, Y., Bai, Y., Wu, T., Yu, T., et al. (2021). Comprehensive identification and characterization of simple sequence repeats based on the whole-genome sequences of 14 forest and fruit trees. For. Res. 1, 1–10. doi: 10.48130/FR-2021-0007

CrossRef Full Text | Google Scholar

Sun, L., Yang, W., Zhang, Q., Cheng, T., Pan, H., Xu, Z., et al. (2013). Genome-wide characterization and linkage mapping of simple sequence repeats in mei (Prunus mume Sieb. et Zucc.). PloS One 8, e59562. doi: 10.1371/journal.pone.0059562

PubMed Abstract | CrossRef Full Text | Google Scholar

Taheri, S., Lee, A. T., Yusop, M. R., Hanafi, M. M., Sahebi, M., Azizi, P., et al. (2018). Mining and development of novel SSR markers using next generation sequencing (NGS) data in plants. Molecules 23, 399. doi: 10.3390/molecules23020399

PubMed Abstract | CrossRef Full Text | Google Scholar

Talbiersky, J., Polaczek, J., Ramamoorty, R., Shishlov, O. (2009). Phenols from cashew nut shell oil as a feedstock for making resins and chemicals. Oil Gas Eur. Mag. 35, 33–39.

Google Scholar

Tan, C., Zhang, H., Chen, H., Guan, M., Zhu, Z., Cao, X., et al. (2023). First report on development of genome-wide microsatellite markers for stock (Matthiola incana L.). Plants 12, 748. doi: 10.3390/plants12040748

PubMed Abstract | CrossRef Full Text | Google Scholar

Thimmappaiah, Santhosh, W. G., Shobha, D., Melwyn, G. S. (2009). Assessment of genetic diversity in cashew germplasm using RAPD and ISSR markers. Sci. Hortic. 120, 411–417. doi: 10.1016/j.scienta.2008.11.022

CrossRef Full Text | Google Scholar

Vieira, M. L. C., Santini, L., Diniz, A. L., Munhoz, C. D. F. (2016). Microsatellite markers: what they mean and why they are so useful. Genet. Mol. Biol. 39, 312–328. doi: 10.1590/1678-4685-GMB-2016-0027

PubMed Abstract | CrossRef Full Text | Google Scholar

Visalakshi, M., Jawaharlal, M., Ganga, M. (2011). “Cashing in on cashew,” in I International symposium on cashewnut, Vol. 1080. 103–107.

Google Scholar

Wang, H., Gao, S., Liu, Y., Wang, P., Zhang, Z., Chen, D. (2022). A pipeline for effectively developing highly polymorphic simple sequence repeats markers based on multi-sample genomic data. Ecol. Evol. 12, e8705. doi: 10.1002/ece3.8705

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Yang, S., Chen, Y., Zhang, S., Zhao, Q., Li, M., et al. (2018). Comparative genome-wide characterization leading to simple sequence repeat marker development for Nicotiana. BMC Genom. 19, 500. doi: 10.1186/s12864-018-4878-4

CrossRef Full Text | Google Scholar

Zhang, Q., Li, J., Zhao, Y., Korban, S. S., Han, Y. (2012a). Evaluation of genetic diversity in Chinese wild apple species along with apple cultivars using SSR markers. Plant Mol. Biol. 30, 539–546. doi: 10.1007/s11105-011-0366-6

CrossRef Full Text | Google Scholar

Zhang, Q., Ma, B., Li, H., Chang, Y., Han, Y., Li, J., et al. (2012b). Identification characterization and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple. BMC Genom. 13, 1–12. doi: 10.1186/1471-2164-13-537

CrossRef Full Text | Google Scholar

Zhao, S., Guan, L., Yan, Z., He, N., Lu, X., Liu, W. (2012b). Fingerprinting of Zhengkang triploid seedless watermelon varieties using SSR markers. China Cucurbits Vegetables 25, 1–4.

Google Scholar

Zhao, Y., Prakash, C. S., He, G. (2012a). Characterization and compilation of polymorphic simple sequence repeat (SSR) markers of peanut from public database. BMC Res. Notes 5, 1–7. doi: 10.1186/1756-0500-5-362

PubMed Abstract | CrossRef Full Text | Google Scholar

Ziya Motalebipour, E., Kafkas, S., Khodaeiaminjan, M., Coban, N., Gozel, H. (2016). Genome survey of pistachio (Pistacia vera L.) by next generation sequencing: Development of novel SSR markers and genetic diversity in Pistacia species. BMC Genom. 17, 998. doi: 10.1186/s12864-016-3359-x

CrossRef Full Text | Google Scholar

Keywords: microsatellite, genome wide, cashew nut, database, validation

Citation: Savadi S, Muralidhara BM, Venkataravanappa V and Adiga JD (2023) Genome-wide survey and characterization of microsatellites in cashew and design of a web-based microsatellite database: CMDB. Front. Plant Sci. 14:1242025. doi: 10.3389/fpls.2023.1242025

Received: 18 June 2023; Accepted: 31 July 2023;
Published: 21 August 2023.

Edited by:

Yi-Hong Wang, University of Louisiana at Lafayette, United States

Reviewed by:

Yupeng Pan, Northwest A&F University, China
Bin Bai, Gansu Academy of Agricultural Sciences (CAAS), China

Copyright © 2023 Savadi, Muralidhara, Venkataravanappa and Adiga. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Siddanna Savadi, c2lkZGFubmFzYXZhZGlAZ21haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.