- 1Division of Vegetable Science, ICAR-Indian Agricultural Research Institute, New Delhi, India
- 2Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
- 3ICAR-National Institute of Plant Biotechnology, New Delhi, India
Cucumber is an extremely perishable vegetable; under room conditions, the fruits become unfit for consumption 2–3 days after harvesting. One natural variant, DC-48, with an extended shelf-life was identified, the fruits of which can be stored for up to 10–15 days at room temperature. The genes involved in this economically important trait are regulated by non-coding RNAs. The study aims to identify the long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) using two contrasting genotypes, DC-48 and DC-83, at two different fruit developmental stages. The upper epidermis of the fruits was collected at 5 days and 10 days after pollination (DAP) for high-throughput RNA sequencing. Differential expression analysis was performed to identify differentially expressed (DE) lncRNAs and circRNAs, along with network analysis of lncRNA, miRNA, circRNA, and mRNA interactions. A total of 97 DElncRNAs were identified, of which 18 were common to both developmental stages (8 downregulated and 10 upregulated). Based on the back-spliced reads, 238 circRNAs were found to be distributed uniformly throughout the cucumber genome, with the highest number (71) on chromosome 4. The majority of the circRNAs (49%) were exonic in origin, followed by intergenic (47%) and intronic (4%) origin. The genes related to fruit firmness, namely, polygalacturonase, expansin, pectate lyase, and xyloglucan glycosyltransferase, were present in the target sites and co-localized networks, indicating the role of the lncRNAs and circRNAs in their regulation. Genes related to fruit ripening, namely, trehalose-6-phosphate synthase, squamosa promoter binding protein, WRKY domain transcription factors, MADS box proteins, abscisic stress ripening inhibitors, and different classes of heat shock proteins (HSPs), were also found to be regulated by the identified lncRNAs and circRNAs. Besides, ethylene biosynthesis and chlorophyll metabolism were also found to be regulated by DElncRNAs and circRNAs. 
A total of 17 transcripts were also successfully validated through RT-PCR. These results would help breeders to understand the complex molecular network and regulatory role of the lncRNAs and circRNAs in determining the shelf-life of cucumbers.
Introduction
In recent times, the functions of a class of RNAs with little or no protein-coding potential have become an area of interest with the advancement and wide application of next-generation sequencing technologies (Liu et al., 2015). Recent advancements in high-throughput RNA-sequencing technologies have played a key role in the identification and understanding of the role of several groups of novel non-coding RNAs (ncRNAs) in different organisms, including plants (Ravasi et al., 2006). Emerging cutting-edge technologies in next-generation sequencing, along with bioinformatics tools, have revolutionized the field of biological sciences by making it possible to understand and explore the functions of all the important groups involved in gene regulation at the transcriptional, post-transcriptional, post-translational, and even epigenetic levels (Li et al., 2018). Novel long non-coding RNAs (lncRNAs) were discovered and advocated as one of the emerging players in important biological processes, growth, and development, besides stress response (Yu et al., 2019; Budak et al., 2020; Jha et al., 2020). LncRNAs, with a length of more than 200 nt, are reported to be involved in a wide variety of processes such as splicing, gene inactivation, and translation (Costa, 2005; Liu et al., 2013; Ma et al., 2013). LncRNAs are transcribed by RNA polymerase II or III, and additionally, by polymerase IV/V in plants (Dinger et al., 2008, 2009; Wierzbicki et al., 2008). LncRNAs form the largest class of diverse RNAs acting as ‘biological regulators’, with a regulatory role at the transcriptional and post-transcriptional levels (Bohmdorfer and Wierzbicki, 2015; Chekanova, 2015). The regulatory role of the lncRNAs in an array of biological processes in plants has been elaborated by different research groups (Franco-Zorrilla et al., 2007; Nejat and Mantri, 2018; Shin et al., 2018; Corona-Gomez et al., 2020; Sun Z. et al., 2020).
Besides lncRNAs, another class of ncRNAs called circular RNAs (circRNAs) has been projected as one of the important players in regulating biological processes through transcription and genome imprinting. The first circRNA was identified by Sanger et al. (1976), and circRNAs were initially regarded as byproducts of abnormal splicing with negligible functional potential. Back-splicing of exons from precursor mRNA is responsible for the production of circRNAs. The RNA circle is formed by the connection of the downstream 5′ splice site with the upstream 3′ splice site and ligation by a 3′–5′ phosphodiester bond at the site of connection (Chen, 2016). Characterization of circRNAs shows that they can originate from introns, exons, and intergenic regions and that their expression pattern is specific to cells, tissues, and developmental stages. Because of the higher stability of circRNAs in comparison to conventional RNAs, they are more likely to be involved in various biological processes (Li et al., 2017).
Cucumber (Cucumis sativus L.) is one of the most important vegetable crops cultivated worldwide and serves as a model plant species for genetic and genomic studies because of its rich genomic resources. It is known for its wide therapeutic and pharmacological applications. It is used as an antidiabetic agent with lipid-lowering and antioxidant properties (Mukherjee et al., 2013). Smaller genome size, distinct sex expression, and worldwide distribution facilitate detailed genomics studies of several complex mechanisms regulating different biological processes in cucumber. Shelf-life is an important trait determining the storability and transport of highly perishable crops like most vegetables. The various measures widely adopted to improve the shelf life of the harvested produce in high-income economies are not popular in low-income developing countries because the high-cost intervention affects the final retail price of the fresh produce (Friedman et al., 2020). It is estimated that about 30% of the fresh fruits harvested are lost because of post-harvest deterioration, ripening, and decay (Gajanana et al., 2011; Kasso and Bekele, 2018). Therefore, reducing the post-harvest loss of perishable crops like fruits and vegetables is a global mission for food availability and reducing hunger (Porat et al., 2018). Among the vegetable crops, fruit ripening and shelf-life are better understood in the climacteric crop tomato, through detailed translational studies. However, ripening, shelf-life, and the biological processes involved in post-harvest decay are poorly understood in a model non-climacteric crop like cucumber. A natural variant, DC-48, with extended shelf-life was isolated at the Division of Vegetable Science, ICAR-Indian Agricultural Research Institute, New Delhi (28.6377° N, 77.1571° E). 
This genotype can be stored under room conditions, without any low-temperature storage, for up to 10 days after harvesting without loss of the fresh green color and fruit firmness. However, the detailed physiological, biochemical, and molecular networks associated with the extended shelf-life of DC-48 are yet to be explored. Identifying the roles of lncRNAs and circRNAs associated with this extremely important trait in the genotype DC-48 will provide insight into the complex mechanisms associated with the extended shelf-life. Another, contrasting genotype, DC-83, which has a very poor shelf life and becomes unfit for marketing within 2–3 days after harvesting (Pradeepkumara et al., 2022), was taken along with DC-48 for the present study to understand the role of lncRNAs and circRNAs in regulating the shelf-life of cucumber. In most vegetable crops, it is widely known that pre-harvest metabolic processes driven by the genetic makeup of the mother plant play an important role in determining the post-harvest behavior of the fruits (Weston and Barth, 1997; Arah et al., 2015). It was also evident from studies in a related crop, melon, that early fruit developmental stages are critical in determining the post-harvest physiology of the fruits (Saladié et al., 2015). Most cucumber fruits attain harvestable maturity at 10 days after pollination (DAP). Therefore, it would be ideal to conduct RNA-seq analysis at two stages, one at the early stage of development and another at harvesting maturity.
In cucumber, differentially expressed lncRNAs and circRNAs in response to heat stress were identified by He et al. (2020), who revealed the role of these groups of novel ncRNAs in stress response. Besides, the roles of lncRNAs and miRNAs were identified in cucumber in response to long-term waterlogging conditions (Kȩska et al., 2021). In watermelon, the pivotal role of lncRNAs and circRNAs in the defense response to cucumber green mottle mosaic virus has been reported by Sun Y. et al. (2020). Genome-wide identification of lncRNAs and circRNAs for fruit ripening has been reported in a number of crops like C. melo (Tian et al., 2019) and strawberry (Tang et al., 2021). However, to the best of our knowledge, identification of lncRNAs and circRNAs in relation to extended shelf-life in cucumber has not yet been reported. Therefore, this study was conducted to identify and understand the role of lncRNAs, circRNAs, and complex molecular networks in relation to extended shelf life in two contrasting cucumber genotypes, namely DC-48 and DC-83.
Materials and methods
Plant material and sample preparation
Two contrasting genotypes, DC-48 (better keeping quality) and DC-83 (poor keeping quality), were grown in the research field of the Division of Vegetable Science, ICAR-Indian Agricultural Research Institute, New Delhi (28.6377° N, 77.1571° E) under protected conditions using insect-proof net during the spring-summer season of 2019. The genotypes DC-48 and DC-83 are slicing-type cucumbers and belong to the same species, C. sativus L. The description of the genotypes and their detailed characterization for keeping quality are discussed in our earlier report (Pradeepkumara et al., 2022). Suitable cultural practices were followed for raising a healthy crop. Fully developed fruits at different developmental stages after pollination from healthy plants were used for determining the shelf life and for whole genome RNA-sequencing. Samples were collected at 5 DAP and 10 DAP using normally developed fruits for RNA-seq. Sampling of the fruits and isolation of RNA were done in triplicate as per the procedure discussed by Pradeepkumara et al. (2022). Uniformly peeled surface tissues of the fruit epicarp (2 mm) were used for RNA extraction and high-throughput sequencing. The post-harvest biology of cucumber fruits is predominantly determined by the epicarp structure and texture; the epicarp was therefore considered the ideal tissue for RNA-seq analysis. TRIzol reagent (Invitrogen, Waltham, MA, United States) was used for the extraction of total RNA, and the concentration and quality of the extracted RNA were determined using a Bio-analyzer (Agilent, United Kingdom). Whole genome RNA sequencing was performed using 100 ng μl⁻¹ extracted RNA (Illumina HiSeq X10). For the quantitative real-time polymerase chain reaction (qPCR) analysis, RNA was treated with DNaseI from the DNA-free kit (Promega Corporation, United States) and then checked by PCR to ensure that there was no contaminating DNA. 
First-strand cDNA was synthesized using the High-Capacity cDNA Reverse Transcription Kit as per the manufacturer’s protocol (Promega Corporation, Madison, WI, United States).
Data pre-processing and assembly
The paired-end Illumina HiSeq reads of 2 × 151 bp, generated from two contrasting genotypes of cucumber, i.e., DC-48 (extended shelf life; label: 1) and DC-83 (poor shelf life; label: 2), at two developmental stages, viz., 5 DAP (days after pollination; labeled as A) and 10 DAP (labeled as B), in triplicate, were pre-processed. The read quality was checked using FastQC. The raw reads were pre-processed using Trimmomatic v0.39 (Bolger et al., 2014) to remove low-quality reads and adapter sequences. The filters used were read length ≤ 36, poor quality ≤ 3, and HEADCROP: 10 bases. After pre-processing, the high-quality reads were used for the construction of a de novo transcriptome assembly using Trinity software (Haas et al., 2013), resulting in 186,184 transcripts.
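As a rough illustration of how the read-level filters listed above act on a single read, the sketch below applies a 10-base head crop, trims low-quality ends, and enforces a minimum length. It assumes that the stated filters correspond to Trimmomatic-style HEADCROP, LEADING/TRAILING, and MINLEN steps applied in that order; the function name, the step order, and the boundary handling are assumptions made for illustration and do not reproduce the actual command line used in the study.

```python
def preprocess_read(seq, quals, headcrop=10, min_qual=3, min_len=36):
    """Return the trimmed (seq, quals) pair, or None if the read is dropped.

    seq   : read sequence as a string
    quals : per-base Phred quality scores (list of ints, same length as seq)
    All thresholds are illustrative defaults taken from the text above.
    """
    # HEADCROP-like step: remove a fixed number of bases from the 5' end.
    seq, quals = seq[headcrop:], quals[headcrop:]

    # LEADING/TRAILING-like step: strip end bases with quality below min_qual.
    start, end = 0, len(seq)
    while start < end and quals[start] < min_qual:
        start += 1
    while end > start and quals[end - 1] < min_qual:
        end -= 1
    seq, quals = seq[start:end], quals[start:end]

    # MINLEN-like step: discard reads that became too short.
    if len(seq) < min_len:
        return None
    return seq, quals
```

For example, a 60-base read of uniformly high quality would be returned as a 50-base read, whereas a 40-base read would fall below the length cutoff after the head crop and be dropped.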
Identification of novel long non-coding RNAs
For the identification of lncRNAs, transcripts longer than 200 base pairs were considered. All 186,184 transcripts were found to be longer than 200 base pairs. Further, the transcripts with FPKM values greater than 0.5 were filtered. Studies have shown that lncRNAs in general have lower quality and shorter ORFs than protein-coding mRNAs. The ORFs in each transcript were predicted with ORF finder, and transcripts having ORFs longer than 300 nucleotides (or 100 amino acids) were removed, as lncRNAs have shorter ORFs than protein-coding mRNAs. This was followed by Blast2GO (Conesa and Götz, 2008) to filter out the already annotated sequences. A search against the Pfam database (Mistry et al., 2021) was done to identify protein families, if any. Further, the transcripts were processed through TransDecoder ver. 5.5.0, which categorized them into coding and non-coding, and the non-coding transcripts were retained. Binary classifiers, CPC2 ver. 1.0.1 (Kang et al., 2017) and PLEK ver. 1.2 (Li et al., 2014), were then employed to classify the remaining sequences into coding (score > 0) or non-coding (score < 0). In order to remove transcripts corresponding to housekeeping RNAs, namely, rRNAs, tRNAs, snoRNAs, and other ncRNAs, a BLAST search against the RNA Central database was performed, and transcripts showing ≥ 95% identity and ≤ 3 mismatches were removed to retain the lncRNAs.
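The first two sequence-based criteria above (transcript length above 200 nt and no ORF longer than 300 nt) can be sketched as follows. This is a simplified illustration, not the ORF finder tool itself: it scans only the three forward reading frames, defines an ORF as ATG through the first in-frame stop codon (stop included), and uses hypothetical function names.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf_nt(seq):
    """Length (nt) of the longest ATG...stop ORF in the three forward frames."""
    seq = seq.upper()
    best = 0
    for frame in range(3):
        start = None  # position of the currently open ATG, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                best = max(best, i + 3 - start)
                start = None
    return best


def passes_length_and_orf_filter(seq, min_len=200, max_orf=300):
    """True if the transcript is longer than min_len and has no ORF above max_orf."""
    return len(seq) > min_len and longest_orf_nt(seq) <= max_orf
```

A full implementation would also scan the reverse complement and would be followed by the annotation, Pfam, and coding-potential steps described above, which this sketch does not attempt to reproduce.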
Comparison of identified lncRNAs with known lncRNAs
The lncRNAs identified in the study were compared with the plant lncRNAs available in different databases like CANTATAdb and PLncDB. LncRNAs are known to be poorly conserved across species as compared to protein-coding mRNAs. The lncRNAs from 39 plant species available at CANTATAdb (Version 2.0) were downloaded. A local BLAST search was performed against the identified cucumber lncRNAs at a cutoff of > 70% identity. Similarly, our identified lncRNAs were searched by local BLAST against the cucumber lncRNAs from PLncDB Version 2.0, and sequences with 100% identity were reported as significant matches.
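The identity cutoff described above can be applied to BLAST results with a few lines of code. The sketch below assumes the hits were written in BLAST's default 12-column tabular format (query ID, subject ID, percent identity, and so on); the output format used in the study is not stated, and the function names and IDs are hypothetical.

```python
def filter_blast_hits(lines, min_identity=70.0):
    """Return (query, subject, pident) tuples for hits above the identity cutoff.

    Expects tab-separated lines whose first three columns are
    query ID, subject ID, and percent identity.
    """
    hits = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        query, subject, pident = fields[0], fields[1], float(fields[2])
        if pident > min_identity:
            hits.append((query, subject, pident))
    return hits


def queries_with_match(hits):
    """Unique query IDs that have at least one retained hit."""
    return {query for query, _, _ in hits}


# Hypothetical example rows (IDs and values are made up for illustration).
example_lines = [
    "lnc_demo_1\tref_demo_A\t98.5\t250\t3\t0\t1\t250\t10\t259\t1e-100\t440",
    "lnc_demo_2\tref_demo_B\t65.0\t210\t70\t2\t1\t210\t5\t214\t1e-20\t95",
    "lnc_demo_1\tref_demo_C\t71.2\t230\t60\t1\t1\t230\t8\t237\t1e-30\t120",
]
example_hits = filter_blast_hits(example_lines)
```

Setting `min_identity` just below 100 would not reproduce the 100% identity criterion used for PLncDB; for that case an equality test on the identity column would be the more direct choice.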
Identification of differentially expressed lncRNAs (DElncRNAs)
The DElncRNAs were identified from all four comparison sets (1A:1B, 1A:2A, 1B:2B, and 2A:2B). The paired-end reads obtained after pre-processing were aligned to the assembled transcripts using Bowtie2 (Langmead and Salzberg, 2012). RSEM (RNA-Seq by expectation maximization) was used to measure the transcript abundance of genes and isoforms for each set and to calculate the expression (Li and Dewey, 2011). EdgeR (Empirical Analysis of Digital Gene Expression in R) was used for expression analysis of transcripts from inter-varietal and intra-varietal developmental stages at stringent parameters (FDR < 0.001, p < 0.05, and log2 fold change of ±1) (Robinson et al., 2010). The differentially expressed genes (DEGs) obtained from the assembled transcripts were further compared with the identified lncRNAs to find the DElncRNAs.
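The final selection step, intersecting the differentially expressed transcripts with the predicted lncRNAs, might look like the sketch below. It assumes that the cutoffs are read as FDR < 0.001 and |log2 fold change| ≥ 1, and that a positive fold change means upregulation; the transcript IDs, function names, and data layout are hypothetical and not taken from the EdgeR output.

```python
def classify_transcript(log2_fc, fdr, fdr_cutoff=0.001, lfc_cutoff=1.0):
    """Return 'up', 'down', or None for a single transcript."""
    if fdr >= fdr_cutoff or abs(log2_fc) < lfc_cutoff:
        return None
    return "up" if log2_fc > 0 else "down"


def select_delncrnas(de_results, lncrna_ids):
    """Keep differentially expressed transcripts that are also predicted lncRNAs.

    de_results : dict mapping transcript ID -> (log2 fold change, FDR)
    lncrna_ids : set of transcript IDs predicted to be lncRNAs
    """
    selected = {}
    for transcript_id, (log2_fc, fdr) in de_results.items():
        call = classify_transcript(log2_fc, fdr)
        if call is not None and transcript_id in lncrna_ids:
            selected[transcript_id] = call
    return selected


# Made-up values for one comparison set.
example_results = {
    "tx_demo_1": (2.3, 1e-5),
    "tx_demo_2": (-1.8, 2e-4),
    "tx_demo_3": (0.4, 1e-6),   # fold change too small
    "tx_demo_4": (3.0, 0.02),   # FDR too high, and not an lncRNA
}
example_lncrnas = {"tx_demo_1", "tx_demo_2", "tx_demo_3"}
example_delncrnas = select_delncrnas(example_results, example_lncrnas)
```

Running the same function once per comparison set (1A:1B, 1A:2A, 1B:2B, and 2A:2B) would give the per-set lists from which shared and unique DElncRNAs can be derived.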
Identification of the lncRNAs interacting miRNAs and their mRNA targets
In order to check whether the identified lncRNAs could potentially be targeted by miRNAs or could act as target mimics of miRNAs, psRNATarget (Dai et al., 2018) was used. Here, lncRNA sequences were used as input for target candidates. Matches with expectation ≤ 3 were considered significant in our study. To predict endogenous target mimics (eTMs), TAPIR (Bonnet et al., 2010) was used with an MFE ratio ≥ 0.6 (Bouba et al., 2019).
For creating the lncRNA-miRNA-mRNA network, the mRNA targets of the identified miRNAs were predicted. For this, psRNATarget was run with identified miRNA sequences and cucumber cDNA sequences available at the server as input. Matches with expectation ≤ 3 were considered significant.
Identification of mRNA targets of DElncRNA
LncRNAs can target mRNAs and affect gene expression. Both cis and trans targets were identified in the cucumber reference genome. Cis-targets were identified by searching a 100 kb window upstream and downstream of the identified lncRNAs using the window-bed option of Bedtools. The trans mRNA targets for all the differentially expressed lncRNAs were identified using lncTAR (Li et al., 2015) with cucumber cDNA sequences. A high normalized deltaG (nDG) threshold (−0.20) was set to obtain high-confidence lncRNA-mRNA interacting pairs.
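The cis-target search is essentially an interval query: extend each lncRNA by 100 kb on both sides and report genes on the same chromosome that overlap the extended interval. The sketch below illustrates that idea only; it ignores strand, uses closed intervals, and does not reproduce Bedtools' exact coordinate conventions. All names and coordinates are hypothetical.

```python
def cis_targets(lncrna, genes, window=100_000):
    """Names of genes overlapping the lncRNA interval extended by `window` bp.

    lncrna : (chromosome, start, end)
    genes  : dict mapping gene name -> (chromosome, start, end)
    """
    chrom, start, end = lncrna
    lo, hi = max(0, start - window), end + window
    return sorted(
        name
        for name, (g_chrom, g_start, g_end) in genes.items()
        if g_chrom == chrom and g_start <= hi and g_end >= lo
    )


# Made-up coordinates for illustration.
example_lncrna = ("chr_demo", 500_000, 501_000)
example_genes = {
    "gene_near": ("chr_demo", 450_000, 455_000),
    "gene_edge": ("chr_demo", 601_000, 605_000),   # starts exactly at the window edge
    "gene_far": ("chr_demo", 700_000, 705_000),
    "gene_other_chrom": ("chr_other", 500_100, 500_900),
}
example_cis = cis_targets(example_lncrna, example_genes)
```

For genome-scale data, a sorted or indexed interval structure would be preferable to this linear scan, which is kept simple here for readability.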
Identification of circular RNA and differentially expressed CircRNA (DEcircRNAs)
To identify the circRNAs, the high-quality clean reads obtained after pre-processing were used. These reads were aligned against the C. sativus reference genome using BWA (v0.7.17, mem -T 20) for use with the circRNA identification tool CIRI2 (v2.1.1) (Gao et al., 2018). The resulting SAM file was then inspected by the CIRI2 core program to identify putative circular RNAs. The differentially expressed circular RNAs (DEcircRNAs) were obtained from all four comparison sets (1A:1B, 1A:2A, 1B:2B, and 2A:2B) by comparing the identified circRNAs to the DEGs obtained from the assembled transcripts at the same parameters, i.e., FDR < 0.001, p < 0.05, and log2 fold change of ±1 (Robinson et al., 2010).
Analysis of lncRNA, miRNA, circRNA, and mRNA network interactions
The roles of lncRNAs were studied by constructing a miRNA–lncRNA–mRNA network based on differentially expressed lncRNAs and miRNAs, and the target pairs of miRNAs–lncRNAs, miRNAs–mRNAs, lncRNAs–mRNAs and circRNA-miRNA. The regulatory networks contained miRNAs, lncRNAs acting as miRNA targets, mRNAs acting as lncRNA targets, and mRNAs acting as miRNA targets, and also circRNA acting as miRNA target. Cytoscape 3.7.2 software (Shannon et al., 2003) was used to visualize the regulatory networks of miRNA–lncRNA–mRNA.
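Before visualization, the predicted target pairs can be combined into a single edge list from which interaction counts and candidate competing triplets are derived. The sketch below is one possible way to organize such data; the edge labels, IDs, and function names are hypothetical, and it is not the procedure used to build the Cytoscape networks in this study.

```python
from collections import Counter, defaultdict


def count_interactions(edges):
    """Number of edges per interaction type, e.g. 'miRNA-mRNA'."""
    return Counter(kind for _, _, kind in edges)


def cerna_triplets(edges):
    """(lncRNA, miRNA, mRNA) triplets in which the lncRNA and mRNA share a miRNA."""
    lnc_by_mirna = defaultdict(set)
    mrna_by_mirna = defaultdict(set)
    for source, target, kind in edges:
        if kind == "miRNA-lncRNA":
            lnc_by_mirna[source].add(target)
        elif kind == "miRNA-mRNA":
            mrna_by_mirna[source].add(target)
    return sorted(
        (lnc, mirna, mrna)
        for mirna, lncs in lnc_by_mirna.items()
        for lnc in lncs
        for mrna in mrna_by_mirna.get(mirna, ())
    )


# Made-up edges: (source, target, interaction type).
example_edges = [
    ("miR_a", "lnc_1", "miRNA-lncRNA"),
    ("miR_a", "mRNA_1", "miRNA-mRNA"),
    ("miR_a", "mRNA_2", "miRNA-mRNA"),
    ("miR_b", "mRNA_3", "miRNA-mRNA"),
    ("miR_b", "circ_1", "miRNA-circRNA"),
    ("lnc_1", "mRNA_1", "lncRNA-mRNA"),
]
example_counts = count_interactions(example_edges)
example_triplets = cerna_triplets(example_edges)
```

The per-type counts correspond to the kind of summary reported for each comparison set, while the triplets indicate where an lncRNA and an mRNA could compete for the same miRNA.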
Quantitative real-time PCR validation of identified lncRNAs
A total of 17 differentially expressed transcripts were selected for validation using qRT-PCR at different developmental stages of the two contrasting genotypes. The primers were designed using IDT PrimerQuest software. cDNA was synthesized from DNase-treated total RNA (2 μg) using the GoScript™ reverse transcription system kit (Promega, USA) as per the manufacturer’s instructions and diluted 20 times with nuclease-free water. The qRT-PCR was performed on a LightCycler 96 Real-Time PCR system (Roche, Indianapolis, IN, United States) in a final volume of 10 μl containing 1 μl diluted cDNA (200 ng), 5 μl 2× SYBR Green (GoTaq qPCR system, United States), 0.4 μl each of forward and reverse primer (10 μM), and 3.2 μl RNase-free water as per the manufacturer’s instructions, with three biological replicates. The thermal cycling conditions were as follows: 95°C for 1 min followed by 40 cycles of 95°C for 10 s, 58°C for 30 s, and 72°C for 30 s. Relative gene expression was determined using the 2^(−ΔΔCT) method by normalizing to Actin gene expression. The primers used for qRT-PCR validation, along with their descriptions, are listed in Supplementary Table 1.
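The relative expression calculation follows the standard 2^(−ΔΔCT) formula: the target CT is first normalized to the reference gene within each sample, and the difference between the test and control samples is then converted to a fold change. The sketch below shows the arithmetic; the function and argument names are illustrative, and the CT values in the usage note are made up.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change by the 2^(-ddCT) method.

    ct_target_* : CT of the transcript of interest
    ct_ref_*    : CT of the reference gene (Actin in this study)
    *_sample    : test condition; *_control : calibrator condition
    """
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_control = ct_target_control - ct_ref_control
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2 ** (-delta_delta_ct)
```

For instance, with hypothetical CT values of 24 (target) and 18 (reference) in the test sample and 26 and 18 in the control, ΔΔCT is −2 and the relative expression is 4-fold. In practice, the CT values of the three biological replicates would be averaged or analyzed separately before this step.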
Development of web genomic resources of cucumber lncRNAs
The Long non-coding RNA C. sativus Extended shelf-life Database (LncR-CsExSLDb) is a relational database based on a “three-tier architecture” with client, middle, and database tiers. It catalogs the predicted lncRNAs, circular RNAs, DElncRNAs, DEcircRNAs, lncRNA targets of miRNAs, mRNA targets of miRNAs, mRNA targets of lncRNAs, and circRNA targets of miRNAs in the cucumber (C. sativus) transcriptome. All the data are stored in MySQL tables in the database tier. LncR-CsExSLDb provides various information, such as the differentially expressed lncRNAs (DElncRNAs), the miRNAs that could target the predicted lncRNAs, and the mRNA targets of lncRNAs. It also provides information on the mRNA targets of miRNAs. Users can also retrieve the predicted putative circular RNAs. The miRNAs that can possibly target circRNAs are also listed. For database browsing, web pages were developed in HTML, along with CSS and JavaScript, in the client tier. This web-genomic resource is freely available for academic use at http://webtom.cabgrid.res.in/lncrcsexsldb.
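To make the database tier more concrete, the sketch below shows how relational tables of this kind could be queried for the miRNAs that target DElncRNAs. It is purely illustrative: it uses Python's built-in SQLite module as a stand-in for MySQL, and the table names, column names, and records are hypothetical and do not reflect the actual LncR-CsExSLDb schema.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical two-table schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE lncrna (
            lncrna_id TEXT PRIMARY KEY,
            is_de     INTEGER NOT NULL      -- 1 if differentially expressed
        );
        CREATE TABLE mirna_lncrna (
            mirna_id  TEXT NOT NULL,
            lncrna_id TEXT NOT NULL REFERENCES lncrna(lncrna_id)
        );
        """
    )
    conn.executemany(
        "INSERT INTO lncrna VALUES (?, ?)",
        [("lnc_demo_1", 1), ("lnc_demo_2", 0)],
    )
    conn.executemany(
        "INSERT INTO mirna_lncrna VALUES (?, ?)",
        [("miR_demo_a", "lnc_demo_1"),
         ("miR_demo_b", "lnc_demo_1"),
         ("miR_demo_a", "lnc_demo_2")],
    )
    return conn


def mirnas_targeting_delncrnas(conn):
    """Distinct miRNA IDs with at least one differentially expressed lncRNA target."""
    rows = conn.execute(
        """
        SELECT DISTINCT m.mirna_id
        FROM mirna_lncrna AS m
        JOIN lncrna AS l ON l.lncrna_id = m.lncrna_id
        WHERE l.is_de = 1
        ORDER BY m.mirna_id
        """
    ).fetchall()
    return [row[0] for row in rows]
```

In a three-tier setup, a query of this kind would typically be issued by the middle tier in response to a request from the browser-based client tier.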
Results
Data pre-processing and assembly
Approximately 0.03% of the generated paired-end Illumina reads from the two contrasting cucumber varieties, DC-48 (extended shelf-life) and DC-83 (poor shelf-life), were dropped in all four sets of data, i.e., 1A, 1B, 2A, and 2B. The de novo assembly of the pooled 147 MB clean reads using Trinity yielded around 186K transcripts with an N50 of 2.9 kbp and a GC content of 38.89%. Almost 92% of these assembled transcripts were mapped to the available C. sativus genome.
Identification of novel long non-coding RNAs
All the assembled transcripts having a length > 200 base pairs were further filtered based on the removal of transcripts having FPKM ≥ 0.5, leading to the retention of 77,223 transcripts. Further, after ORF prediction in all the transcripts using ORF Finder, the transcripts having ORFs longer than 300 bp were removed, leaving 41,712 transcripts for further analysis. After screening these sequences for similarity with already annotated sequences through BLAST2GO Pro ver. 3.1, a total of 23,816 sequences were retained for further analysis, after discarding 17,896 significantly similar sequences. The output of the PFAM search was parsed through TransDecoder, which identified a total of 351 transcripts with coding potential; these were removed. The remaining 23,465 transcripts were analyzed for coding potential using CPC2 and PLEK, which identified 2 and 173 of these, respectively, as having coding potential; these were discarded. Finally, 23,290 non-coding transcripts were identified (Supplementary Table 2).
Housekeeping RNAs like tRNAs, rRNAs, and other non-coding RNAs were also removed. A sequence search against the RNA Central database identified 1,219 housekeeping RNAs, which were removed, while all the matches with lncRNAs were retained. After filtering, a total of 22,071 transcripts were finally retained for all further analysis (Supplementary Table 3). The workflow of lncRNA prediction is shown in Figure 1.
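The counts reported at each filtering step can be checked with simple running subtraction, starting from the 41,712 transcripts retained after the ORF filter. The short sketch below reproduces that arithmetic using only the numbers stated above; the function name is illustrative.

```python
def running_counts(start, removals):
    """Transcript count remaining after each successive removal step."""
    counts = [start]
    for removed in removals:
        counts.append(counts[-1] - removed)
    return counts


# Removals, in order: annotated sequences (Blast2GO), TransDecoder coding calls,
# CPC2 + PLEK coding calls, and housekeeping RNAs (RNA Central).
reported = running_counts(41712, [17896, 351, 2 + 173, 1219])
# reported == [41712, 23816, 23465, 23290, 22071]
```

Each intermediate value matches the count given in the text, ending with the 22,071 transcripts carried forward as predicted lncRNAs.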
In order to explore the trend of occurrence of lncRNAs in the cucumber genome, various characteristics of the predicted lncRNAs were examined. It was observed that the sequence length of the majority of the lncRNAs ranged between 201–400 nt (73.7%), followed by 400–800 nt (19.4%), while coding transcripts were most abundant in the length range of 1,600–3,200 nt (Figure 2). The circos plot shows the distribution of the identified lncRNAs on the 7 reference cucumber chromosomes (Figure 3). The genomic locations of lncRNAs were classified into six categories with respect to the reference annotation. Most of the lncRNAs were in the unknown lncRNAs (u) category (∼67%), followed by complete intronic (=) (∼24%), fully in reference intronic (i) (∼4%), splice junction (j) (∼3%), exonic overlap on the other strand (x) (∼1%), and exonic overlap (o) (∼1%) (Figure 4).
Comparison of identified lncRNAs with known lncRNAs
The BLAST search against CANTATAdb revealed 3,782 unique matches, representing the conserved nature of these lncRNAs across 39 plant species (Supplementary Table 4). Also, a total of 3,408 unique matches specific to cucumber lncRNAs were retrieved from the PLncDB database. After removing the redundant sequences, a total of 5,522 known lncRNAs from CANTATAdb and PLncDB were obtained, while 16,549 lncRNAs were identified as novel.
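The split into known and novel lncRNAs amounts to a set union followed by a set difference. The sketch below shows this with hypothetical IDs and then checks the reported totals; the implied overlap between the two databases is derived by inclusion-exclusion and assumes that "redundant sequences" refers to lncRNAs matched in both databases, which the text does not state explicitly.

```python
def split_known_novel(all_ids, *matched_sets):
    """Return (known, novel) ID sets given matches from one or more databases."""
    all_ids = set(all_ids)
    known = set().union(*matched_sets) & all_ids
    novel = all_ids - known
    return known, novel


# Toy example with made-up IDs.
demo_known, demo_novel = split_known_novel(
    {"a", "b", "c", "d", "e"},   # all predicted lncRNAs
    {"a", "b"},                  # matched in the first database
    {"b", "c"},                  # matched in the second database
)

# Arithmetic check against the reported totals.
total_lncrnas = 22071
known_total = 5522
novel_total = total_lncrnas - known_total        # 16549, as reported
implied_overlap = 3782 + 3408 - known_total      # 1668 under the stated assumption
```

The novel count of 16,549 follows directly from the reported totals, whereas the overlap figure is only an inference from them.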
Identification of differentially expressed lncRNAs (DElncRNAs)
A total of 4,188 unique DEGs were obtained from the four comparison sets (1,909, 1,624, 1,712, and 189 in 1A:1B, 1A:2A, 1B:2B, and 2A:2B, respectively) at the defined cutoff using EdgeR (Supplementary Table 5). These DEGs were further compared with the 22,071 identified lncRNAs to filter the DElncRNAs. A total of 23, 31, 38, and 5 lncRNAs were found to be significantly different in the four comparison sets, namely, 1A:1B, 1A:2A, 1B:2B, and 2A:2B, respectively (Table 1 and Supplementary Table 6). We found 10, 18, and 13 unique DElncRNAs in the 1A:2A, 1A:1B, and 1B:2B comparisons, respectively. The shared and unique DElncRNAs are represented in the form of a Venn diagram (Figure 5). Hierarchical clustering of the DElncRNAs as well as the samples, based on transcript abundance and shown as a heatmap of the three biological replicates per sample, is presented in Supplementary Figure 1.
Figure 5. A Venn diagram showing the distribution of differentially expressed lncRNAs in the four conditions compared in this study.
Identification of the lncRNAs interacting miRNAs and their mRNA targets
We predicted 136 (99 unique) miRNAs which could target our identified DElncRNA using psRNATarget (Supplementary Table 7). Further, the target mRNA for these identified miRNAs were identified by psRNATarget by giving miRNA sequences as input and searching against the C. sativus cDNA library (ASM407v2) available at psRNATarget. This resulted in 2001 (1228 unique) mRNAs acting as targets for miRNAs (Supplementary Table 7). Moreover, we found that 16 lncRNAs could act as endogenous target mimics (eTMs) for 15 miRNAs, making 23 possible combinations (Table 2).
These miRNA targets abundantly belonged to the class of genes associated with cell wall stability and degradation, namely, polygalacturonase, polygalacturonase-1 non-catalytic subunit beta, alpha-expansin-3, alpha-expansin-1, and expansin-S1, pectin esterase and beta galactosidase. Besides, trehalose-6-phosphate, heat shock proteins, STS14 protein, Squamosa promoter-binding protein, ethylene responsive, and ACC oxidase were also identified which are key to the ripening process and post-harvest degradation of fruits.
Identification of mRNA targets of DElncRNAs
Both the cis and trans targets of DElncRNAs were identified. In the case of cis targets, 1,124 unique target genes were found to be targets of 58 of the 69 DElncRNAs. The remaining 11 DElncRNAs could not be matched to the reference genome. In the case of trans-targets, lncTAR was used to identify the mRNA targets of the differentially expressed lncRNAs, and 2,616 unique target genes were found to be interacting with the DElncRNAs (Supplementary Table 8).
The lncRNAs with possible targets associated with shelf-life were identified. The DElncRNA TRINITY_DN10719_c0_g2_i2 was downregulated in the set 1A:2A, and its important target genes were Csa_6G363610: extensin-like, Csa_7G322600: extensin-like isoform X1, Csa_7G312940: extensin-3, and Csa_4G017050, Csa_4G291360: extensin-2-like. TRINITY_DN10873_c0_g1_i14 was another downregulated DElncRNA identified in the set 1B:2B, with targets among the genes associated with chlorophyll biosynthesis and ethylene-responsive transcription factors. TRINITY_DN11625_c4_g1_i1 was another important DElncRNA that was downregulated in 1B:2B and targeted genes associated with cell wall degradation, such as cell wall/vacuolar inhibitor of fructosidase 2 and Fimbrin-2. The downregulated lncRNAs in the combination 1B:2B were found to target genes encoding probable xyloglucan glycosyltransferase 6, which is closely associated with cell wall degradation. The DElncRNA TRINITY_DN14508_c2_g11_i3 was found to be downregulated in both the 1A:2A and 1B:2B combinations. Xyloglucan glycosyltransferase 6 (Csa_7G341210 and Csa_7G062870) was one of the important target genes of this lncRNA. These results indicated the role of the DElncRNAs in determining the shelf life of cucumber fruits through regulation of the genes associated with cell wall degradation, chlorophyll, and ethylene biosynthesis.
Identification of circular RNA and differentially expressed circRNA (DEcircRNAs)
CIRI2, the circular RNA (circRNA) identification tool, detected 2,746 junction reads from the SAM files. Based on the back-spliced reads, a total of 238 circRNAs were finally detected (Supplementary Table 9). Genomic location analysis showed that the majority of the identified circRNAs were exonic circRNAs (117, 49%), followed by intergenic circRNAs (112, 47%), and the remaining were intronic circRNAs (9, 4%), indicating that they originate from regions across the whole genome (Figure 6A). Nevertheless, similar to coding genes, circRNAs were more commonly found at both ends of chromosomes (Figure 6B). The chromosome distribution showed an abundance of circRNAs on chromosome 4 (71), followed by chromosomes 6 (35) and 3 (33) (Figure 6C). The principal target genes of the identified circRNAs are presented in Supplementary Table 9. Important targets of the identified circRNAs were genes associated with callose synthase (Csa_1G605110, Csa_4G645250), phenylalanine ammonia-lyase (Csa_2G008770), Polyadenylate-binding protein 5 (Csa_3G653460), galactokinase (Csa_3G883000), 4-alpha-glucanotransferase (Csa_4G420150), elongation factor 4 (Csa_4G437010), Probable galacturonosyltransferase 15 (Csa_6G177440), and glutamate formiminotransferase (Csa_6G124080). A total of 2,250 miRNAs were identified that could target these circRNAs (Supplementary Table 10). We identified a total of 51 DEcircRNAs, of which 41 were downregulated and 10 were upregulated under different experimental conditions (Supplementary Table 11). It was also observed that multiple circRNAs were involved in regulating a particular gene. Most of the DEcircRNAs were detected under the 1B:2B condition, indicating a greater role of these regulatory components at the marketable harvesting stage. One transcript, TRINITY_DN9807_c0_g1_i7, was upregulated under both 1A:2A (11.64) and 1B:2B (8.68) and was found to be regulated by two circRNAs, stout_110 and stout_209. 
Some of the key downregulated transcripts were TRINITY_DN14188_c4_g4_i3 and TRINITY_DN14188_c4_g3_i2, both of which were regulated by multiple circRNAs and downregulated in the 1B:2B comparison.
Figure 6. Characterization of cucumber circRNAs. (A) Pie chart representing the number and percentage of circular RNAs generated from exonic, intronic, and intergenic regions. (B) Circos plot showing the distribution of circular RNAs on the seven cucumber chromosomes. (C) Histogram showing the number of circRNAs detected on the seven cucumber chromosomes.
Analysis of lncRNA, circRNA, miRNA, and mRNA network interactions
The competing endogenous RNA (ceRNA) network was constructed for each of the four comparison datasets. Nodes represented in green, red, and blue are miRNAs, lncRNAs, and circRNAs, respectively, while those in light blue are mRNAs (Figures 7A–D). The highest numbers of miRNA-lncRNA (86) and miRNA-mRNA (1,529) interactions were observed in 1A:2A, while 1B:2B had the maximum number of lncRNA-mRNA interactions (2,874) and 1A:1B had the maximum number of miRNA-circRNA interactions in the ceRNA network (Table 3 and Supplementary Table 12). Multiple miRNAs were found to target several circRNAs (Supplementary Table 9). The circRNA stout_120#7:13224577|13227148 was recorded as one of the most common targets, as a large number of miRNAs were found to be targeting this circRNA. The miRNA-lncRNA and miRNA-mRNA interactions revealed that a large number of genes are regulated by the ncRNAs, and several genes regulated by the network interactions were associated with fruit ripening, cell wall stability and integrity, ethylene biosynthesis, and chlorophyll synthesis and degradation pathways.
For further detailed insight, the ceRNA network for one of the experimentally validated DElncRNA was constructed. Transcript id TRINITY_DN13265_c2_g1_i4 was differentially expressed under the conditions 1A:1B and 1A:2A (Figure 8).
Quantitative real-time PCR validation of identified lncRNAs
A total of 17 randomly chosen transcripts were selected for validation of the RNA-seq data through RT-PCR (Supplementary Table 1). The results indicated a high correlation between the expression patterns obtained from RNA-seq and RT-PCR for all 17 selected transcripts (Supplementary Figure 1). Among them, three were downregulated and 14 were upregulated under the different conditions in both the RNA-seq and the RT-PCR analyses. The fold changes in the two datasets followed a similar pattern, with no significant difference between them, which supports the reliability of the RNA-seq data.
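Agreement between the two methods can be quantified, for example, by correlating the log2 fold changes and counting how many transcripts change in the same direction. The Python sketch below assumes a Pearson correlation, which is not specified in this section, and uses hypothetical fold-change values rather than the data in Supplementary Figure 1.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def same_direction(x, y):
    """Count transcripts that are up (or down) regulated in both datasets."""
    return sum(1 for a, b in zip(x, y) if (a > 0) == (b > 0))


# Hypothetical log2 fold changes for four transcripts (not study data).
rnaseq_log2fc = [2.0, 4.0, -1.0, 3.0]
qpcr_log2fc = [1.0, 2.0, -0.5, 1.5]

r = pearson_r(rnaseq_log2fc, qpcr_log2fc)
concordant = same_direction(rnaseq_log2fc, qpcr_log2fc)
print(r, concordant)
```

A coefficient close to 1 together with full directional concordance would correspond to the kind of agreement described above.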
Development of web genomic resources of cucumber lncRNAs
The lncRNA C. sativus extended shelf-life database (LncR-CsExSLDb) is freely available at http://webtom.cabgrid.res.in/lncrcsexsldb. It has five tabs, namely, Home, lncRNAs, circRNAs, Download, and Contact us. LncR-CsExSLDb is user-friendly and can be easily navigated for browsing the data. The “Download” tab provides links to download all the analyzed data available in the MySQL database. The database catalogs a total of 22,071 predicted lncRNAs from cucumber. Of these, 69 have been identified as differentially expressed lncRNAs (DElncRNAs). It also catalogs 99 miRNAs that can target the identified DElncRNAs; these miRNAs can in turn target 1,228 mRNAs. A total of 3,049 mRNAs were identified as putative targets of lncRNAs. In addition, 238 circular RNAs were identified, which may be targets of 2,250 miRNAs.
Discussion
Advances in high-throughput RNA-seq, combined with suitable computational and bioinformatics approaches, have revolutionized the discovery of novel lncRNAs and their functional analysis in the last decade (Wang et al., 2009; Iyer et al., 2015). Genes associated with post-harvest biology and the decay of fruits are regulated at the transcriptional and post-transcriptional levels. LncRNAs have emerged as one of the important groups regulating the shelf-life of fruits in different crops (Tian et al., 2019). LncRNAs in plants are associated with numerous biological processes, including fruit development, ripening, gene silencing, regulation of flowering time, and abiotic and biotic stress responses, besides several other developmental pathways. The natural variant of cucumber, DC-48, with its extended shelf-life, is an ideal candidate genotype for elucidating the detailed molecular network, including the role of lncRNAs and circRNAs, that determines the post-harvest biology of the fruits. Insight into the molecular mechanisms of gene regulation underlying the enhanced shelf-life of DC-48 will provide much-needed information for further studies on genotypes with extended shelf-life.
We found that the majority of the identified lncRNAs had sequence lengths of 201–400 bp. A similar trend has been observed in grapes (Bhatia et al., 2019). In addition, the lncRNAs were distributed unevenly but across all the chromosomes. Such a chromosomal distribution has been reported for cucumber, grape, and tomato lncRNAs (Hao et al., 2015; Wang et al., 2015; Bhatia et al., 2019), suggesting that lncRNAs may be transcribed from diverse locations in the genome.
A complex network of physiological and metabolic activities is involved in the process of fruit ripening. The role of lncRNAs in the ripening of both climacteric and non-climacteric fruits has been demonstrated in tomatoes (Zhu et al., 2015), strawberries (Wang et al., 2017), pears (Wu et al., 2014), and melons (Tian et al., 2019). Evidence is emerging for a key role of lncRNAs in the ripening of different groups of fruit crops (Tang et al., 2021). Knocking down lncRNA1459 and lncRNA1840 in tomatoes resulted in delayed fruit ripening (Zhu et al., 2015). Similarly, in sea buckthorn, anthocyanin biosynthesis during fruit ripening was affected in vivo by the silencing of two lncRNAs (LNC1 and LNC2) (Zhang et al., 2018). Several lncRNAs associated with different stages of flower and fruit development have been identified in non-climacteric fruits such as strawberries through comparative transcriptomic studies (Kang and Liu, 2015). One lncRNA, FRILAIR, was recently identified in strawberries and plays a key role in fruit ripening by functioning as a non-canonical target mimic (Tang et al., 2021).
LncRNAs can act as endogenous target mimics of miRNAs, blocking the interaction between a miRNA and its target gene (Karakülah et al., 2016). The interaction between miRNAs and their target genes plays a pivotal role in regulatory gene networks in a wide variety of plants (Meng et al., 2021). Prediction with the psRNATarget server revealed that 8,183 unique miRNAs targeted the identified lncRNAs. Similar interactions have been reported in Chinese cabbage under heat stress (Wang et al., 2019), Arabidopsis under phosphate deficiency (Franco-Zorrilla et al., 2007), C. melo under powdery mildew infestation (Gao et al., 2020), and C. sativus under waterlogging conditions (Kȩska et al., 2021). Among the different targets, the important genes were extensin-like, polygalacturonase-1 non-catalytic subunit beta, WRKY domain class transcription factor, xyloglucan-specific endonuclease inhibitors, endo-1,4-beta-glucanase, and pectinesterase, which determine cell wall stability and fruit firmness. In addition, genes related to ethylene metabolism, such as 1-aminocyclopropane-1-carboxylic acid oxidase, ethylene-responsive transcription factors, and ethylene receptor CS-ETR 2, were also targeted by multiple miRNAs. Different classes of heat shock proteins (HSPs), namely, cytosolic class-II low-molecular-weight HSPs, chloroplast small HSP class-I, HSP-70, and mitochondrial small HSPs, were the principal targets. Trehalose-6-phosphate synthase, squamosa promoter binding protein, WRKY domain transcription factors, MADS box proteins, and abscisic stress ripening inhibitors were other identified miRNA targets. The miRNA target genes identified in our study, namely, trehalose-6-phosphate, heat shock proteins, STS14 protein, squamosa promoter-binding protein, ethylene-responsive genes, and ACC oxidase, are known to be key to the ripening process and post-harvest degradation of fruits (Liu et al., 2015; Upadhyay et al., 2020; Xu et al., 2020).
The results revealed that the miRNA targets were associated with genes such as those of the polygalacturonase family, alpha-expansin-3, alpha-expansin-1, expansin-S1, pectin esterase, and beta-galactosidase, which play important roles in cell wall stability and degradation (Bordenave, 1996; Kalunke et al., 2015; Yang et al., 2018; Valenzuela-Riffo et al., 2020).
A reduction in polygalacturonase beta subunit expression in tomatoes has been reported to affect pectin solubilization and degradation during fruit ripening (Watson et al., 1994). The role of the beta subunit of polygalacturonase-1 (PG-1) in fruit firmness and ripening has been described by Yang et al. (2018). In the present study, multiple miRNA target genes were associated with the beta subunit of PG-1, which might have a role in the retention of fruit firmness and the extended shelf-life of the genotype DC-48. Xyloglucan endonuclease, endo-1,4-glucanase, and pectin esterase were reported to play key roles in cell wall stability, retention of fruit firmness, and regulation of ripening in strawberries and pears (Paniagua et al., 2014; Yang et al., 2016; Witasari et al., 2019). It is well established that WRKY transcription factors (TFs) play important roles in stress responses in different plant species. Recently, the role of WRKY TFs in fruit ripening and color change has been reported in tomatoes (Wang et al., 2017). SQUAMOSA promoter binding protein-like transcription factors were found to be major players in the development and ripening of papaya fruits (Xu et al., 2020), and this group of transcription factors was widely distributed among the miRNA target genes in the present study. The trehalose-6-phosphate function is mediated through the sucrose non-fermenting-1-related kinase 1 (SnRK1) pathway (Figueroa and Lunn, 2016). In cucumber, trehalose-6-phosphate and SnRK1-mediated pathways were found to be involved in fruit setting and further development (Zhang et al., 2015). In addition, the role of the SnRK1 pathways in fruit ripening has been reported in tomatoes and apples (Wang et al., 2012; Yu et al., 2018). Heat shock proteins (HSPs) are known to be ubiquitous and highly conserved in nature, and their role in abiotic stress tolerance is well established.
Small heat shock proteins (sHSPs) have attracted attention in recent times because of their role in regulating a wide variety of developmental pathways in plants. Recently, two small HSP genes, SlHSP17.7A and SlHSP17.7B, which are localized on Chr.6 and Chr.9 in tomatoes, were found to regulate fruit development and ripening and were upregulated during the transition of the fruits from the mature green to the breaker stage (Upadhyay et al., 2020). A large number of HSPs were identified as miRNA target genes in the present study, indicating their possible role in the regulation of fruit development in the two genotypes contrasting for shelf-life. In tomatoes, MADS box proteins play an important role in the regulation of fruit ripening through ethylene synthesis, ethylene response, and ethylene perception (Yuste-Lisbona et al., 2016). Multiple MADS box genes targeted by miRNAs were identified in the present study, indicating their possible role in the regulation of ethylene biosynthesis and in determining the shelf-life of cucumber fruits.
Among the 97 DElncRNAs identified in the two genotypes at two developmental stages, the highest number was found in the comparison 1B:2B (DC-48:DC-83 at 10 days after pollination), followed by 1A:2A (DC-48:DC-83 at 5 days after pollination). In addition, 18 DElncRNAs were common to the two contrasting genotypes at both 5 and 10 days after pollination, indicating a crucial role of these lncRNAs in determining the extended shelf-life of cucumber fruits. Of these 18 common lncRNAs, 8 were downregulated and 10 were upregulated at both developmental stages. The downregulation of lncRNAs in the genotype with low shelf-life suggests that these lncRNAs act as miRNA decoys, resulting in their inhibited expression (Kȩska et al., 2021). Most of the eight downregulated lncRNAs had targets related to genes associated with fruit firmness and chlorophyll biosynthesis, which explains their role in determining shelf-life in cucumbers. Similarly, most of the lncRNAs upregulated at both developmental stages had miRNA target genes associated with ethylene biosynthesis, cell wall degradation and stability, and fruit ripening, which explains their critical role in the extended shelf-life of the natural variant DC-48. The DElncRNAs identified in the present study should be instrumental in understanding the role of lncRNAs in determining shelf-life in a wide variety of crops.
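Identifying DElncRNAs common to both developmental stages amounts to intersecting the two lists of differentially expressed transcripts and then splitting the shared transcripts by direction of change. A minimal Python sketch follows; the transcript names and log2 fold changes are hypothetical placeholders, not values from this study.

```python
def common_de(stage_a, stage_b):
    """Split transcripts that are DE at both stages by direction of change.

    Each argument maps a transcript ID to its log2 fold change.
    Returns three sorted lists: up in both, down in both, and discordant.
    """
    shared = stage_a.keys() & stage_b.keys()
    up = sorted(t for t in shared if stage_a[t] > 0 and stage_b[t] > 0)
    down = sorted(t for t in shared if stage_a[t] < 0 and stage_b[t] < 0)
    mixed = sorted(shared - set(up) - set(down))
    return up, down, mixed


# Hypothetical DE results for the 5 DAP and 10 DAP comparisons.
de_5dap = {"lnc_A": 2.1, "lnc_B": -1.8, "lnc_C": 3.0, "lnc_D": -2.2}
de_10dap = {"lnc_A": 1.5, "lnc_B": -2.4, "lnc_D": 1.9, "lnc_E": 4.2}

up_both, down_both, discordant = common_de(de_5dap, de_10dap)
print(up_both, down_both, discordant)
```

Keeping discordant transcripts in a separate group avoids counting an lncRNA as consistently regulated when its direction of change differs between stages.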
In the past few years, the regulatory role of circRNAs in different biological processes has been established at the transcriptional and post-transcriptional levels. The function of circRNAs as microRNA (miRNA) sponges has been reported widely in different animals (Bolha et al., 2017). Circular RNAs are often generated by a non-linear back-splicing event between a downstream splice donor and an upstream splice acceptor. The recent development of circRNA sequencing has facilitated the identification of several circRNAs in both plants and animals (Zhu et al., 2019). Unraveling highly expressed and conserved circRNAs has improved the understanding of the functional impact of non-coding RNAs on the regulation of a wide variety of biological processes (Errichelli et al., 2017). The biogenesis, regulation, and function of circRNAs are less well understood in plants than in animals (Zhou et al., 2018). Differences in the biogenesis of circRNAs in plants have been reported through the characterization of circRNAs in Oryza sativa and Arabidopsis thaliana (Ye et al., 2015). Different studies have established that plant circRNAs are conserved and expressed at low levels. The role of circRNAs in the regulation of several biological processes, such as abiotic stress responses, has been reported in different plants, including cucumber under salt stress conditions (Zhu et al., 2019). In watermelon, a regulatory role of circRNAs in resistance to cucumber green mottle mosaic virus infection has been reported (Sun Y. et al., 2020). The circRNAs associated with the transcripts TRINITY_DN14188_c4_g4_i3 and TRINITY_DN14188_c4_g3_i2 were the ones most commonly found to be differentially expressed between the genotypes DC-48 and DC-83 at 10 DAP (1B:2B). Expression of the TRINITY_DN14188_c4_g4_i3 transcript was regulated by 27 DEcircRNAs, and expression of TRINITY_DN14188_c4_g3_i2 was regulated by 4 DEcircRNAs.
The identified circRNAs associated with the regulation of expression of these key transcripts are likely to be involved in determining the shelf-life of cucumber fruits. In addition, the transcript TRINITY_DN9807_c0_g1_i7, which was upregulated between the contrasting genotypes at both 5 DAP and 10 DAP, was regulated by the circRNAs stout_110 and stout_209. Our study identified a number of circRNAs regulating the functions of key genes associated with shelf-life in cucumbers. Phenylalanine ammonia-lyase is one of the important enzymes associated with fruit ripening in mango (Palafox-Carlos et al., 2014). Galactokinase was reported to play an important role in cell wall stability during the ripening of yellow melon (Schemberger et al., 2020). Together, these results suggest that circRNAs are involved in determining the shelf-life of cucumber fruits in the natural variant DC-48, which has an exceptionally long shelf-life and minimal post-harvest degradation, through the regulation of important genes associated with fruit ripening and cell wall stability.
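The back-splicing event mentioned in this Discussion, in which a downstream splice donor is joined to an upstream splice acceptor, produces a junction sequence that is absent from the linear genome; reads spanning that junction are the evidence on which circRNA detection rests. The Python sketch below illustrates only this concept with a made-up sequence. It is not the algorithm implemented in CIRI2, and all names and coordinates are hypothetical.

```python
def back_splice_junction(genome, start, end, flank=4):
    """Return the sequence spanning the back-splice junction.

    The circularized segment is genome[start:end]. Its last `flank` bases
    (just upstream of the downstream donor) are joined to its first `flank`
    bases (just downstream of the upstream acceptor).
    """
    segment = genome[start:end]
    if len(segment) < 2 * flank:
        raise ValueError("segment too short for the requested flank")
    return segment[-flank:] + segment[:flank]


def read_supports_junction(read, junction):
    """A read supports the circRNA if it contains the junction sequence."""
    return junction in read


# Made-up linear sequence; the circularized segment covers positions 4-15.
genome = "TTTTACGTAGGCCATGAAAA"
junction = back_splice_junction(genome, 4, 16)
supporting_read = "GCCATGACGTAG"

print(junction, read_supports_junction(supporting_read, junction))
```

Because the junction joins the end of the segment to its start, it cannot be found in the linear sequence, which is why only back-spliced reads identify a circRNA.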
In the present study, the majority of the circRNAs were exonic in origin, followed by intergenic and intronic types, indicating a genome-wide distribution of circRNAs in cucumber. An earlier study in cucumber by Zhu et al. (2019) reported a genome-wide distribution of circRNAs of similar types while examining their regulatory role in salt stress tolerance. In addition, circRNAs were commonly located toward both ends of the chromosomes, and the highest number of circRNAs was identified on chromosome 4, followed by chromosomes 6 and 3. Several of the circRNAs identified on chromosomes 4 and 6 were found to regulate parent genes associated with processes such as cell wall stability and degradation, chlorophyll biosynthesis and degeneration, and ethylene biosynthesis. A similar pattern was observed for the target genes associated with the identified DElncRNAs.
The comprehensive web resource of cucumber lncRNAs, LncR-CsExSLDb, is available in a single place to researchers worldwide for academic purposes and provides a platform for understanding the key roles of lncRNAs, circRNAs, and miRNA targets in the delayed ripening of cucumber. LncRNA expression may also be used as a promising biomarker for extended shelf-life in this very important vegetable crop. The results of this study indicate regulatory roles of lncRNAs and circRNAs in determining the shelf-life of cucumber fruits. These results will be instrumental in future studies of the complex molecular networks and regulatory roles of ncRNAs that determine complex traits such as extended shelf-life in cucumbers. However, further detailed studies are needed to delineate the role of specific lncRNAs and circRNAs in regulating the specific metabolic processes associated with extended shelf-life.
Data availability statement
The original contributions presented in the study are publicly available. These data can be found here: NCBI, PRJNA702645.
Author contributions
SSD: conceived theme of the study and designed experiment. SSD, MI, and SJ: data curation. PS, MI, SJ, and SSD: computational analysis and development of web-resources. SSD, BG, and KK: investigation. SSD, TKB, and ADM: Resources. SSD, ADM, and RB: supervision. SSD, TKB, ADM, and RB: visualization. SSD, PS, MI, ADM, and SJ: writing original draft. SSD, TKB, AR, and DK: review and editing. All authors read and approved the final manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgements
The authors are thankful to the ICAR-Indian Agricultural Research Institute for providing the necessary funding and infrastructure for conducting the research work. The authors are also thankful to the Indian Council of Agricultural Research, Ministry of Agriculture and Farmers’ Welfare, Government of India, for providing financial assistance in the form of the CABin grant (F. no. Agril. Edn.4–1/2013-A&P), as well as for the Advanced Super Computing Hub for Omics Knowledge in Agriculture (ASHOKA) facility at ICAR-IASRI, New Delhi, India.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.884476/full#supplementary-material
Supplementary Figure 1 | Expression analysis of the selected 17 transcripts through RT-PCR and fold change through RNA-seq.
Footnotes
- ^ http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- ^ https://www.ncbi.nlm.nih.gov/orffinder
- ^ https://hpc.nih.gov/apps/TransDecoder.html
- ^ https://rnacentral.org
- ^ http://cantata.amu.edu.pl/
- ^ http://plncdb.tobaccodb.org/
- ^ http://ftp.ensemblgenomes.org/pub/plants/release-48/fasta/cucumis_sativus/cdna/
- ^ https://www.idtdna.com/
- ^ http://ftp.gramene.org/CURRENT_RELEASE/fasta/cucumis_sativus/dna/
References
Arah, I. K., Amaglo, K., Kumah, E. K., and Ofori, H. (2015). Preharvest and postharvest factors affecting the quality and shelf life of harvested tomatoes: a mini review. Int. J. Agron. 2015:6. doi: 10.1155/2015/478041
Bhatia, G., Sharma, S., Upadhyay, S. K., and Singh, K. (2019). Long non-coding RNAs coordinate developmental transitions and other key biological processes in grapevine. Sci. Rep. 9, 1–14. doi: 10.1038/s41598-019-38989-7
Bohmdorfer, G., and Wierzbicki, A. T. (2015). Control of chromatin structure by long non-coding RNA. Trends Cell Biol. 25, 623–632.
Bolger, A. M., Lohse, M., and Usadel, B. (2014). Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics 30, 2114–2120. doi: 10.1093/bioinformatics/btu170
Bolha, L., Ravnik-Glavač, M., and Glavač, D. (2017). Long noncoding RNAs as biomarkers in cancer. Dis. Mark. 2017:7243968. doi: 10.1155/2017/7243968
Bonnet, E., He, Y., Billiau, K., and Van de Peer, Y. (2010). TAPIR, a web server for the prediction of plant microRNA targets, including target mimics. Bioinformatics 26, 1566–1568. doi: 10.1093/bioinformatics/btq233
Bordenave, M. (1996). “Analysis of pectin methyl esterases,” in Plant Cell Wall Analysis. Modern Methods of Plant Analysis, Vol. 17, eds H. F. Linskens and J. F. Jackson (Berlin: Springer). doi: 10.1007/978-3-642-60989-3_10
Bouba, I., Kang, Q., Luan, Y. S., and Meng, J. (2019). Predicting miRNA-lncRNA interactions and recognizing their regulatory roles in stress response of plants. Math. Biosci. 312, 67–76. doi: 10.1016/j.mbs.2019.04.006
Budak, H., Kaya, S. B., and Cagirici, H. B. (2020). Long non-coding RNA in plants in the era of reference sequences. Front. Plant Sci. 11:276. doi: 10.3389/fpls.2020.00276
Chekanova, J. A. (2015). Long non-coding RNAs and their functions in plants. Curr. Opin. Plant Biol. 27, 207–216.
Chen, L. L. (2016). The biogenesis and emerging roles of circular RNAs. Nat. Rev. Mol. Cell Biol. 17, 205–211. doi: 10.1080/15384047.2019.1617563
Conesa, A., and Götz, S. (2008). Blast2GO: a comprehensive suite for functional analysis in plant genomics. Int. J. Plant Genomics 2008:619832. doi: 10.1155/2008/619832
Corona-Gomez, J. A., Garcia-Lopez, I. J., Stadler, P. F., and Fernandez-Valverde, S. L. (2020). Splicing conservation signals in plant long noncoding RNAs. RNA 26, 784–793. doi: 10.1261/rna.074393.119
Dai, X., Zhuang, Z., and Zhao, P. X. (2018). psRNATarget: a plant small RNA target analysis server (2017 release). Nucleic Acids Res. 46, W49–W54. doi: 10.1093/nar/gky316
Dinger, M. E., Pang, K. C., Mercer, T. R., and Mattick, J. S. (2008). Differentiating protein-coding and noncoding RNA: challenges and ambiguities. PLoS Computat. Biol. 4:e1000176. doi: 10.1371/journal.pcbi.1000176
Dinger, M. E., Pang, K. C., Mercer, T. R., Crowe, M. L., Grimmond, S. M., and Mattick, J. S. (2009). NRED: a database of long noncoding RNA expression. Nucleic Acids Res. 37(Suppl. 1) D122–D126.
Errichelli, L., Modigliani, S. D., Laneve, P., Colantoni, A., Legnini, I., Capauto, D., et al. (2017). FUS affects circular RNA expression in murine embryonic stem cell-derived motor neurons. Nat. Commun. 8, 1–11.
Figueroa, C. M., and Lunn, J. E. (2016). A tale of two sugars: trehalose 6-phosphate and sucrose. Plant Physiol. 172, 7–27. doi: 10.1104/pp.16.00417
Franco-Zorrilla, J. M., Valli, A., Todesco, M., Mateos, I., Puga, M. I., Rubio-Somoza, I., et al. (2007). Target mimicry provides a new mechanism for regulation of microRNA activity. Nat. Genet. 39, 1033–1037.
Friedman, W. R., Halpern, B. S., McLeod, E., Beck, M. W., Duarte, C. W., Kappel, C. V., et al. (2020). Research priorities for achieving healthy marine ecosystems and human communities in a changing climate. Front. Mar. Sci. 7:5. doi: 10.3389/fmars.2020.00005
Gajanana, T. M., Murthy, D. S., and Sudha, M. (2011). Post-harvest losses in fruits and vegetables in South India–a review of concepts and quantification of losses. Indian Food Packer 65, 178–187.
Gao, C., Sun, J., Dong, Y., Wang, C., Xiao, S., Mo, L., et al. (2020). Comparative transcriptome analysis uncovers regulatory roles of long non-coding RNAs involved in resistance to powdery mildew in melon. BMC Genomics 21:125. doi: 10.1186/s12864-020-6546-8
Gao, Y., Zhang, J., and Zhao, F. (2018). Circular RNA identification based on multiple seed matching. Brief. Bioinformatics 19, 803–810. doi: 10.1093/bib/bbx014
Haas, B. J., Papanicolaou, A., Yassour, M., Grabherr, M., Blood, P. D., Bowden, J., et al. (2013). De novo transcript sequence reconstruction from RNA-seq using the trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512. doi: 10.1038/nprot.2013.084
Hao, Z., Fan, C., Cheng, T., Su, Y., Wei, Q., and Li, G. (2015). Genome-wide identification, characterization and evolutionary analysis of long intergenic noncoding RNAs in cucumber. PLoS One 10:e0121800. doi: 10.1371/journal.pone.0121800
He, X., Guo, S., Wang, Y., Wang, L., Shu, S., and Sun, J. (2020). Systematic identification and analysis of heat-stress-responsive lncRNAs, circRNAs and miRNAs with associated co-expression and ceRNA networks in cucumber (Cucumis sativus L.). Physiol. Plant. 168, 736–754. doi: 10.1111/ppl.12997
Iyer, M. K., Niknafs, Y. S., Malik, R., Singhal, U., Sahu, A., Hosono, Y., et al. (2015). The landscape of long noncoding RNAs in the human transcriptome. Nat. Genet. 47, 199–208.
Jha, U. C., Nayyar, H., Jha, R., Khurshid, M., Zhou, M., Mantri, N., et al. (2020). Long non-coding RNAs: emerging players regulating plant abiotic stress response and adaptation. BMC Plant Biol. 20:446. doi: 10.1186/s12870-020-02595-x
Kalunke, R. M., Tundo, S., Benedetti, M., Cervone, F., De Lorenzo, G., and D’Ovidio, R. (2015). An update on polygalacturonase-inhibiting protein (PGIP), a leucine-rich repeat protein that protects crop plants against pathogens. Front. Plant Sci. 6:146. doi: 10.3389/fpls.2015.00146
Kang, C., and Liu, Z. (2015). Global identification and analysis of long non-coding RNAs in diploid strawberry Fragaria vesca during flower and fruit development. BMC Genomics 16:815. doi: 10.1186/s12864-015-2014-2
Kang, Y. J., Yang, D. C., Kong, L., Hou, M., Meng, Y. Q., Wei, L., et al. (2017). CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features. Nucleic Acids Res. 45, W12–W16. doi: 10.1093/nar/gkx428
Karakülah, G., Yücebilgili Kurtoğlu, K., and Unver, T. (2016). PeTMbase: a database of plant endogenous target mimics (eTMs). PLoS One 11:e0167698. doi: 10.1371/journal.pone.0167698
Kasso, M., and Bekele, A. (2018). Post-harvest loss and quality deterioration of horticultural crops in Dire Dawa Region, Ethiopia. J. Saudi Soc. Agric. Sci. 17, 88–96.
Kȩska, K., Szcześniak, M. W., Adamus, A., and Czernicka, M. (2021). Waterlogging-stress-responsive LncRNAs, their regulatory relationships with miRNAs and target genes in cucumber (Cucumis sativus L.). Int. J. Mol. Sci. 22:8197. doi: 10.3390/i
Langmead, B., and Salzberg, S. (2012). Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359. doi: 10.1038/nmeth.1923
Li, B., and Dewey, C. N. (2011). RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinformatics 12:323. doi: 10.1186/1471-2105-12-323
Li, J., Ma, W., Zeng, P., Wang, J., Geng, B., Yang, J., et al. (2015). LncTar: a tool for predicting the RNA targets of long noncoding RNAs. Brief. Bioinformatics 16, 806–812. doi: 10.1093/bib/bbu048
Li, L., Cui, X., Yu, S., Zhang, Y., Luo, Z., Yang, H., et al. (2014). PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations. PLoS One 9:e92863. doi: 10.1371/journal.pone.0092863
Li, L., Guo, J., Chen, Y., Chang, C., and Xu, C. (2017). Comprehensive CircRNA expression profile and selection of key CircRNAs during priming phase of rat liver regeneration. BMC Genomics 18:80. doi: 10.1186/s12864-016-3476-6
Li, X., Yang, L., and Chen, L. L. (2018). The biogenesis, functions, and challenges of circular RNAs. Mol. Cell 71, 428–442.
Liu, T. T., Zhu, D., Chen, W., Deng, W., He, H., He, G., et al. (2013). A global identification and analysis of small nucleolar RNAs and possible intermediate-sized non-coding RNAs in Oryza sativa. Mol. Plant 6, 830–846. doi: 10.1093/mp/sss087
Liu, X., Hao, L., Li, D., Zhu, L., and Hu, S. (2015). Long non-coding RNAs and their biological roles in plants. Genomics Proteomics Bioinformatics 13, 137–147. doi: 10.1016/j.gpb.2015.02.003
Ma, L., Bajic, V. B., and Zhang, Z. (2013). On the classification of long non-coding RNAs. RNA Biol. 10, 925–933.
Meng, X., Li, A., Yu, B., and Li, S. (2021). Interplay between miRNAs and lncRNAs: mode of action and biological roles in plant development and stress adaptation. Comput. Struct. Biotechnol. J. 19, 2567–2574. doi: 10.1016/j.csbj.2021.04.062
Mistry, J., Chuguransky, S., Williams, L., Qureshi, M., Salazar, G. A., Sonnhammer, E. L., et al. (2021). Pfam: the protein families database in 2021. Nucleic Acids Res. 49, D412–D419. doi: 10.1093/nar/gkaa913
Mukherjee, P. K., Nema, N. K., Maity, N., and Sarkar, B. K. (2013). Phytochemical and therapeutic potential of cucumber. Fitoterapia 84, 227–236. doi: 10.1016/j.fitote.2012.10.003
Nejat, N., and Mantri, N. (2018). Emerging roles of long non-coding RNAs in plant response to biotic and abiotic stresses. Crit. Rev. Biotechnol. 38, 93–105.
Palafox-Carlos, H., Contreras-Vergara, C. A., Muhlia-Almazán, A., Islas-Osuna, M. A., and González-Aguilar, G. A. (2014). Expression and enzymatic activity of phenylalanine ammonia-lyase and p-coumarate 3-hydroxylase in mango (Mangifera indica ‘Ataulfo’) during ripening. Genet. Mol. Res. 16, 3850–3858. doi: 10.4238/2014.May.16.10
Paniagua, C., Pose, S., Morris, V. J., Kirby, A. R., Quesada, M. A., and Mercado, J. A. (2014). Fruit softening and pectin disassembly: an overview of nanostructural pectin modifications assessed by atomic force microscopy. Ann. Bot. 114, 1375–1383. doi: 10.1093/aob/mcu149
Porat, R., Lichter, A., Terry, L. A., Harker, R., and Buzby, J. (2018). Postharvest losses of fruit and vegetables during retail and in consumers’ homes: quantifications, causes, and means of prevention. Postharvest Biol. Technol. 139, 135–149.
Pradeepkumara, N., Sharma, P. K., Munshi, A. D., Behera, T. K., Bhatia, R., Kumari, K., et al. (2022). Fruit transcriptional profiling of the contrasting genotypes for shelf life reveals the key candidate genes and molecular pathways regulating post-harvest biology in cucumber. Genomics 114:110273. doi: 10.1016/j.ygeno.2022.110273
Ravasi, T., Suzuki, H., Pang, K. C., Katayama, S., Furuno, M., Okunishi, R., et al. (2006). Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome. Genome Res. 16, 11–19.
Robinson, M. D., McCarthy, D. J., and Smyth, G. K. (2010). edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140. doi: 10.1093/bioinformatics/btp616
Saladié, M., Cañizares, J., Phillips, M. A., Rodriguez-Concepcion, M., Larrigaudière, C., Gibon, Y., et al. (2015). Comparative transcriptional profiling analysis of developing melon (Cucumis melo L.) fruit from climacteric and non-climacteric varieties. BMC Genomics 16:440. doi: 10.1186/s12864-015-1649-3
Sanger, H. L., Klotz, G., Riesner, D., Gross, H. J., and Kleinschmidt, A. K. (1976). Viroids are single-stranded covalently closed circular RNA molecules existing as highly base-paired rod-like structures. Proc. Natl. Acad. Sci. U.S.A. 73, 3852–3856. doi: 10.1073/pnas.73.11.3852
Schemberger, M. O., Stroka, M. A., Reis, L., de Souza Los, K. K., de Araujo, G., Sfeir, M., et al. (2020). Transcriptome profiling of non-climacteric ‘yellow’ melon during ripening: insights on sugar metabolism. BMC Genomics 21:262. doi: 10.1186/s12864-020-6667-0
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi: 10.1101/gr.1239303
Shin, S. Y., Jeong, J. S., Lim, J. Y., Kim, T., Park, J. H., Kim, J. K., et al. (2018). Transcriptomic analyses of rice (Oryza sativa) genes and non-coding RNAs under nitrogen starvation using multiple omics technologies. BMC Genomics 19:532. doi: 10.1186/s12864-018-4897-1
Sun, Y., Zhang, H., Fan, M., He, Y., and Guo, P. (2020). Genome-wide identification of long non-coding RNAs and circular RNAs reveal their ceRNA networks in response to cucumber green mottle mosaic virus infection in watermelon. Arch. Virol. 165, 1177–1190. doi: 10.1007/s00705-020-04589-4
Sun, Z., Huang, K., Han, Z., Wang, P., and Fang, Y. (2020). Genome-wide identification of Arabidopsis long noncoding RNAs in response to the blue light. Sci. Rep. 10:6229.
Tang, Y., Qu, Z., Lei, J., He, R., Adelson, D. L., Zhu, Y., et al. (2021). The long noncoding RNA FRILAIR regulates strawberry fruit ripening by functioning as a noncanonical target mimic. PLoS Genet. 17:e1009461. doi: 10.1371/journal.pgen.1009461
Tian, Y., Bai, S., Dang, Z., Hao, J., Zhang, J., and Hasi, A. (2019). Genome-wide identification and characterization of long non-coding RNAs involved in fruit ripening and the climacteric in Cucumis melo. BMC Plant Biol. 19:369. doi: 10.1186/s12870-019-1942-4
Upadhyay, R. K., Tucker, M. L., and Mattoo, A. K. (2020). Ethylene and ripening inhibitor modulate expression of SlHsp17.7A, B class I small heat shock protein genes during tomato fruit ripening. Front. Plant Sci. 11:975. doi: 10.3389/fpls.2020.00975
Valenzuela-Riffo, F., Zúñiga, P. E., Morales-Quintana, L., Lolas, M., Cáceres, M., and Figueroa, C. R. (2020). Priming of defense systems and upregulation of MYC2 and JAZ1 genes after Botrytis cinerea inoculation in methyl jasmonate-treated strawberry fruits. Plants (Basel, Switzerland) 9:447. doi: 10.3390/plants9040447
Wang, A., Hu, J., Gao, C., Chen, G., Wang, B., Lin, C., et al. (2019). Genome-wide analysis of long non-coding RNAs unveils the regulatory roles in the heat tolerance of Chinese cabbage (Brassica rapa ssp. chinensis). Sci. Rep. 9, 1–14. doi: 10.1038/s41598-019-41428-2
Wang, J., Yu, W., Yang, Y., Li, X., Chen, T., Liu, T., et al. (2015). Genome-wide analysis of tomato long non-coding RNAs and identification as endogenous target mimic for microRNA in response to TYLCV infection. Sci. Rep. 5:16946.
Wang, L., Zhang, X. L., Wang, L., Tian, Y., Jia, N., Chen, S., et al. (2017). Regulation of ethylene-responsive SlWRKYs involved in color change during tomato fruit ripening. Sci. Rep. 7:16674. doi: 10.1038/s41598-017-16851-y
Wang, X., Peng, F., Li, M., Yang, L., and Li, G. (2012). Expression of a heterologous SnRK1 in tomato increases carbon assimilation, nitrogen uptake and modifies fruit development. J. Plant Physiol. 169, 1173–1182. doi: 10.1016/j.jplph.2012.04.013
Wang, Z., Gerstein, M., and Snyder, M. (2009). RNA-Seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet. 10, 57–63.
Watson, C. F., Zheng, L., and DellaPenna, D. (1994). Reduction of tomato polygalacturonase beta subunit expression affects pectin solubilization and degradation during fruit ripening. Plant Cell 6, 1623–1634. doi: 10.1105/tpc.6.11.1623
Weston, L. A., and Barth, M. M. (1997). Preharvest factors affecting postharvest quality of vegetables. Hortic. Sci. 32, 812–816.
Wierzbicki, A. T., Haag, J. R., and Pikaard, C. S. (2008). Noncoding transcription by RNA polymerase Pol IVb/Pol V mediates transcriptional silencing of overlapping and adjacent genes. Cell 135, 635–648. doi: 10.1016/j.cell.2008.09.035
Witasari, L. D., Huang, F. C., Hoffmann, T., Rozhon, W., Fry, S. C., and Schwab, W. (2019). Higher expression of the strawberry xyloglucan endotransglucosylase/hydrolase genes FvXTH9 and FvXTH6 accelerates fruit ripening. Plant J. 100, 1237–1253. doi: 10.1111/tpj.14512
Wu, J., Wang, D., Liu, Y., Wang, L., Qiao, X., and Zhang, S. (2014). Identification of miRNAs involved in pear fruit development and quality. BMC Genomics 15:953. doi: 10.1186/1471-2164-15-953
Xu, Y., Xu, H., Wall, M. M., and Yang, J. (2020). Roles of transcription factor SQUAMOSA promoter binding protein-like gene family in papaya (Carica papaya) development and ripening. Genomics 112, 2734–2747. doi: 10.1016/j.ygeno.2020.03.009
Yang, H., Liu, J., Dang, M., Zhang, B., Li, H., Meng, R., et al. (2018). Analysis of β-galactosidase during fruit development and ripening in two different texture types of apple cultivars. Front. Plant Sci. 9:539. doi: 10.3389/fpls.2018.00539
Yang, S., Zhang, X., Zhang, X., Dang, R., Zhang, X., and Wang, R. (2016). Expression of two endo-1,4-β-glucanase genes during fruit ripening and softening of two pear varieties. Food Sci. Technol. Res. 22, 91–99.
Ye, C. Y., Chen, L., Liu, C., Zhu, Q. H., and Fan, L. (2015). Widespread noncoding circular RNAs in plants. New Phytol. 208, 88–95. doi: 10.1111/nph.13585
Yu, W., Peng, F., Xiao, Y., Wang, G., and Luo, J. (2018). Overexpression of PpSnRK1α in tomato promotes fruit ripening by enhancing RIPENING INHIBITOR regulation pathway. Front. Plant Sci. 9:1856. doi: 10.3389/fpls.2018.01856
Yu, Y., Zhang, Y., Chen, X., and Chen, Y. (2019). Plant noncoding RNAs: hidden players in development and stress responses. Annu. Rev. Cell Dev. Biol. 35, 407–431. doi: 10.1146/annurev-cellbio-100818-125218
Yuste-Lisbona, F. J., Quinet, M., Fernández-Lozano, A., Pineda, B., Moreno, V., Angosto, T., et al. (2016). Characterization of vegetative inflorescence (mc-vin) mutant provides new insight into the role of MACROCALYX in regulating inflorescence development of tomato. Sci. Rep. 6:18796. doi: 10.1038/srep18796
Zhang, G., Chen, D., Zhang, T., Duan, A., Zhang, J., and He, C. (2018). Transcriptomic and functional analyses unveil the role of long non-coding RNAs in anthocyanin biosynthesis during sea buckthorn fruit ripening. DNA Res. 25, 465–476. doi: 10.1093/dnares/dsy017
Zhang, Z. P., Deng, Y., Song, X., and Miao, M. (2015). Trehalose-6-phosphate and SNF1-related protein kinase 1 are involved in the first-fruit inhibition of cucumber. J. Plant Physiol. 177, 110–120. doi: 10.1016/j.jplph.2014.09.009
Zhou, M. Y., Yang, J. M., and Xiong, X. D. (2018). The emerging landscape of circular RNA in cardiovascular diseases. J. Mol. Cell. Cardiol. 122, 134–139. doi: 10.1016/j.yjmcc.2018.08.012
Zhu, B., Yang, Y., Li, R., Fu, D., Wen, L., Luo, Y., et al. (2015). RNA sequencing and functional analysis implicate the regulatory role of long non-coding RNAs in tomato fruit ripening. J. Exp. Bot. 66, 4483–4495. doi: 10.1093/jxb/erv203
Keywords: cucumber, lncRNA, circRNA, regulatory role, fruit firmness, shelf life
Citation: Dey SS, Sharma PK, Munshi AD, Jaiswal S, Behera TK, Kumari K, G B, Iquebal MA, Bhattacharya RC, Rai A and Kumar D (2022) Genome wide identification of lncRNAs and circRNAs having regulatory role in fruit shelf life in health crop cucumber (Cucumis sativus L.). Front. Plant Sci. 13:884476. doi: 10.3389/fpls.2022.884476
Received: 26 February 2022; Accepted: 27 June 2022;
Published: 03 August 2022.
Edited by:
Dinesh Yadav, Deen Dayal Upadhyay Gorakhpur University, India
Reviewed by:
Haidong Yan, University of Georgia, United States
Vidya Sagar, Indian Institute of Vegetable Research (ICAR), India
Copyright © 2022 Dey, Sharma, Munshi, Jaiswal, Behera, Kumari, G, Iquebal, Bhattacharya, Rai and Kumar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shyam S. Dey, shyam.iari@gmail.com; Mir Asif Iquebal, ma.iquebal@icar.gov.in
†Present address: T. K. Behera, ICAR-Indian Institute of Vegetable Research, Varanasi, India
‡ORCID: Shyam S. Dey, https://orcid.org/0000-0001-9211-8820; Mir Asif Iquebal, https://orcid.org/0000-0003-3787-5997