- 1Department of Biotechnology, D.D.U. Gorakhpur University, Gorakhpur, India
- 2International Center for Genetic Engineering and Biotechnology, New Delhi, India
- 3Department of Plant Sciences, College of Agricultural and Marine Sciences, Sultan Qaboos University, Al-Khod, Oman
The genus Begomovirus represents a group of multipartite viruses that significantly damage many agricultural crops, including papaya, and influence overall production. Papaya leaf curl disease (PaLCD) caused by the complex begomovirus species has several important implications and substantial losses in papaya production in many developing countries, including India. The increase in the number of begomovirus species poses a continuous threat to the overall production of papaya. Here, we attempted to map the genomic variation, mutation, evolution rate, and recombination to know the disease complexity and successful adaptation of PaLCD in India. For this, we retrieved 44 DNA-A and 26 betasatellite sequences from GenBank reported from India. An uneven distribution of evolutionary divergence has been observed using the maximum-likelihood algorithm across the branch length. Although there were phylogenetic differences, we found high rates of nucleotide substitution mutation in both viral and sub-viral genome datasets. We demonstrated frequent recombination of begomovirus species, with a maximum in intra-species recombinants. Furthermore, our results showed a high degree of genetic variability, demographic selection, and mean substitution rate acting on the population, supporting the emergence of a diverse and purifying selection of viruses and associated betasatellites. Moreover, variation in the genetic composition of all begomovirus datasets revealed a predominance of nucleotide diversity principally driven by mutation, which might further accelerate the advent of new strains and species and their adaption to various hosts with unique pathogenicity. Therefore, the finding of genetic variation and selection emphases on factors that contribute to the universal spread and evolution of Begomovirus and this unanticipated diversity may also provide guidelines toward future evolutionary trend analyses and the development of wide-ranging disease control strategies for begomoviruses associated with PaLCD.
Introduction
Begomoviruses (family: Geminiviridae) are the largest group of plant viruses that pose a significant yield loss to many economically important crops in tropical and sub-tropical regions of the world (Varma and Malathi, 2003). Begomoviruses are transmitted by the whitefly Bemisia tabaci, a cryptic species complex which has been adapted and co-evolved with the begomovirus genome (Brown et al., 2012; Marwal et al., 2021). Due to its semi-persistent mode of transmission and behavioural manipulation, B. tabaci is considered the second most important vector of plant viruses that effectively transmits a large number of virus species belonging to the genus begomovirus (Moreno-Delafuente et al., 2013).
Begomoviruses are approximately 2.7–2.8 Kb in size and composed of circular single-stranded DNA (ssDNA) encapsulated in a quasi-isometric non-enveloped twinned particle (Stanley et al., 2005). Begomoviruses are either bipartite [native to New World (NW)] segments of DNA A and DNA B or monopartite [native to Old World (OW)] segment of DNA A. The OW monopartite begomoviruses are also associated with ssDNA helper molecules designated as betasatellite or alphasatellite and a newly reported deltasatellite (Zhou, 2013; Lozano et al., 2016). Papaya leaf curl virus (PaLCuV) is a monopartite begomovirus ~2.7–2.8 Kb in size (Figure 1) that causes PaLCD. PaLCuV genome consists of six open reading frames (ORFs) systematised in two transcriptional directions and are divided by an intergenic region (IR) (Nehra et al., 2019). They have been named according to their functions, and encode proteins (AV2, AV1, AC3, AC2, AC1, and AC4) that assist in viral particles movement (intra- and inter-cellular) within the host. Besides, this IR contains a highly conserved unique nonanucleotide “TAATATTAC” with an origin of replication (Figure 2). Moreover, OW monopartite viruses are mostly associated with betasatellite, which encodes an important βC1 protein (Figure 2) at the complementary sense strand and plays an important role in transcriptional and post-transcriptional gene silencing, disease epidemics, and symptom induction (Varun and Saxena, 2018), and can also affect the Jasmonic acid repressor gene (Zhou, 2013). Thus, begomovirus together with betasatellite causes systemic infection and develops typical symptoms in plants (Nawaz-ul-Rehman and Fauquet, 2009; Hanley-Bowdoin et al., 2013). Leaf curl symptoms caused by the papaya leaf curl virus are found associated with infected papaya plants from different regions, which initiate serious production losses and can act as a potential inducer for viral transmission via the vector whitefly (Guo et al., 2015).
Figure 1. Genome organisation of the Papaya Leaf Curl Virus with associated satellites. (A) DNA-A; (B) Betasatellite; and (C) Alphasatellite.
Figure 2. The genomic component of the monopartite Papaya Leaf Curl Virus genome with betasatellite indicating the silencing suppressor genes; six ORF (AV2, AC2, AC4, AV1, AC3, and AC1); βC1.
Papaya (Carica papaya), a major tropical, sweet, large, and herbaceous food crop, belongs to the order Brassicales (family: Caricaceae) and is cultivated throughout India (Yadav et al., 2017). However, its local and commercial cultivation makes it difficult to achieve its full potential (FAOSTAT, 2019) due to the large number of insects pest, bacteria, fungi, and viruses. Among viral diseases, papaya leaf curl disease (PaLCD) caused by a complex of begomovirus species is a major damaging factor, which affects the overall production of papaya (Nascimento et al., 2010). PaLCD caused by a geminivirus was first reported by Thomas and Krishnaswamy in 1939 and, subsequently, the causative virus was further confirmed by Saxena et al. (1998). Since then, the number of begomovirus species infecting papaya crops has been increasing. Begomovirus infected papaya plants show typical symptoms of begomovirus infection, including yellowing, curling, leaf distortion, and stunting (Figure 3). India, a leading papaya-producing country in the world, accounts for a severe infection impact of PaLCuV, including other begomovirus species, which impedes commercial papaya production of papaya plants, and moreover, it could also reduce the growth of the pharmaceutical industry. Therefore, infection of begomovirus associated with PaLCD causes greater loss to the papaya crop, which not only had anatomical and physiological losses, but their pharmacological potential was also significantly diminished (Soni et al., 2022). In India, PaLCD is associated with a complex of 15 begomovirus species, such as Papaya leaf curl virus (PaLCuV), Papaya leaf crumple virus (PaLCrV), Chilli leaf curl virus (ChiLCV), Chilli leaf curl India virus (ChiLCINV), Duranta leaf curl virus (DLCV), Papaya yellow leaf curl virus (PaYLCV), Papaya severe leaf curl virus (PaSLCV), Tomato leaf curl New Delhi virus (ToLCNDV), Tomato leaf curl virus (ToLCV), Cotton leaf curl Multan virus (CLCuMuV), Tomato leaf curl Gujarat virus (ToLCGV), Croton yellow vein mosaic virus (CYVMV), Pedilanthus leaf curl virus (PeLCV), Ageratum enation virus (AEV), and Cotton leaf curl Burewala virus (CLCuBuV).
Figure 3. Observed symptoms in Carica papaya infected with PaLCD begomoviruses. Severe upward and downward curling, thickening, crinkling, and yellowing of leaves (A–E) along with healthy leaves (F).
In spite of being an economically important tropical fruit, little attention has been paid to assessing the genetic diversity at the molecular level of the papaya infecting begomoviruses (Fougat et al., 2015). The current information about the geographical distribution in India, genetic diversity, and evolutionary dynamics of begomovirus and associated satellites for papaya infecting plants remains insufficient or little information is available. Moreover, there is no data available regarding comparative analysis of genetic variability and evolutionary aspects to understand the virus and satellite populations arising from different geographical locations in India. In the present study, we focused on the current diversity of PaLCD-causing begomoviruses in India, genomic variability, population dynamics, and evolutionary patterns to get further insights into the complexity of PaLCD begomoviruses and their accompanying satellites.
Materials and Methods
Sequence Retrieval and Sequence Alignment
A total of 70 complete genomic sequences of begomoviruses and their associated satellites infecting papaya plants were retrieved from GenBank,1 including 44 sequences of DNA-A and 26 sequences of betasatellite (Figure 4; Table 1). The sequences of all begomoviruses and sub-viral satellites associated with PaLCD reported till June 30, 2021, were included in the study to explore its diversity and distribution in India. A total of eight datasets were formed (DNA-A, its six ORFs, and betasatellite), and each specific dataset was aligned through a multiple sequence alignment algorithm with Clustal W (Thompson et al., 1994), using MEGA X software (Kumar et al., 2018). Only two alphasatellites sequences associated with the papaya host were found at the NCBI gene data bank, which is not sufficient to show significant analysis and, therefore, we have not included them in sequence analysis in this study. All sequences were organised to begin at the nick site of the conserved nonanucleotide motif at the origin of replication (5′-TAATATT//AC-3′).
Figure 4. Distribution of distinct begomoviruses isolates and their satellites in India associated with leaf curl disease of Carica papaya (Follow Table 1).
Table 1. Features of begomoviruses and associated satellites causing leaf curl disease in papaya plants are identified in India.
Phylogenetic Analysis and Detection of Substitution Mutation Bias
A phylogenetic analysis was performed using the MEGA X program with bootstrap analysis utilising 1,000 replicates. The evolutionary history for sequence datasets DNA-A, its six ORFs and associated betasatellites was calculated using the maximum likelihood (ML) tree based on CLUSTAL W pairwise alignment and the best fit nucleotide substitution model, i.e., (GTR + G) for DNA-A, its ORFs (AV1: TN93 + G + I); (AV2: HKY + G); (AC1 and AC3: GTR + G); (AC2: TN93 + G); (AC4: HKY + G), and (TN93 + G + I) for betasatellites, based on the corrected Bayesian Information Criterion (BIC) score of the MEGA X program (Kumar et al., 2018). Moreover, the transversion and transition DNA substitution rates as well as the transition/transversion bias (R) were also calculated for the virus, its ORFs and betasatellites using MEGA X (Kumar et al., 2018).
Detection of Recombination
The recombination analysis was done by using seven methods, i.e., RDP (Martin and Rybicki, 2000), BOOTSCAN (Martin and Williamson, 2005), CHIMAERA (Posada and Crandall, 2001), GENECONV (Padidam et al., 1999), MAXCHI (Maynard, 1992), 3SEQ (Boni et al., 2007), and SISCAN (Gibbs et al., 2000) implemented in the recombination detection program (RDP) v.4.1 software (Martin et al., 2015). For analysis, toward 0.05, the highest acceptable Bonferroni corrected value of p and default detection thresholds, datasets were subjected. To reduce false-positive results, out of seven, at least three algorithms were considered appropriate to detect recombination events.
Coalescent Analysis
The rates of nucleotide substitution per site and the rates of mutation at three different codon positions (C1, C2, and C3) in the sequence datasets of begomoviruses, their ORFs and betasatellite were determined by using the Bayesian Markov Chain Monte Carlo (MCMC) parameter of Bayesian Evolutionary Analysis Sampling Tree (BEAST; v.1.10; Suchard et al., 2018). Coalescent constant demographic models and best-fit molecular clock were detected using BEAST, whose sample size was effectively achieved through the use of the Tracer program (v.1.5; Rambaut et al., 2018). The MCMC chain used a 10% burn-in value with 107 run lengths to provide a 95% highest probability density (HPD) interval for determining statistical uncertainty. For assessing temporal structure, which is important for the estimation of substitution rates, we repeated the BEAST analysis by reshuffling the sampling times for each dataset, which were randomised on the tips (Ramsden et al., 2008), and then the HPDs of these randomised sequences were compared with those of the real data.
The Population Structure Assay
To determine the genetic diversity of a virus population, several parameters of the DnaSP (v. 6.12) software were used (Rozas et al., 2017). The aligned sequence datasets were examined for the total number of segregating sites (s), the average number of nucleotide differences between sequences (k), the total number of mutations (η), the nucleotide diversity (π), and Watterson’s estimate of the population mutation rate based on the total number of segregating sites (θ − w) and the total number of mutations (θ – η) were also calculated along with a number of haplotypes (h), and haplotype diversity (Hd; Lima et al., 2017).
The Neutrality Test is executed to calculate the hypothesis of selection pressure occurring in the population using DnaSP (v.6.12) software (Rozas et al., 2017; Universitat de Barcelona; origin 1994). Therefore, sequence datasets separated by different geographical locations are tested by employing tests such as Tajima’s D, which signifies the difference between the two measures of genetic diversity, i.e., the number of segregating sites and the mean number of pairwise differences, Fu and Li’s D* for identifying the difference between the total number of mutations and the number of singletons, and Fu and Li’s F* for identifying the difference between the average number of nucleotide differences between paired sequences and the number of singletons, i.e., detecting the neutrality of mutations in DNA populations (Rozas et al., 2017).
Results
Geographical Distribution, Phylogenetic Analysis, and Detection of Substitution Mutation Bias
The complete genomic sequence of PaLCD-associated begomoviruses and their satellites was considered in the current study, which included a viral genome of 2.7–2.8 Kb in size and an associated satellite of 1.3 Kb in size (Figure 2). The total of 44 begomovirus isolates of Indian origin showed the wide distribution of the viruses, which predominantly constitute six sequences of Papaya leaf curl virus (PaLCV), 13 sequences of Papaya leaf crumple virus (PaLCrV), and 10 sequences of Chilli leaf curl virus (ChiLCV), infecting papaya plants in India (Table 1). Moreover, a total of 26 betasatellite isolates, which predominantly constitute seven sequences of Tomato leaf curl betasatellite (ToLCB), four sequences of Croton yellow vein mosaic betasatellite (CroYVMB), and three sequences of each for Cotton leaf curl Multan betasatellite (CLCuMuB) and Chilli leaf curl betasatellite (ChiLCB) respectively, showed the mixture of their species infecting papaya plants as per Table 1. Only two papaya plant-associated alphasatellite isolates, i.e., papaya leaf curl alphasatellite (KR052157 and JQ322970) were recovered and were found insufficient to give significant results for the present study. The time curve distribution of 44 begomovirus genomes by date of isolation in India shows a large variation from the year 1997 to the year 2021 (Figure 5).
Figure 5. A time-curve graph depicting the distribution of 44 begomovirus genomes in India by date of isolation. The numbers of complete genomes of begomovirus were plotted against the time of their collection from the year 1997 to the year 2021 (data from NCBI).
After collecting the various begomovirus sequences associated with the papaya host from the NCBI GenBank, we used phylogenetic analysis to discover the interconnection between different begomovirus species evolving from different regions and clustering into a distinct group. The evolutionary history of DNA A sequence datasets was observed with four major distant lineages with 1,000 bootstrap support and were grouped as (PaLCuV I), (PaLCrV II), (PaYLCuV III), and (ChiLCV IV a & b; Figure 6). The lineages comprise the isolates collected from different geographic locations in India (Table 1). The branch length among a population suggests the level of differentiation within them. Group I showed clustering among isolates of PaLCuV from New Delhi, Gujrat, and the Rajasthan region. Group II of PaLCrV forms a cluster between the isolates of New Delhi, Haryana, Punjab, and some regions of Madhya Pradesh and Uttar Pradesh. Further, PaYLCuV forms group III and get clustered between the isolates of New Delhi and Rajasthan. The two groups of ChiLCV were observed in which one group of ChiLCV (IVa) shows clustering with the major group I, which suggests the intra-species similarity with PaLCuV, whereas the second ChiLCV group (IVb) forms a separate clade expressing their inter-species relations. Additionally, some small groups are also observed, clustering with their closely related species. Similarly, the phylogenetic analysis was also performed for six different ORFs of DNA-A and observed the sequence homology for selected genome regions (Figure 7).
Figure 6. Maximum-likelihood phylogeny-based partitioning tree associated with PaLCD begomoviruses in India aligned using CLUSTAL W with a 1,000 bootstrap value within the MEGA v.10 program; 44 begomovirus isolates (DNA-A sequences) form four distinct major groups.
Figure 7. Maximum-likelihood phylogenetic dendrogram of all the six genes/ORFs of begomoviruses associated with PaLCD in India aligned using CLUSTAL W with a 1,000 bootstrap value within the MEGA v.10 program; six ORFs (CP; C4; REn; Rep; Pre-CP; and TrAp) of 44 begomovirus isolates.
In addition, well-defined clusters are observed in the case of betasatellite, depicting five distinct groups as (ToLCB I a & b), (ToLCBDB II), (ChiLCB III), (CroYVMB IV), and (CLCuMuB V; Figure 8). Further, different rates of transition, transversion DNA substitution mutation, and transition/transversion bias (R) were estimated for the begomovirus isolates causing PaLCD (Table 2). However, in our study, we found that the rate of transition and transversion substitution mutation in genomic regions ranged from 18.99–27.34 to 1–1.44, respectively, while the transition/transversion bias (R) was 1.00. Further, independent analysis of six ORF datasets showed variable values for different substitution mutation rates. We found the transition/transversion rate for all six ORF regions is AV2, AC2, and AC4 regions are most susceptible to transition and less susceptible to transversion, while AC3 and AC1 are more prone to transversion than transition. Moreover, we found transition/transversion bias (R) ratios were close to 1 (Table 2), except in AC2 and AV1 regions where they were equal to 1. Therefore, a higher transition/transversion ratio, especially in AV1 and AC2 regions (1.0), suggests that either negative selection is removing transversions and transitions are favoured, whereas a lower transition/transversion ratio, especially in AV2, AC3, AC1, and AC4 regions (0.9), suggests that either negative selection is removing transitions and transversions are favoured. Similarly, for betasatellite, the transition and transversion substitution mutation range were obtained as 0.23–55.55 and 0.42–0.72, respectively, while the transition/transversion bias (R) was found at 0.81. However, we conclude that selection is a contributor towards transition/transversion bias (R) in begomoviruses, which cannot be partially explained by either transition or transversion value. Therefore, the estimation suggests that the role of base substitutions causes transition mutation at a higher rate in a population than single nucleotide polymorphism.
Figure 8. Maximum-likelihood phylogeny-based partitioning tree of begomoviruses linked betasatellites associated with PaLCD in India aligned using CLUSTAL W with 1,000 bootstrap value in the MEGA v.10 program; 26 betasatellites isolates for five distinct groups.
Table 2. Estimation of substitution rate among DNA-A, open reading frames (ORFs), and betasatellite associated with Papaya Leaf Curl Disease (PaLCD) in India using the MEGA X program.
Detection of Recombination
Further, the tree-like phylogenetic divergence obtained for sequence datasets directed us to detect the occurrence of non-tree-like evolution within a population to explain the potential for recombination events in aligned sequences. Different parameters were used to determine the shared overlapping intra- and inter-specific recombination events distributed throughout the genome with different parental combinations. Forty-eight putative recombination events were observed for DNA-A datasets, in which 30 isolates were found to be recombinant. Out of 30 isolates, 20 had a single recombination event, whereas 10 showed multiple events of recombination (Table 3). The AC1 genome region DNA-A datasets are highly prone to recombination, exhibiting 19 putative breakpoints (1,527–2,612 nucleotide positions; Table 4), with 15 recombinants having major and minor parents. However, both intra and inter-specific recombination among different begomovirus isolates was predominately observed in the AC1, AV1, and AC2 rich genome regions, while the AV2, AC3, and AC4 genome regions were less susceptible to recombination. Additionally, 15 putative recombination breakpoints were identified for betasatellites, among which 12 isolates were found to be recombinant, of which eight isolates had a single recombination event and four isolates were found to have more than one event of recombination (Table 3). Moreover, recombination events distributed predominantly in the βC1 genome region of betasatellites support the prevalence of recombination that is involved in virus movement by suppressing host antiviral silencing genes (Kumar et al., 2015). Further, relevant recombination events were obtained by selecting at least three or more methods that minimise incompetent outcomes. Thus, significant amounts of genetic variation were supported by the maximum putative recombinational events among sequence datasets.
Table 3. Putative recombination events detected using the RDP4.1 program among begomoviruses and betasatellites associated with PaLCD, based on provided datasets from India; 44 DNA-A sequence and 26 associated betasatellite.
Table 4. Putative recombination events detected using the RDP4.1 program among genes of begomoviruses associated with PaLCD, based on provided datasets from India; six genes/ORF’s of 44 DNA-A sequences.
Coalescent Analysis
An evolutionary scenario signifies the role of nucleotide substitution along with recombination in gaining genetic variation and evolution among begomoviruses (Mishra et al., 2020). To detect the nucleotide substitution rate among reference begomoviruses and betasatellites, best-fit substitution models were selected based on the lowest BIC value along with both relaxed and strict molecular clocks. The mean substitution rate among the sequence datasets of DNA-A is 1.83 × 10−3 subsite−1 year−1 [DNA-A, 95% highest posterior density (HPD) interval ranging from 4.126 × 10−5 to 6.647 × 10−3], which was found to be higher when compared with the range of nucleotide substitution rates of RNA viruses reported so far (Jenkins et al., 2002; Duffy and Holmes, 2009; Kumar et al., 2015), but importantly, the high substitution frequency detected here shows a short-term mutational phenomenon acting on the population rather than a long-term substitution rate. To justify the above, the nucleotide substitution rate was also identified superficially in four gene datasets, i.e., AV2, AV1, AC3, AC2 (Table 5). Additionally, for betasatellites, the mean substitution rate was found to be 1.62 × 10−6 subsite−1 year−1 (β, 95% HPD interval ranging from 1.329 × 10−8 to 5.08 × 10−6). However, a relaxed molecular clock is used to get the suitable value of the mean substitution rate, and the obtained high substitution value is most likely to have resulted from strong positive selection (Duffy and Holmes, 2009). Since then, the population’s selection pressure has been significantly slowed by mutation, which causes codon degeneracy and genetic variation. Therefore, we measured the rate of mutation acting on three nucleotide codon positions, i.e., CP1, CP2, and CP3 respectively, and we found the highest rate of mutation at codon position C3 for DNA-A and among ORFs. The highest mutation rate was found in the AV1 (CP) gene at codon position C3, followed by the other five genes again at the C3 codon position or wobble codon position. Similarly, for betasatellites, the highest mutation rate was found at codon position C1 (Table 5).
Table 5. Estimation of Mean Substitution and Codon Degeneracy Rates for DNA-A, ORFs, and betasatellite associated with PaLCD in India.
Population Structure
Demography structure analysis was used to estimate the degree of genetic variability (>0.08) within and between populations (Table 6). However, we found the total number of polymorphic sites (s) to be 1757, with a total of 2,808 mutations (η) having a nucleotide diversity of 0.19525 (π = 0.1) for DNA-A datasets. Similarly, the analysis was performed for ORF datasets and we found the maximum number of polymorphic sites (s) and number of mutations (η) in AC1 (Rep) and AV1 (CP) genes having nucleotide diversity π = 0.1. Moreover, for betasatellite, we found the total number of polymorphic sites (s) 834, with a total 1,482 number of mutations (η) having a nucleotide diversity (π) of 0.30509 (π = 0.3) was again high. However, the resultant maximum π value explains the non-random distribution of nucleotides throughout viral and sub-viral genome regions, which significantly contributes to a high degree of genetic variability. Therefore, the estimation suggests diverse populations within and among populations.
Table 6. Estimation of the Genetic diversity of begomoviruses (DNA-A), ORFs, and betasatellites associated with PaLCD in India.
Further, genetic diversity within and among populations was also determined by the number of haplotypes (H) and haplotype diversity (Hd). Therefore, using DnaSP software (v. 6.0; Rozas et al., 2017; Universitat de Barcelona; origin 1994), analysis was performed for sequence datasets that contained haplotype distribution among reference DNA sequences of the begomovirus population, and we found the total number of haplotypes (H) was 45, and its haplotype diversity was identified as close to 1, i.e., (Hd = 0.99). Simultaneously, among ORF datasets, the number of haplotypes (H) and haplotype diversity (Hd) were found to be distributed between a range of 31–42 and 0.96–0.99, respectively. Thus, the resultant value supports the relative contribution of genes in DNA polymorphism. Similarly, for betasatellites, we found that though the total number of haplotypes (H) is less, i.e., 17, its gene diversity is detected as high, i.e., equal to 1 (1.000). Therefore, the overall result explains the low level of sequence divergence but the high frequency of unique mutations (Table 6).
Neutrality tests were used to assess and understand the demographic selection acting on the genetic population of begomoviruses and associated satellite molecules. For evaluation, Tajima’s D test was used, which statistically reflects the negative Tajima’s D value for DNA-A datasets at −0.77977, for ORFs range between −0.42227 and −1.45344, and for associated betasatellites at −1.06710. Predominantly, the statistically significant values were negative, which indicates that a large proportion of genetic segregation might be present within sequence datasets that are unique to individual sequences. Similarly, the other parameters, such as Fu & Li’s D and Fu & Li’s F tests of population statistics, were also evaluated, resulting in negative values for DNA-A, its ORFs, and betasatellites datasets, indicating reiterating of purifying selection and population expansion, which might be due to the inherent diversity. Nevertheless, the combination of Tajima’s D, Fu & Li’s D, and Fu & Li’s F negative values for DNA-A, its ORFs, and associated satellite population revealed the conserved nature of the gene. Such evidence of nucleotide diversity might be expected when a selective sweep succeeds in the expansion of the population and when most observable segregation functioning at the nucleotide level in a population is momentary and is eventually withdrawn by purifying selection (Table 7).
Table 7. Estimation of different neutrality tests for the datasets obtained from identified begomoviruses and betasatellites associated with PaLCD in India.
Discussion
India shares a large portion of the population that depends on small-area agricultural farming for their subsistence and income. A wide variety of diseases and their infection rates have been seen to cause devastating effects both on crop yield and human persistence. Undoubtedly, the cultivation practises and the presence of tropical climate conditions in the Indian subcontinent boost the prevalence of a large number of plant viruses. Papaya leaf curl virus (PaLCuV) is found to infect papaya plants and cause PaLCD, which affects plant growth, fruit size, quality, and quantity, slowing its yield (Shahid et al., 2013; Varun et al., 2017; Bananej et al., 2021), thus accelerating the spread of viral diseases. Additionally, climate change, adaptability, and the fast distribution of vectors and viruses are of major concern for the agriculture sector as they are greatly contributing to the Indian economy.
To explore the evolution of begomoviruses across India, a full-length sequence of reported begomoviruses infecting Carica papaya was collected from NCBI and arranged into eight specific datasets containing the sequence of DNA-A, its six ORFs, and associated betasatellites. Using the MEGA X program, we calculated the best-fit nucleotide substitution model for further analysis. i.e., (GTR + G) for DNA-A, ORFs (AV1:TN93 + G + I); (AV2: HKY + G); (AC1& AC3: GTR + G); (AC2:TN93 + G); (AC4: HKY + G); and (TN93 + G + I) for betasatellites. The recurrent occurrence of recombination and nucleotide substitution alike in RNA viruses is mostly attributed to factors contributing to the high degree of genetic variability among begomovirus populations, which may significantly step up their evolution by expanding the combinations of pre-existing nucleotide segregation created by mutation (Duffy and Holmes, 2008, 2009). Accordingly, recombination and mutation are often stated as the chief contributors to genetic variability, which is the subject matter of investigation in the present study using molecular and computational efficacy.
The phylogenetic results revealed homological diversification and the mean branch length observed for DNA-A and betasatellites datasets shows the possibility of a recombinational event in the studied population of begomovirus (Lima et al., 2017). Thus, the phylogenetic dendrogram signifies the evolution of the begomoviruses with the papaya host, which strongly encourages the coadaptation of associated betasatellites. We further compared the result of the maximum likelihood phylogenetic tree from MEGA X software with the maximum clade credibility phylogenetic tree from BEAST. We found that the ML tree shows very similar topologies to the maximum clade credibility phylogenetic tree, with many nodes in the DNA-A tree receives both strong bootstrap support and high posterior probabilities, and with respect to clade formation, they are found to be similar as both trees form four major groups (Supplementary Figure S1). To exclude other biases that strengthen the significant differences between the degrees of intra and inter-species variability, we checked for different rates of transition and transversion substitutions and transition/transversion bis (R). Pervious literature states that C → T and G → A are the most common base substitution transitions in the AC4 and AV2 regions, whereas A → T, A → C, and G → T transversions are more common in the IR than in the other four regions (Begun et al., 2007; Duchêne et al., 2015). However, our results indicate that, under the conditions studied, begomovirus AV1 and AC2 genome regions are more prone to transition substitution mutations and accept more drastic amino acid changes than other genomic regions. Therefore, the estimation suggests that the role of base substitutions causes transition mutation at a higher rate in a population than single nucleotide polymorphism (Sánchez et al., 2018; Mishra et al., 2020).
The phylogeny-based partitioning method was qualitatively estimated for eight datasets, resulting in mean branch length, and was found useful in quantifying the effect of recombination events. Previous studies have revealed that recombination happens at high frequencies in begomovirus populations that use a conserved feature, i.e., a rolling circle mechanism for replicating their genomes and making them mechanistically recombination-prone, thus generating recombination breakpoints in a non-random location (Martin et al., 2011). As recombination rates are threatened for plant viruses, our experimental analysis detected a recombination event in the genomic region of eight datasets using different algorithms of RDP 4.1 software (Martin et al., 2015), showing variable major and minor parents causing an uneven distribution of recombination breakpoints, leading to genetic diversity. The AC1 gene region of the genome sequence shows the maximum number of recombination break points within nucleotide positions 1,527–2,612, followed by the AV1 gene, and the least recombination break points were observed in the AC4 region of the gene. The distribution of events was maximally detected among intra-species isolates and the least was found between inter-species isolates. Additionally, for betasatellites, a low recombination breakpoint is detected in βC1 region within nucleotide position 200–562. This indicates that the tolerant capacity could be greater for some portions of the genome towards recombination events than others, involving more divergent parental viruses (Martin and Williamson, 2005). However, the statistically measurable recombination rate seems to be lower than the mutation rate in sequence datasets. Even in such cases, recombination events act actively, but consequently, the mutation dynamics were found to be a leading force in shaping the standing genetic variability (Kumar et al., 2015).
Significant aspects of population genetics are possibly accompanied by mutation, along with recombination, neutral selection, genetic drift, and gene flow, which encourage shaping the genetic structure of populations (Xavier et al., 2021). We calculated the rate of nucleotide substitution using coalescent best fit substitution modal with strict and relaxed molecular clocks for each dataset and found the relaxed molecular clock to be promising for having a high mean substitution rate for DNA-A, ORFs, and betasatellite. However, this condition could possibly be influenced by either spontaneous mutations or the usage of error-prone DNA polymerases in virus replication (Richter et al., 2016). To find out whether our estimates of substitution rate are robust and contain clear temporal structure, we randomly reshuffled the dates of isolation for viruses in each of the eight datasets and reran BEAST using the best-fitting parameters from the actual analyses. For five datasets, i.e., AV1, AC1, AC2, and AC3 genes and betasatellites, the reshuffled control 95% HPD values excluded the mean nucleotide substitution rates estimated from the actual data, and for the remaining three datasets, i.e., DNA-A, AV2, and C4 genes, the case is different in that the reshuffled results were superimposed on the substitution rates from the real data, indicating that there was insufficient temporal structure to reliably estimate substitution rates. Importantly, all these datasets were sampled over a 21-year span. In contrast, the AV1, AC1, AC2, and AC3 genes and betasatellites had more distinct substitution rate estimates from their reshuffled controls, reflecting a clear temporal signal (Supplementary Figure S2). Moreover, we also identified codon position C3 to be significantly affected by the selection pressure acting on the population caused by mutation.
Additionally, it is important to address the key issue that refers to the uneven distribution of variation across begomovirus genomes. In this context, the combination of various factors is responsible for affecting genetic variability and acts by distributing polymorphisms in a non-random manner in the genomic regions of begomoviruses (Mishra et al., 2020). However, we calculated the degree of genetic variability and haplotype diversity within and among populations by using several parameters of DnaSP software (v. 6.0) and found a high degree of genetic variability. The resulting data explain the non-random distribution of nucleotides throughout viral and sub-viral genome regions, resulting in population diversity (Sobrinho et al., 2014). Further, haplotype diversity indicates the low level of sequence divergence but high frequency of unique mutations, causing uniqueness within and among populations.
Moreover, to understand the population genetics and selection forces acting on the begomovirus population and satellite molecules, neutrality tests were evaluated, resulted in a negative value, suggesting that populations might be influenced by purifying selection or have experienced recent expansion (Mishra et al., 2020) rather than neutral selection. Nevertheless, the combination of Tajima’s D, Fu & Li’s D, and Fu & Li’s F negative values for DNA-A, its ORFs, and associated satellite population revealed the conserved nature of the gene. Such evidence of nucleotide diversity might be expected when most observable segregation functioning at the nucleotide level in a population is momentary and is ultimately withdrawn by purifying selection. Our results indicate that begomoviruses infecting Carica papaya are not restricted to any solitary geographical region of India.
Although the number of sequence data for DNA-A, betasatellites, and particularly alphasatellites is insufficient, the current study’s findings may provide meaningful basic information that contributed significantly to the diversification of begomoviruses, particularly those causing PaLCDs, and acknowledge the evolutionary potential, particularly in the context of recombination, adaptation, genetic diversity, emergence, and evolution.
Conclusion
The present study, based on bioinformatics approaches, provides the current understanding of the genetic diversification of the begomovirus population associated with PaLCD. The coexistence of the begomovirus population and its satellite molecules, as explained by geographical distribution, indicates their interdependence in accelerating disease severity and spread among economically important crops such as papaya across India. A wide begomovirus population having a common host might be attributed to an uncontrolled evolutionary variation in its DNA genome, which is mainly driven by a high frequency of mutational changes. Furthermore, the variable frequency of mutation demonstrated the random allocation of the recombination breakpoints, leading to a diversified distribution of recombinational patterns. Moreover, this phenomenon uses different mechanisms to overwhelm selection pressure and to effectively adapt to environments with new hosts. Thus, that geographically separates begomovirus and its associated satellites. The continuous evolution of new recombinants and their satellites might lead to efficient vector transmission, expansion of host range, and breaking of resistance, which poses a threat to crop production at an alarming rate, thus affecting the agroeconomic value and disease management. However, for expanding knowledge in an area, the depth study of molecular fundamentals together with vector-mediated and host-dependent dispersal might provide an understanding of the expanding begomovirus disease complex.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding authors.
Author Contributions
AS and VP performed experiments and wrote the manuscript. AKS helped in data collection. RG and DY guided the design of the whole test scheme, and AA-S and MS critically reviewed the manuscript. All authors contributed to the article and approved the submitted version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
AA-S and MS are thankful to Sultan Qaboos University, Oman for financial support. The authors are thankful to Barbara Hohn, Friedrich Miescher Institute for Biomedical Research, Switzerland and Vinoth Kumar, National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK Campus, India for their valuable suggestions during the study.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.879413/full#supplementary-material
Supplementary Figure S1 | Maximum clade credibility phylogenetic tree associated with papaya leaf curl disease begomoviruses in India aligned using BEAST software v.1.10’s FigTree program. Tree is formed using the relax.tree file, which is automatically rooted by the use of a relaxed molecular clock.
Supplementary Figure S2 | Graphical representation of a temporal signal generated by the BEAST software; Estimated rates of nucleotide substitution from actual date of isolation for begomoviruses, ORFs, and betasatellites associated with PaLCD (eight datasets) with 95% HPD values, are shown in dark grey. The reshuffled mean estimated rates for eight datasets with 95% HPD values are shown in light grey.
Footnotes
References
Bananej, K., Shafiq, M., and Shahid, M. S. (2021). Association of cotton leaf curl Gezira virus with tomato leaf curl betasatellite infecting Carica papaya in Iran. Aust. Plant Dis. Notes. 16, 1–4. doi: 10.1007/s13314-021-00417-z
Begun, D. J., Holloway, A. K., Stevens, K., Hillier, L. W., Poh, Y. P., and Hahn, M. W. (2007). Population genomics: whole-genome analysis of polymorphism and divergence in Drosophila simulans. PLoS Biol. 5:e310. doi: 10.1371/journal.pbio.0050310
Boni, M. F., Posada, D., and Feldman, M. W. (2007). An exact nonparametric method for inferring mosaic structure in sequence triplets. Genetics 176, 1035–1047. doi: 10.1534/genetics.106.068874
Brown, J. K., Fauquet, C. M., Briddon, R. W., Zerbini, M., Moriones, E., and Navas Castillo, J. (2012). “Geminiviridae” in Virus Taxonomy-Ninth Report of the International Committee on Taxonomy of Viruses. eds. A. M. Q. King, M. J. Adams, E. B. Carstens, and E. J. Lefkowitz (San Diego, CA: Elsevier Academic Press), 351–373.
Duchêne, S., Ho, S. Y., and Holmes, E. C. (2015). Declining transition/transversion ratios through time reveal limitations to the accuracy of nucleotide substitution models. BMC Evol. Biol. 15:36. doi: 10.1186/s12862-015-0312-6
Duffy, S., and Holmes, E. C. (2008). Phylogenetic evidence for rapid rates of molecular evolution in the single-stranded DNA begomovirus tomato yellow leaf curl virus. J. Virol. 82, 957–965. doi: 10.1128/JVI.01929-07
Duffy, S., and Holmes, E. C. (2009). Validation of high rates of nucleotide substitution in Geminiviruses: phylogenetic evidence from east African cassava mosaic viruses. J. Gen. Virol. 90, 1539–1547. doi: 10.1099/vir.0.009266-0
FAOSTAT (2019). Food and Agriculture organization of the United Nation Database. Available at: http://www.fao.org/faostat/en/#data/QC (Accessed June 26, 2021).
Fougat, R. S., Purohit, A. R., Kumar, S., Parekh, M. J., and Kumar, M. (2015). SSR based genetic diversity in Abelmoschus species. Indian J. Agric. Sci. 26, 172–178. doi: 10.1007/s13562-016-0378-2
Gibbs, M. J., Armstrong, J. S., and Gibbs, A. J. (2000). Sister-scanning: A Monte Carlo procedure for assessing signals in recombinant sequences. Bioinformatics 16, 573–582. doi: 10.1093/bioinformatics/16.7.573
Guo, T., Guo, Q., Cui, X., Liu, Y., Hu, J., and Liu, S. (2015). Comparison of transmission of papaya leaf curl China virus among four cryptic species of the whitefly Bemisia tabaci complex. Sci. Rep. 5:15432. doi: 10.1038/srep15432
Hanley-Bowdoin, L., Bejarano, E. R., Robertson, D., and Mansoor, S. (2013). Geminiviruses: masters at redirecting and reprogramming plant processes. Nat. Rev. Microbiol. 11, 777–788. doi: 10.1038/nrmicro3117
Jenkins, G. M., Rambaut, A., Pybus, O. G., and Holmes, E. C. (2002). Rates of molecular evolution in RNA viruses: a quantitative phylogenetic analysis. J. Mol. Evol. 54, 156–165. doi: 10.1007/s00239-001-0064-3
Kumar, R. V., Singh, A. K., Singh, A. K., Yadav, T., Basu, S., Kushwaha, N., et al. (2015). Complexity of begomovirus and betasatellite populations associated with chilli leaf curl disease in India. J. Gen. Virol. 96, 3143–3158. doi: 10.1099/jgv.0.000254
Kumar, S., Stecher, G., Li, M., Knyaz, C., and Tamura, K. (2018). MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. doi: 10.1093/molbev/msy096
Lima, A. T. M., Silva, J. C. F., Silva, F. N., Castillo-Urquiza, G. P., Silva, F. F., and Seah, Y. M. (2017). The diversification of begomovirus populations is predominantly driven by mutational dynamics. Virus Evol. 3:vex0005. doi: 10.1093/ve/vex0005
Lozano, G., Trenado, H. P., Fiallo-Olivé, E., Chirinos, D., Geraud-Pouey, F., and Briddon, R. W. (2016). Characterization of non – coding DNA satellites associated with sweepoviruses (genus Begomovirus, Geminiviridae) – definition of a distinct class of begomovirus – associated satellite. Front. Microbiol. 7:162. doi: 10.3389/fmicb.2016.00162
Martin, D. P., Lefeuvre, P., Varsani, A., Hoareau, M., Semegni, J. Y., Dijoux, B., et al. (2011). Complex recombination patterns arising During Geminivirus Coinfections preserve and demarcate biologically important intra-genome interaction networks. PLoS Pathog. 7:e1002203. doi: 10.1371/journal.ppat.1002203
Martin, D. P., Murrell, B., Golden, M., Khoosal, A., and Muhire, B. (2015). RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. 1:vev003. doi: 10.1093/ve/vev003
Martin, D. P., and Rybicki, E. (2000). RDP: detection of recombination amongst aligned sequences. Bioinformatics 16, 562–563. doi: 10.1093/bioinformatics/16.6.562
Martin, C., and Williamson, P. D. (2005). RDP2: recombination detection and analysis from sequence alignments. Bioinformatics 21, 260–262. doi: 10.1093/bioinformatics/bth490
Marwal, A., Nehra, C., Verma, R. K., Mishra, M., Srivastava, D., Choudhary, P., et al. (2021). First report of papaya leaf curl virus and its associated papaya leaf curl betasatellite infecting Catharanthus roseus plants in India. J. Hortic. Sci. Biotechnol. 96, 808–813. doi: 10.1080/14620316/2021/1912646
Mishra, M., Verma, R. K., Marwal, A., Sharma, P., and Gaur, R. K. (2020). Biology and interaction of the natural occurrence of distinct Monopartite Begomoviruses associated with satellites in Capsicum annum from India. Front. Microbiol. 11:512957. doi: 10.3389/fmicb.2020.512957
Moreno-Delafuente, A., Garzo, E., Moreno, A., and Fereres, A. (2013). A plant virus manipulates the behavior of its whitefly vector to enhance its transmission efficiency and spread. PLoS One 8:e61543. doi: 10.1371/journal.pone.0061543
Nascimento, A. K. Q., Lima, J. A. A., Nascimento, A. L. L., and Beserra, E. A. Jr. (2010). Biological, physical, and molecular properties of a Papaya lethal yellowing virus isolate. Plant Dis. 94, 1206–1212. doi: 10.1094/PDIS-11-09-0733
Nawaz-ul-Rehman, M. S., and Fauquet, C. M. (2009). Evolution of geminiviruses and their satellites. FEBS Lett. 583, 1825–1832. doi: 10.1016/j.febslet.2009.05.045
Nehra, C., Marwal, A., Verma, R. K., Mishra, M., Sharma, P., and Gaur, R. K. (2019). Papaya yellow leaf curl virus: a newly identified Begomovirus infecting Carica papaya L. from the Indian Subcontinent. J. Hortic. Sci. Biotechnol. 94, 475–480. doi: 10.1080/14620316.2019.1570827
Padidam, M., Sawyer, S., and Fauquet, C. M. (1999). Possible emergence of new geminiviruses by frequent recombination. Virology 265, 218–225. doi: 10.1006/viro.1999.0056
Posada, D., and Crandall, K. A. (2001). Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proc. Natl. Acad. Sci. U. S. A. 98, 13757–13762. doi: 10.1073/pnas.241370698
Rambaut, A., Drummond, A. J., Xie, D., Baele, G., and Suchard, M. A. (2018). Posterior summarization in Bayesian Phylogenetics using tracer 1.7. Syst. Biol. 67, 901–904. doi: 10.1093/sysbio/syy032
Ramsden, C., Melo, F. L., Figueiredo, L. M., Holmes, E. C., and Zanotto, P. M. (2008). High rates of molecular evolution in hantaviruses. Mol. Biol. Evol. 25, 1488–1492. doi: 10.1093/molbev/msn093
Richter, K. S., Götz, M., Winter, S., and Jeske, H. (2016). The contribution of translation synthesis polymerases on geminiviral replication. Virology 488, 137–148. doi: 10.1016/j.virol.2015.10.027
Rozas, J., Ferrer-Mata, A., Sánchez-DelBarrio, J. C., Guirao-Rico, S., Librado, P., and Ramos-Onsins, S. E. (2017). DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol. Biol. Evol. 34, 3299–3302. doi: 10.1093/molbev/msx248
Sánchez, C. S., Domínguez, H. G., Díaz, M. L., Tomás, D. M., Navas, C. J., Moriones, E., et al. (2018). Differential shape of Geminivirus mutant spectra across cultivated and wild hosts with invariant viral consensus sequences. Front. Plant Sci. 9:932. doi: 10.3389/fpls.2018.00932
Saxena, S., Hallan, V., Singh, B. P., and Sane, P. V. (1998). Nucleotide sequence and inter geminiviral homologies of the DNA A of papaya leaf curl geminivirus from India. Biochem. Mol. Biol. Int. 45, 101–113. doi: 10.1080/15216549800202472
Shahid, M. S., Yoshida, S., Khatri-Chhetri, G. B., Briddon, R. W., and Natsuaki, K. T. (2013). Complete nucleotide sequence of a monopartite Begomovirus and associated satellites infecting Carica papaya in Nepal. Virus Genes 46, 581–584. doi: 10.1007/s11262-013-0888-0
Sobrinho, R. R., Xavier, C. A. D., Pereira, H. M. B., Lima, G. S. A., Assunção, I. P., Mizubuti, E. S. G., et al. (2014). Contrasting genetic structure between two begomoviruses infecting the same leguminous hosts. J. Gen. Virol. 95, 2540–2552. doi: 10.1099/vir.0.067009-0
Soni, S. K., Mishra, M. K., Mishra, M., Kumari, S., Saxena, S., Shukla, V., et al. (2022). Papaya leaf curl virus (PaLCuV) infection on papaya (Carica papaya L.) plants alters anatomical and physiological properties and reduces bioactive components. Plan. Theory 11:579. doi: 10.3390/plants11050579
Stanley, J., Bisaro, D. M., Briddon, R. W., Brown, J. K., Fauquet, C. M., Harrison, B. D., et al. (2005). “Geminiviridae” in VIIIth Report of the International Committee on Taxonomy of Viruses. Virus Taxonomy. eds. C. M. Fauquet, M. A. Mayo, J. Maniloff, U. Desselberger, and L. A. Ball (London: Elsevier/Academic Press), 1163–1169.
Suchard, M. A., Lemey, P., Baele, G., Ayres, D. L., Drummond, A. J., and Rambaut, A. (2018). Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4:vey016. doi: 10.1093/ve/vey016
Thompson, J. D., Higgins, D. G., and Gibson, T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680. doi: 10.1093/nar/22.22.4673
Varma, M., and Malathi, V. G. (2003). Emerging geminivirus problem: A serious threat to crop production. Annals Appl. Biol. 142, 145–164. doi: 10.1111/j.1744-7348.2003.tb00240.x
Varun, P., Ranade, S. A., and Saxena, S. (2017). A molecular insight into papaya leaf curl – a severe viral disease. Protoplasma 254, 2055–2070. doi: 10.1007/s00709-017-1126-8
Varun, P., and Saxena, S. (2018). Association of tomato leaf curl Gujarat virus and tomato leaf curl Bangladesh betasatellite on papaya showing typical leaf curl symptoms in North India. 3 Biotech. 8:243. doi: 10.1007/s13205-018-1254-7
Xavier, C. A. D., Godinho, M. T., Mar, T. B., Ferro, C. G., Sande, O. F. L., Silva, J. C., et al. (2021). Evolutionary dynamics of bipartite begomoviruses revealed by complete genome analysis. Mol. Ecol. 30:171728. doi: 10.1111/mec.15997
Yadav, J., Yadav, S., and Mishra, S. (2017). Experimental evidences showing nutritional and medicinal property of Carica papaya plant. Int. J. Sci. Res. 6, 21–25. doi: 10.21275/ART20178569
Keywords: papaya leaf curl diseases, phylogenetic analysis, recombination, genetic variability, population structure
Citation: Srivastava A, Pandey V, Sahu AK, Yadav D, Al-Sadi AM, Shahid MS and Gaur RK (2022) Evolutionary Dynamics of Begomoviruses and Its Satellites Infecting Papaya in India. Front. Microbiol. 13:879413. doi: 10.3389/fmicb.2022.879413
Edited by:
Mohammad Arif, University of Hawaii at Manoa, United StatesReviewed by:
Donna Ria del Rosario Josue-Canacan, Mindanao State University – General Santos, PhilippinesDiwaker Tripathi, University of Washington, United States
Alejandro Olmedo-Velarde, University of Hawaii, United States
Karen Barandoc Alviar, University of the Philippines Los Baños, Philippines
Vishal Singh Negi, The University of Georgia, Tifton Campus, United States
Copyright © 2022 Srivastava, Pandey, Sahu, Yadav, Al-Sadi, Shahid and Gaur. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Muhammad Shafiq Shahid, bXNoYWhpZEBzcXUuZWR1LmNvbQ==; R. K. Gaur, Z2F1cnJhamFyc2hpQGhvdG1haWwuY29t
†These authors have contributed equally to this work