Skip to main content

ORIGINAL RESEARCH article

Front. Immunol., 28 November 2023
Sec. Systems Immunology
This article is part of the Research Topic Systems immunology to advance vaccine development View all 16 articles

Genomic annotation for vaccine target identification and immunoinformatics-guided multi-epitope-based vaccine design against Songling virus through screening its whole genome encoded proteins

  • 1Department of Biochemistry, Abdul Wali Khan University Mardan, Mardan, Pakistan
  • 2Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia
  • 3Department of Microbiology and Virology, Astana Medical University, Astana, Kazakhstan
  • 4Preventive Medicine, Astana Medical University, Astana, Kazakhstan

Songling virus (SGLV), a newly discovered tick-borne orthonairovirus, was recently identified in human spleen tissue. It exhibits cytopathic effects in human hepatoma cells and is associated with clinical symptoms including headache, fever, depression, fatigue, and dizziness, but no treatments or vaccines exist for this pathogenic virus. In the current study, immunoinformatics techniques were employed to identify potential vaccine targets within SGLV by comprehensively analyzing SGLV proteins. Four proteins were chosen based on specific thresholds to identify B-cell and T-cell epitopes, validated through IFN-γ epitopes. Six overlap MHC-I, MHC-II, and B cell epitopes were chosen to design a comprehensive vaccine candidate, ensuring 100% global coverage. These structures were paired with different adjuvants for broader protection against international strains. Vaccine constructions’ 3D models were high-quality and validated by structural analysis. After molecular docking, SGLV-V4 was selected for further research due to its lowest binding energy (-66.26 kcal/mol) and its suitable immunological and physiochemical properties. The vaccine gene is expressed significantly in E. coli bacteria through in silico cloning. Immunological research and MD simulations supported its molecular stability and robust immune response within the host cell. These findings can potentially be used in designing safer and more effective experimental SGLV-V4 vaccines.

GRAPHICAL ABSTRACT
www.frontiersin.org

Graphical Abstract

1 Introduction

The recent discovered Songling virus (SGLV) in China marked a significant moment in pathogenic viruses (1). Its genomic configuration exhibited profound structural homologies with reputable orthonairoviruses, spanning sequence similarities from 46.5% to 65.7%. Phylogenetic analyses situated SGLV distinctly within the Tamdy orthonairovirus group, encapsulating its place within the broader framework of the Nairoviridae family. Microscopic scrutiny validated SGLV’s morphological congruence with the hallmark attributes of orthonairoviruses (2). Notably, it’s essential to mention that SGLV is a single-stranded RNA (ssRNA) virus, contributing to its classification within this viral group.

Functionally, isolated SGLV strains sourced from patients exhibited the capacity to induce prominent cytopathic effects in human hepatoma cells, accentuating its potential for pathogenesis. Between 2017 and 2018, SGLV’s impact on human health materialized, materializing as symptoms encompassing headaches, fever, depression, fatigue, and dizziness. Serological investigations illuminated a pivotal facet: a significant 69% of patients exhibited the development of virus-specific antibody responses during the acute phase (3).

Remarkably, the absence of discernible SGLV viral RNA and the conspicuous scarcity of specific antibodies within healthy cohorts underscored its nuanced selectivity in its interaction with human physiology. Beyond human hosts, SGLV found ecological footing within ticks such as Ixodes crenulatus, Haemaphysalis longicornis, Haemaphysalis concinna, and Ixodes persulcatus within the northeastern precincts of China. Significantly, the viral L segments of SGLV came to the fore, manifesting in 2.2% of spleen samples from great gerbils. BLASTn alignments divulged a compelling narrative of genetic resonance, as the SGLV in great gerbils demonstrated a remarkable alignment of 93.7% (236/252 nucleotides) and 94.0% (78/83 amino acids) with its counterpart detected in human patients with a history of tick encounters within the confines of northeastern China (1). This multifaceted interplay of genetics, ecology, and clinical ramifications has woven a comprehensive tapestry, enriching our comprehension of SGLV’s influence on human health within the geographical landscape of northeastern China (1, 4).

Reverse vaccinology is a cutting-edge strategy that has been widely applied to the introduction of new vaccinations. The strategy aims to combine immunogenomics and immunogenetics with bioinformatics for the development of novel vaccine targets (5). With the recent advancements in genome or protein sequence databases, this rapid in silico method has gained significant appeal (6). An innovative vaccine fuses CTL and HTL segments with specialized linkers and adjuvants, showcasing high antigenicity, non-allergenicity, and stability. Molecular docking finds strong binding energy to TLRs, promising robust immunogenic responses. Immune simulation employed to simulates a natural immune response, where the top candidate activates essential immune components (IgG, IgM, T-cells, B-cells, and cytokines), promising protection against the Songling virus (7). Further investigations, including molecular dynamics and computational cloning for efficient E. coli expression, solidify the vaccine’s prospects. Vaccine may recognize and boost immunity against infection in the body. Therefore, predicting allergenicity is a crucial stage in the creation of a neuropeptide vaccine. Immunoinformatics techniques and tools were utilized to design a non-allergic, immunogenic, and thermostable recombinant vaccine against the Songling virus, and we expect wet lab researchers to confirm our prediction.

2 Methodology

The systematic methodology employed in this study to design a multi-epitope vaccine construct targeting the Songling virus (SGLV) (Figure 1).

FIGURE 1
www.frontiersin.org

Figure 1 Systematic flow chart of in-silico based multi-epitope vaccine design.

2.1 Protein sequence retrieval and filtration

Songling virus proteome data was downloaded from the National Centre for Biotechnology Information (NCBI) database under reference taxonomy ID: 2795181, and sequences were verified from Virus Pathogen Resource (ViPR) (8, 9). The protein sequence data for SGLV was in FASTA format and submitted to NCBI on May 6, 2023. Different parameters of CD-hit were used for obtaining 85% sequence similarity and removing redundancy to acquire non-paralogous sequences of proteins (10). BLASTp of NCBI was used for sequence homology to human proteins with thresholds of percent identity < 35, query coverage 75, e-value 10-4, bit score <100, and others used as defaults (11). For identification of the allergenicity of SGLV proteins, the AllerTop online server was used (12). The antigenicity was determined by setting the 0.4 parameter in VaxiJen online server (13). For the identification of the toxicity of SGLV proteins, the Toxinpred 3.0 online server was implemented (14).

2.2 Prediction of T-cell and B-cell

MHC-I epitopes were predicted using the IEDB MHC-I stickiness predictions program (15) accessed on June 23, 2022. The prediction algorithm used the SMM approach, and sequences were provided in FASTA format. It has been determined that humans will be the host species. The output format was set to XHTML tables, and all other options and parameters were left as default. Similarly, the IEDB MHC-II binding prediction tool (16) accessed on June 25, 2023, was used to predict the MHC-II epitopes by selecting the SMM prediction method. Data was provided in FASTA format. The HLA-DR was chosen as the species/locus couple, and then alleles were chosen using the typical length values associated with each species/locus (17). The other variables were kept at their default settings, and the final result format was set to XHTML table.

The B cell is an important element of the body’s defense system. It is responsible for secreting antibodies that provide long-term immunity (18). For the detection of a continuously growing 12-mer long B-cell lymphoid (BCL) for the selected amino acids with a threshold number of 8.0, ABCPred (http://crdd.osdd.net/raghava/abcpred/) tool was used for this analysis (18). linear B-Cell and MHCI & II overlapped epitopes are selected based on physiochemical properties by employing VaxiJen v2.0, AllerTOP v.2.0, & ToxinPred 3.0 tools (12, 13, 19).

2.3 Population coverage

The IEDB population coverage assessment tool (https://tools.iedb.org/population/) was utilized to check the designed vaccine had successfully covered the entire world population (20). Populations research in China, eastern the People’s Republic, southern the Caribbean, the Southeast Asian region, and the ocean was conducted to understand the global nature of the Songling virus pandemic. Default values were used to evaluate coverage for MHC class I and class II HLA binding alleles (15). This strategy takes advantage of the worldwide distribution of HLA-binding genotypes to calculate the abundance of certain epitopes.

2.4 Multi-epitope vaccine construct design

Effective vaccine construct design and proper epitope separation depend on all candidate epitopes being connected together through linkers and adjuvants. The B-cell epitope was linked to the CTL targets using the EAAAK, AAY linkers, and the HTL epitopes were linked to the CTL targets using the GPGPG linker (Figure 2). To facilitate future glycosylation with a carrier protein, a cysteine residue was included in the N-terminal of the multi-epitope vaccine construct (21). The antigenicity, allergenicity, toxicities, and physicochemical features of the vaccine construct were analyzed further using the ProtParam tool (https://web.expasy.org/protparam/) (19).

FIGURE 2
www.frontiersin.org

Figure 2 Graphical illustration of multi-epitope vaccine construct, showing CTL, and T-cell epitopes joined by appropriate links and Adjuvant.

2.5 2D and 3D structure modeling and validation

PDBsum (http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/) was used to determine the secondary structure of the new vaccine construct, which was then validated through the PSIPRED server (http://bioinf.cs.ucl.ac.uk/psipred/) (2224). SWISS Model (https://swissmodel.expasy.org/) was used to model proteins, and then ProSAweb Server (https://prosa.services.came.sbg.ac.at/prosa.php) was used to analyze protein structure and validate the model (25, 26). Godard plot analysis utilizing the ERRAT server (https://servicesn.mbi.ucla.edu/ERRAT/) and RAMPAGE server (http://mordred.bioc.cam.ac.uk/rapper/rampage.php) (27).

2.6 Molecular docking with TLRs receptors

Employing TLRs alongside their precisely designed synthetic ligands in vaccines can initiate a potent chain of cytokines, crucial for robust immune reactions (28). Understanding the pattern of interactions among design vaccines with TLR3, TLR4, and TLR8 immune cell receptors is crucial for efficiently inducing immunological responses. The vaccine constructs were docked into the human TLR3 receptor (PDB ID: 2a0z) (29), TLR4 receptor (PDB ID: 4G8A) and TLR8 receptor [PDB ID: 3w3m (2931)], using web servers Hdock (http://hdock.phys.hust.edu.cn/) and HawkDock (http://cadd.zju.edu.cn/hawkdock/), in order to evaluate the chemical reactions between immune receptors (TLR3, TLR4, and TLR8) and vaccine constructs (V1, V2, V3, and V4) (32). Another webserver HADDOCK (High ambiguity driven protein–protein Docking server) was utilized to generates informative visual representations of the docking outcome, facilitating a comprehensive analysis of the results. These graphical plots allow for easy comparison of the top-docked structure with the complete set of generated structures, providing insights into key parameters such as docking score and RMSD (root mean square deviation) (33).

2.7 MD simulation

The best docking results for the SGLV-V4-TLR4 molecule were used to study a chemical dynamics (MD) research simulation. MD simulations, energy efficiency, and protein flexibility were all calculated using the iMODS web server (https://imods.iqfr.csic.es/) (34). iMODS is based on normal mode analysis (NMA) in the internal (dihedral) coordinates of macromolecules that naturally reproduce the collective functional movements of biological macromolecules. Using these modes, iMODS builds pathways for functional transitions between two proteins with homologous structures. The server can simulate potential with several coarse-grained atomic representations and provides an enhanced arrow model based on an affine model to describe the complicated domain dynamics of macromolecules. The service analyses the dynamic molecular structure and the docked protein structure with other ligands as an amino acid of interest in order to deliver elastic network-related data according to NMA, which is equal for the particular instance of deform Eigenvalues, which changes the B-factor (mobility profiles), along with a variation map. The SGLV-V4-TLR4 complex docked PDB file was uploaded to the iMODS service, and results were obtained with all parameters set to their default values (34).

2.8 Immune simulation

The C-ImmSim webserver (http://www.cbs.dtu.dk/services/C-ImmSim-10.1/) was used to model computational immunological simulation of our prioritized vaccine design. This platform employs a potent combination of predictive modeling techniques, including Position-Specific Scoring Matrices (PSSM) and a variety of cutting-edge machine-learning algorithms, to assess and predict the cellular and humoral responses elicited by our antigenic vaccine candidate (35). The application leverages antigenic peptide sequences and lymphocyte receptors to replicate the intricate dynamics of immunogenic responses. Throughout our investigation, we precisely observed to a standard clinical protocol, administering two doses of the vaccine with a four-week interval to assess immune responses (36). Our focus lay on six specific human leukocyte antigens: HLA-A0101, HLA-A0201, HLA-B0702, HLA-B3901, HLA-DRB10101, and HLA-DRB10401, each monitored at time intervals of 1, 84, and 168 hours. Immune simulation was executed using the application’s default settings, encompassing 1000 iterative steps (37).

2.9 Codon optimization

The JAVA Codon optimization Tool (Jcat) (http://www.jcat.de) was used to optimize the coding and execute a reverse translation of the sequence of amino acid sequences for the suggested immunization (38). After that, an E. coli production gene vector called pET28 was used alongside Snapgene version 5.2 to digitally clone the genetic code sequence (39).

2.10 Prediction of the vaccine mRNA secondary structure

The Transcription and Translation web based Tool (http://biomodel.uah.es/en/lab/cybertory/analysis/trans.htm) was employed to acquire the mRNA sequence of the vaccine. To predict the secondary structure of the vaccine mRNA, two web-based servers, Mfold v2.3 (http://www.unafold.org/mfold/applications/rna-folding-form-v2.php) (40) and RNAfold (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi) (41), were utilized. The primary outcome of interest centered on the minimum free energy (expressed in units of Kcal/mol), with lower values indicating a greater degree of stability within the mRNA’s folding structure.

3 Results

3.1 Proteins prediction of SGLV vaccine candidate

The complete proteome of the SGLV strain, containing 40 proteins from different strains across the world, was extracted from NCBI in FASTA format (Supplementary File S1). Utilizing CD-hit, redundancy was minimized, and human blast yielded four distinct proteins (Supplementary File S2). Subsequently, we screened these proteins for allergenicity, antigenicity, and toxicity, identifying optimal candidates with high antigenicity, non-allergen, and non-toxic properties for epitope prediction (Supplementary Table 1).

3.2 Prediction of T-cell and B-cell and population coverage analysis

For further analysis, Four proteins are selected to recognize the lead epitopes for producing a chimeric vaccine construct against SGLV. T-cell (major histocompatibility complex class I and class II) epitopes were determined for the selected proteins using the IEDB server, with an IC50 threshold of 50 nM. The ABCpred scores reached greater than 0.8, and the specificities of the estimated ubiquitous B-cell epitopes were 75%. Vaccines were developed based on predictions of twelve (13) overlapping lead regions for each prioritized protein. the top 12 epitopes based on their antigenicity, IFN positivity, toxicity, and allergenicity (Table 1). The main goal was to recognize lead epitopes with the potential to induce humoral and cell-mediated immunogenic responses and host interferons.

TABLE 1
www.frontiersin.org

Table 1 IFN-g epitope prediction, investigation of allergenicity and toxicity, and prediction of overlapping T- and B-cell epitopes.

The epitopes chosen exhibited 100% coverage across the global population (Supplementary Table 2). Analysis from the IEDB database indicated a notably high population coverage for these predicted epitopes, particularly in regions significantly impacted by SGLV, such as east Asia, Europe and south east Asia (Figure 3).

FIGURE 3
www.frontiersin.org

Figure 3 The population coverage was determined using the IEDB webserver, (A) population coverage across the world’s countries and (B) population coverage across different ethnicities.

3.3 Multi-epitope vaccine design

To generate a multi-epitope vaccine EAAAK, CTL, HTL, and GGGS/HEYGAEALERAG linkers and adjuvants were used. When administered intramuscularly, vaccine containing linkers offer superior protection against each epitope (42). Increased immunogenic responses were achieved by coupling the epitopes to various adjuvants, such as HBHA protein molecules, beta-defensin, 50S ribosome enzyme L7/L12 adjuvants, and N-terminally abbreviated HBHA comparable amino acid sequences. Immunization strategies have used the PADRE peptide sequence to protect against difficulties caused by local variations in HLA-DR. Previous studies have shown that vaccine formulations including PADRE provide enhanced immune protection and high cytotoxic T lymphocyte (CTL) responses, as shown in Supplementary Table 3.

3.4 Immunological and physiochemical properties

Based on their immunological features, none of the immunization strategies were found to be harmful or allergic. Each one of the multi-epitope vaccine formulations has a substantial antigenic property, as demonstrated by antigenicity scores > 0.8 as estimated by the VaxiJen 2.0 server. Using cross-validation on the peptide sequence based on established datasets, VaxiJen 2.0 determines the antigenicity of viral sequences and identifies their protective properties. Each structure VaxiJen 2.0 score fell between 0.4333 and 0.5197, which is the same as the default threshold for viruses (13). Using the ProtParam service, we were able to determine the physiochemical parameters of the vaccine compositions and found that the molecular weights of each epitope in these innovations ranged from 30 kDa to 55 kDa. The selected vaccine designs have GRAVY values around -0.128 and -0.310, indicating that they are hydrophilic. The numerical pI values fell between 8.87 and 10.07. The thermostability of these structures was demonstrated by aliphatic index values between 69.09 and 82.50. The stability of these constructs at different temperatures was projected to be shown by their unpredictability index values, which ranged between 30.93 and 41.81 (Table 2). The only real difference among the designs was the adjuvant; therefore, none of them changed much in terms of their physicochemical qualities. The vaccine designs’ ability to elicit robust immunogenic reactions in the human host was deduced from an examination of their immunogenic and physiochemical properties (43). More experimental work is needed to confirm the reliability of these results.

TABLE 2
www.frontiersin.org

Table 2 Physiochemical properties of the vaccine constructs using ProtParam server and JCAT server.

3.5 Structures modeling, validation, and refinement

Computational approaches that anticipated secondary structure components were used to analyze the structural characteristics of the vaccine constructs. The PDBsum server displays proteins in their residue conservation and 2D structure. It shows which parts of the protein are not the same (colored blue) and which parts are very similar (colored red). Figure 4A. Similarly, the PSIPRED 4.0 server, which uses position-specific scoring matrices (PSSM), was utilized to predict transmembrane helices and topology within the peptide sequence, as well as identify fold and domain regions, as shown in Supplementary Figure 4. A stable and functional 3D structure of a vaccine is crucial for studying its molecular interactions with immune receptor proteins. Figure 4B displays the vaccine constructions predicted by the homology modeling techniques implemented in the Swiss Modelling server; these constructs were further refined by the DeepRefiner web server and submitted to a physical validation study. The binding energy of the JSmol structure is shown in Figure 4C. 73.7% of the V1 construct, 82.8% of the V2, 91.5% of the V3, and 97.4% of the V4 acids remained in the plots’ favorable region (Figure 4D), indicating that the vaccine constructions were highly stable. The improved vaccine designs had ERRAT quality ratings between 58% and 97%. The ProSA-web server found that the Z score of all vaccine constructions might range from -0.88 to -4.71 (Figure 4E). Table 3 displays the 3D structural validation of vaccine constructs.

FIGURE 4
www.frontiersin.org

Figure 4 SGLV-V4 three-dimensional structural analysis, refinement, and validation (A) Protein’s Secondary Structure with Graceful Elements: strands (elegant pink arrows), helices (royal purple springs), and captivating motifs in shades of red (-hairpins, mesmerizing -turns, and more), (B) The Swiss Model designed a 3D model of the multi-epitope vaccination using a homology modeling method. (C) Binding energy of the JSmol structure (D) ProSA-web yields a Z-score of -4.52. (E) Ramachandran plot analysis reveals 90% of the residues, 20% are in the allowed region, and 1% are in the prohibited portion of the plot.

TABLE 3
www.frontiersin.org

Table 3 3D structural validation of vaccine constructs via ERRAT, PROCHECK (Ramachandran plot favored region), and ProSA-Web Server.

3.6 Molecular docking

Molecular docking is used for predicting the suitable binding between multi-epitope vaccine (MEV) and receptor molecules. Human surface TLR3, TLR4, and TLR8 immune system receptors were used to dock MEV with the help of Hdock Server (a blind docking technique) and Hawkdock Server. In the result, only one structure is prioritized for each dock based on its high score and lowest binding energy (Table 4). And the 3D structure of each docking is shown in Supplementary Figures 1–3. In this study, the binding energy of the V4 was found to be lower with TLR4 area by a significant margin (-66.26 kcal/mol) as compared to docking with TLR3 and TLR8 receptors. The prioritized complex TRL4-V4 was then assigned to HADDOCK to evaluate various parameters in our analysis, including HADDOCK scores, cluster size, van der Waals energy, electrostatic energy, desolvation energy, restraints violation energy, buried surface area, and Z score (Figure 5). Notably, Cluster 6 exhibited exceptional characteristics with a Z score of -2.0, HADDOCK scores of -9.5, a cluster size of 7, van der Waals energy of -26.2, electrostatic energy of -309.1, desolvation energy of -2.5, restraints violation energy of 2998.6, and a substantial buried surface area of 1708.2. Consequently, we selected the most promising structure from Cluster 6 for molecular dynamics simulation. The docking investigation reveals that the vaccine designs have strong binding capabilities to the TLR4 protein.

TABLE 4
www.frontiersin.org

Table 4 Docking scores and Binding energies of multi-epitope vaccine constructs and TLRs.

FIGURE 5
www.frontiersin.org

Figure 5 (A) The docked complex of TRL4-V4. The TLR4 receptor is depicted in brown, while yellow represents the SGLV-V4 vaccine construct. (B) Haddock score against a fraction of frequent contacts. (C) Haddock score against ligand RMSD. (D) Electrostatic Solvation Energy (EDESOLV) against Initial-RMSD in Molecular Simulations (I-RMSD), (E) van der Waals energy against interface I-RMSD, (F) Electrostatic energy (Eelec) of docked molecule against interface-RMSD, (G) (Ensemble-Averaged Interaction-Reweighted Simulation) EAIR outperforms I-RMSD in predicting the structure of receptor-ligand complexes.

3.7 Molecular dynamic simulation

The TLR4 receptor was chosen due to its lowest binding energy with the SGLV-V4 construct. To comprehensively evaluate the stability of proteins and the enthalpy efficiency within SGLV-V4-TLR4 complexes, molecular dynamics (MD) simulations were employed. In parallel, the iMODS platform facilitated an in-depth analysis of atomic and molecular movements within the vaccine’s biological context, elucidating macromolecular mobility via the normal mode analysis (NMA) methodology. For a more detailed understanding, Figure 6 provides a visual representation of the outcomes stemming from MD simulations and NMA conducted on the SGLV-V4 and TLR4 docked complexes. Drawing from the work of Ichiye and Karplus in 1991 (44), we utilized Equation 2 in conjunction with C Cartesian coordinates to compute the correlation matrix, thereby revealing the intricate interplay of atoms through an elastic network model. Each point on the graph symbolizes a spring connecting specific atom pairs, with varying shades of grey denoting differing levels of stiffness (as seen in Figure 6A). The complexity of molecular interactions within the system is further elucidated by the covariance map of the complex. By utilizing covariance analysis, this map highlights correlated (red), uncorrelated (white), or anti-correlated (blue) atomic movements, thus providing valuable insights into the dynamics of the complex molecule (Figure 6B). Additionally, eigenvalues, reflecting the stiffness of motion, hold a direct proportionality to the energy required for structural deformations. A lower eigenvalue indicates greater ease of deformation for the carbon alpha atoms. Notably, the SGLV-V4-TLR4 complex exhibited an eigenvalue of 2.395982e-05, signifying its stability (as observed in Figure 6C).

FIGURE 6
www.frontiersin.org

Figure 6 MD simulation results of SGLV-V4 with TLR4 (A) The elastic network model uses springs between atoms, indicated by colored dots for stiffness. (B) Covariance matrix Shoes paired residue mobility motions, i.e., uncorrelated (white), correlated (red), and anti-correlated (blue). (C) Eigenvalues, (D) Averaged RMS indicated by B-factor, (E) Deformability, and (F) Shows variances in Colored bars (purple) represent individuals, and cumulative is represented by green.

Furthermore, NMA-derived B-factor analysis was instrumental in portraying the relative amplitude of atomic displacements within the molecular complex. Figure 6D, displaying the B-factor graph, illustrates the correlation between the mobility identified in the docked complex NMA and the PDB scores. In this context, RMSD minimization based on local and global structure superposition enabled iterative deformation of the input structure, modeling potential transitions. Meanwhile, the total atomic displacements across all modes of residues at individual atomic sites provide an insightful measure of main-chain deformability. The complex’s deformability graph, illustrated in Figure 6E, identifies peak regions representing the protein’s more flexible areas, while inflexible sections exhibit lower values. Additionally, the variance graph, inversely linked to the eigenvalue (as demonstrated in Figure 6F), is connected to each normal mode of the complex, elucidating both individual and cumulative variances for a comprehensive depiction of the system’s dynamics.

3.8 Immune response simulation

The focused MEV significantly boosted secondary responses, as predicted by immune modeling. In principle, this sequence can help the immune system quickly respond to threats. High levels of IgM were the prime simulated response. The simulated secondary and tertiary responses revealed considerable increases in B-cell populations as well as high concentrations of IgG1 + IgG2, IgM, and IgM + IgG antibodies. However, there was a decrease in the antigen levels (Figures 7A, B). The increased level of memory B-cell population and isotype switching indicate the formation of immunological memory in this case. Following the subsequent exposure to chimeric antigens, this caused a fast antigen reduction (Figure 7C). After further antigen exposure, it was hypothesized that both cytotoxic (TC) and helper (TH) cell subsets would form a similar memory (Figures 7D, E). High levels of activity in the macrophage, flare cell, and natural killer cell populations were also sustained during the vaccination period (Figures 7F–H). Higher levels of cytokines like interleukins IL-2 and IFN-y were also present (Figure 7I). These results provide support for the research showing that the anticipated vaccine formulation induced successful immune reactions against SGLV.

FIGURE 7
www.frontiersin.org

Figure 7 The in silico inflamed simulation used by the C-ImmSim Servers allows for an estimation of the SGLV-V4 recombinant peptide vaccination’s immunological potential. (A) Vaccines cause an increase in immunoglobin antibodies and a decrease in antigen levels. As seen in (B), B-cell numbers increase and antigen titers fall after immunization. Increased B-cell counts as a result of repeated antigen exposure (C). T-cytotoxic and T-helper cell counts rise (D, E) after repeated antigen exposure. Dendritic cells, macrophages, and natural killer cells all grew in number during the vaccination window (F–H). Increased antigen exposure leads to increased cytokine and interleukin (I) production. This danger signal is depicted alongside leukocytes and the rate of expansion factor IL-2 in the inset graphic.

3.9 Molecular cloning and codon optimization

Designing a vaccine with an appropriate expression system is the initial stage in evaluating a vaccination candidate, which requires a serological study. Prior to in vitro expression, a similar strategy was employed in earlier experiments for in silico-designed vaccines. The bacterial cell expresser E. coli was selected. Cloning and transcription are greatly facilitated by the Java Codon Adaptation Test (JCAT), which makes the E. coli K12 strain a great host organism. The estimated GC content of the improved sequence was 52.7%, which is significantly higher than the value of 50.73 found in E. coli. The modified sequence had a CAI (codon adaptation index) of 1.0. The multi-epitope vaccine (MEV) vector’s codon usage curve is shown in Figure 8. Finally, SnapGene software was used to create a recombinant plasmid sequence by inserting the final vaccine construct V4 modified codon sequence into the plasmid vector pET28a (+), ensuring heterologous cloning and expression in the E. coli system (Figure 8).

FIGURE 8
www.frontiersin.org

Figure 8 (A) Reverse-translated primary DNA sequence of the SGLV-V4, (B) SGLV-V4 cloned specifically into the E. coli expression vector [pET28a(+)].

3.10 Secondary structure of vaccine mRNA

The RNAfold server predicts the vaccine mRNA’s secondary structure with a minimum free energy of -268.90 kcal/mol (Figure 9A), while the centroid secondary structure shows -229.37 kcal/mol (Figure 9B). mFold v2.3 server calculates the optimal secondary structure’s minimum free energy at -283.12 kcal/mol (Figure 9C). A lower minimal free energy suggests greater stability for the vaccine mRNA post-expression in vivo.

FIGURE 9
www.frontiersin.org

Figure 9 (A) Optimal secondary structure of the vaccine mRNA (B) Central secondary configuration of the vaccine mRNA (C) thermodynamic ensemble of mRNA structure, and the centroid structure are vividly depicted in the mountain plot representation. Additionally, the positional entropy plot unveils the intricacies of each position.

4 Discussion

In response to the growing concern over the flow of Songling virus (SGLV) cases worldwide and the absence of available vaccines, this study investigated the challenge of preventing future Songling virus epidemics. Employing cutting-edge immunoinformatics techniques, we embarked on designing innovative multi-epitope Songling virus vaccine constructs by examining the proteome of the Songling virus to pinpoint targets for a potential vaccine. Using strict standards, they forecasted epitopes for B-cells, MHC-I, and MHC-II. These epitopes play a crucial role by sparking a protective immune response that blocks viruses and establishes long-term defense (45).

The overlapped epitopes are prioritized from MHCI & II, and B-cell which are highly antigenic, non-allergen, and produce humoral response, that combats infections by eliminating infected cells or releasing antiviral substances to establish lasting immunity (46). To enhance this response, a novel vaccine was created using various CTL and HTL segments combined with specialized suitable linkers and adjuvants. Additionally, the vaccine’s design incorporates EAAAK, AAY, and GPGPG linkers and adjuvants, which improve the structure and stability of the vaccine. Four vaccine constructs were designed from selected epitopes. These vaccine models displayed impressive traits: high antigenicity, non-allergenicity, and non-toxicity. Analysis of the vaccine’s physiochemical characteristics indicated its robustness, alkaline nature, and hydrophobic properties, all of which indicate its potential to induce potent and targeted immunogenic responses in infected individuals.

Molecular docking analysis was then employed to explore the interaction between the vaccine constructs and the crucial immune cell receptors, i.e., TLRs. TLRs are known for their pivotal role in immune cell activation and the recognition of viral peptide structures (46). The results revealed strong binding affinities of SGLV-V4 toward TLR4, suggesting that the designed vaccine constructs have the capacity to generate robust immunogenic responses upon exposure. The C-ImmSim server, an immune response evaluation tool, was used to assess a newly designed vaccine’s ability to induce an immunological response. This method simulates key components of the mammalian immune system and tracks how various immune cells respond to the vaccine (7). The goal is to design a vaccine that not only offers immediate protection but also triggers a long-lasting immune response, simulating natural immunity. The top-ranked vaccine candidate, SGLV-V4, activated essential immune components, including antibodies (IgG and IgM), T-cells, B-cells, and cytokines (Figure 7) (47). This multi-epitope-based subunit vaccine shows suitable in protecting against the Songling virus. For further investigation to assess the stability and biomolecular process of SGLV-V4, the molecular dynamic simulation and NMA were performed. To enhance vaccine expression, computational cloning was performed on a pET28a (+) vector after codon optimization with the JCAT web service. The optimized codon adaptation index (CAI) and GC content fall within acceptable ranges, ensuring efficient expression in E. coli (strain K12). This optimization is crucial for successful vaccine production.

As we progress toward the next critical steps, we anticipate in vitro immunological assays to be conducted to confirm and validate the immunogenicity of the designed vaccine. Subsequently, a challenge-protection preclinical trial will be initiated, presenting a crucial opportunity to rigorously evaluate and substantiate the efficacy and safety of the SGLV-V4-TLR4 vaccine construct. These endeavors aim to provide a comprehensive framework to combat Songling virus infections effectively, potentially mitigating their impact and safeguarding public health against this evolving threat. However, to determine the vaccine’s safety and efficacy, further experimental validation is required, which may involve the production of vaccine proteins with thorough in vivo and in vitro tests. However, the current research relies entirely on the results of computational approaches for technical equipment.

5 Conclusion

In our study on Songling virus (SGLV), a tick-borne pathogen lacking treatment or vaccines, we employed immunoinformatics to identify four potential vaccine target proteins. Designing a comprehensive vaccine candidate with broad global coverage, we combined B-cell and T-cell epitopes and validated them through IFN-γ epitopes. SGLV-V4, selected for its strong performance in molecular docking and favorable properties, was efficiently expressed in E. coli bacteria. Immunological research and simulations confirmed its stability and robust immune response, offering a promising avenue for safer and more effective SGLV-V4 vaccine development.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

SA: Writing – original draft, Writing – review & editing, Conceptualization, Data curation, Formal Analysis, Investigation, Methodology. AAli: Writing – original draft, Writing – review & editing, Investigation, Software, Visualization. AAla: Funding acquisition, Validation, Writing – review & editing. AB: Validation, Writing – review & editing. MD: Writing – review & editing. AAb: Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research received funding form King Saud University project number: RSPD2023R552.

Acknowledgments

Authors are thankful for Researchers Supporting Project number (RSPD2023R552), King Saud University, Riyadh, Saudi Arabia.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1284366/full#supplementary-material

References

1. Ma J, Lv X-L, Zhang X, Han S-Z, Wang Z-D, Li L, et al. Identification of a new orthonairovirus associated with human febrile illness in China. Nat Med (2021) 27:434–9. doi: 10.1038/s41591-020-01228-y

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Cotmore SF, Agbandje-McKenna M, Canuti M, Chiorini JA, Eis-Hubinger A-M, Hughes J, et al. ICTV virus taxonomy profile: Parvoviridae. J Gen Virol (2019) 100:367–8. doi: 10.1099/jgv.0.001212

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ji N, Wang N, Liu G, Zhao S, Liu Z, Tan W, et al. Tacheng tick virus 1 and songling virus infection in great gerbils (Rhombomys opimus) in Northwestern China. J Wildl Dis (2023) 59:138–42. doi: 10.7589/JWD-D-21-00137

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Cai X, Cai X, Xu Y, Shao Y, Fu L, Men X, et al. Virome analysis of ticks and tick-borne viruses in Heilongjiang and Jilin Provinces, China. Virus Res (2023) 323:199006. doi: 10.1016/j.virusres.2022.199006

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Poland GA, Ovsyannikova IG, Jacobson RM. Application of pharmacogenomics to vaccines. Pharmacogenomics (2009) 10(5). doi: 10.2217/pgs.09.25

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Flower DR. Bioinformatics for vaccinology. Wiley: John Wiley & Sons (2008).

Google Scholar

7. Suleman M, Rashid F, Ali S, Sher H, Luo S, Xie L, et al. Immunoinformatic-based design of immune-boosting multiepitope subunit vaccines against monkeypox virus and validation through molecular dynamics and immune simulation. Front Immunol (2022) 13:1042997. doi: 10.3389/fimmu.2022.1042997

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Pruitt KD, Tatusova T, Maglott DR. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res (2005) 33:D501–4. doi: 10.1093/nar/gki025

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, et al. Database resources of the national center for biotechnology information. Nucleic Acids Res (2007) 35:D5–D12. doi: 10.1093/nar/gkl1031

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Fu L, Niu B, Zhu Z, Wu S, Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics (2012) 28:3150–2. doi: 10.1093/bioinformatics/bts565

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Mahram A, Herbordt MC. NCBI BLASTP on high-performance reconfigurable computing systems. ACM Trans Reconfigurable Technol Syst (2015) 7:1–20. doi: 10.1145/2629691

CrossRef Full Text | Google Scholar

12. Dimitrov I, Flower DR, Doytchinova I. AllerTOP-a server for in silico prediction of allergens. BMC Bioinf (2013) 14: 1–9. doi: 10.1186/1471-2105-14-S6-S4

CrossRef Full Text | Google Scholar

13. Doytchinova IA, Flower DR. VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinf (2007) 8:1–7. doi: 10.1186/1471-2105-8-4

CrossRef Full Text | Google Scholar

14. Rathore AS, Arora A, Choudhury SPS, Tijare P, Raghava GPS. ToxinPred 3.0: An improved method for predicting the toxicity of peptides. bioRxiv (2023) p. 2008–23.

Google Scholar

15. Dhanda SK, Mahajan S, Paul S, Yan Z, Kim H, Jespersen MC, et al. IEDB-AR: immune epitope database—analysis resource in 2019. Nucleic Acids Res (2019) 47:W502–6. doi: 10.1093/nar/gkz452

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Zhang Q, Wang P, Kim Y, Haste-Andersen P, Beaver J, Bourne PE, et al. Immune epitope database analysis resource (IEDB-AR). Nucleic Acids Res (2008) 36:W513–8. doi: 10.1093/nar/gkn254

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Roewer L, Nagy M, Schmidt P, Epplen JT, Herzog-Schröder G. Microsatellite and HLA class II oligonucleotide typing in a population of Yanomami Indians. DNA fingerprinting State Sci (1993), 221–30. doi: 10.1007/978-3-0348-8583-6_18

CrossRef Full Text | Google Scholar

18. Malik AA, Ojha SC, SChaduangrat N, Nantasenamat C. ABCpred: a webserver for the discovery of acetyl-and butyryl-cholinesterase inhibitors. Mol Divers (2022) 18:1–21. doi: 10.1007/s11030-021-10292-6

CrossRef Full Text | Google Scholar

19. ExPASy BRP. ProtParam tool. SIB bioinforma resour portal . Available at: http//webexpasyorg/protparam/ (Accessed Oct 2016).

Google Scholar

20. Bui H-H, Sidney J, Dinh K, Southwood S, Newman MJ, Sette A. Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC Bioinf (2006) 7:1–5. doi: 10.1186/1471-2105-7-153

CrossRef Full Text | Google Scholar

21. Bandyopadhyay A, Cambray S, Gao J. Fast and selective labeling of N-terminal cysteines at neutral pH via thiazolidino boronate formation. Chem Sci (2016) 7:4589–93. doi: 10.1039/C6SC00172F

PubMed Abstract | CrossRef Full Text | Google Scholar

22. McGuffin LJ, Bryson K, Jones DT. The PSIPRED protein structure prediction server. Bioinformatics (2000) 16:404–5. doi: 10.1093/bioinformatics/16.4.404

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Jones DT. Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol (1999) 292:195–202. doi: 10.1006/jmbi.1999.3091

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Laskowski RA. PDBsum: summaries and analyses of PDB structures. Nucleic Acids Res (2001) 29:221–2. doi: 10.1093/nar/29.1.221

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Biasini M, Bienert S, Waterhouse A, Arnold K, Studer G, Schmidt T, et al. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res (2014) 42:W252–8. doi: 10.1093/nar/gku340

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res (2007) 35:W407–10. doi: 10.1093/nar/gkm290

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Lovell SC, Davis IW, Arendall WB III, de Bakker PI, Word JM, Prisant MG, et al. Structure validation by Calpha geometry: phi, psi and Cbeta deviation, Proteins50, 437– 450. Google Sch There is no Corresp Rec this Ref (2003) 50(3). doi: 10.1002/prot.10286

CrossRef Full Text | Google Scholar

28. Koganty RR. Vaccines: exploiting the role of toll-like receptors. Expert Rev Vaccines (2002) 1:123–4. doi: 10.1586/14760584.1.2.123

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Matsumoto M, Seya T. TLR3: interferon induction by double-stranded RNA including poly (I: C). Adv Drug Delivery Rev (2008) 60:805–12. doi: 10.1016/j.addr.2007.11.005

CrossRef Full Text | Google Scholar

30. Ciesielska A, Matyjek M, Kwiatkowska K. TLR4 and CD14 trafficking and its influence on LPS-induced pro-inflammatory signaling. Cell Mol Life Sci (2021) 78:1233–61. doi: 10.1007/s00018-020-03656-y

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Cervantes JL, Weinerman B, Basole C, Salazar JC. TLR8: the forgotten relative revindicated. Cell Mol Immunol (2012) 9:434–8. doi: 10.1038/cmi.2012.38

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Yan Y, Zhang D, Zhou P, Li B, Huang S-Y. HDOCK: a web server for protein–protein and protein–DNA/RNA docking based on a hybrid strategy. Nucleic Acids Res (2017) 45:W365–73. doi: 10.1093/nar/gkx407

PubMed Abstract | CrossRef Full Text | Google Scholar

33. De Vries SJ, Van Dijk M, Bonvin AMJJ. The HADDOCK web server for data-driven biomolecular docking. Nat Protoc (2010) 5:883–97. doi: 10.1038/nprot.2010.32

PubMed Abstract | CrossRef Full Text | Google Scholar

34. López-Blanco JR, Aliaga JI, Quintana-Ortí ES, Chacón P. iMODS: internal coordinates normal mode analysis server. Nucleic Acids Res (2014) 42:W271–6. doi: 10.1093/nar/gku339

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Nain Z, Abdulla F, Rahman MM, Karim MM, Khan MSA, Bin SS, et al. Proteome-wide screening for designing a multi-epitope vaccine against emerging pathogen Elizabethkingia anophelis using immunoinformatic approaches. J Biomol Struct Dyn (2020) 38:4850–67. doi: 10.1080/07391102.2019.1692072

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Kroger A, Bahta L, Long S, Sanchez P. General best practice guidelines for immunization: best practices guidance of the Advisory Committee on Immunization Practices (ACIP). (2023).

Google Scholar

37. Aiman S, Ali F, Zia A, Aslam M, Han Z, Shams S, et al. Core genome mediated potential vaccine targets prioritization against Clostridium difficile via reverse vaccinology-an immuno-informatics approach. J Biol Res (2022) 29. doi: 10.26262/jbrt.v29i0.8481

CrossRef Full Text | Google Scholar

38. Grote A, Hiller K, Scheer M, Münch R, Nörtemann B, Hempel DC, et al. JCat: a novel tool to adapt codon usage of a target gene to its potential expression host. Nucleic Acids Res (2005) 33:W526–31. doi: 10.1093/nar/gki376

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Biotech G. Snapgene viewer. Glick B, editor (2020), p. 3.

Google Scholar

40. Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res (2003) 31:3406–15. doi: 10.1093/nar/gkg595

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Gruber AR, Lorenz R, Bernhart SH, Neuböck R, Hofacker IL. The vienna RNA websuite. Nucleic Acids Res (2008) 36:W70–4. doi: 10.1093/nar/gkn188

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Rahman N, Ali F, Basharat Z, Shehroz M, Khan MK, Jeandet P, et al. Vaccine design from the ensemble of surface glycoprotein epitopes of SARS-CoV-2: an immunoinformatics approach. Vaccines (2020) 8:423. doi: 10.3390/vaccines8030423

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Mehmood A, Kaushik AC, Wei D. Prediction and validation of potent peptides against herpes simplex virus type 1 via immunoinformatic and systems biology approach. Chem Biol Drug Des (2019) 94:1868–83. doi: 10.1111/cbdd.13602

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Ichiye T, Karplus M. Collective motions in proteins: a covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations. Proteins Struct Funct Bioinforma (1991) 11:205–17. doi: 10.1002/prot.340110305

CrossRef Full Text | Google Scholar

45. Bacchetta R, Gregori S, Roncarolo M-G. CD4+ regulatory T cells: mechanisms of induction and effector function. Autoimmun Rev (2005) 4:491–6. doi: 10.1016/j.autrev.2005.04.005

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Vaure C, Liu Y. A comparative review of toll-like receptor 4 expression and functionality in different animal species. Front Immunol (2014) 5:316. doi: 10.3389/fimmu.2014.00316

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Chen R. Bacterial expression systems for recombinant protein production: E. coli beyond. Biotechnol Adv (2012) 30:1102–7. doi: 10.1016/j.biotechadv.2011.09.013

CrossRef Full Text | Google Scholar

Keywords: immunoinformatics, Songling virus, reverse vaccinology, molecular docking, MD simulation

Citation: Ali SL, Ali A, Alamri A, Baiduissenova A, Dusmagambetov M and Abduldayeva A (2023) Genomic annotation for vaccine target identification and immunoinformatics-guided multi-epitope-based vaccine design against Songling virus through screening its whole genome encoded proteins. Front. Immunol. 14:1284366. doi: 10.3389/fimmu.2023.1284366

Received: 04 September 2023; Accepted: 01 November 2023;
Published: 28 November 2023.

Edited by:

Joe Hou, Fred Hutchinson Cancer Center, United States

Reviewed by:

Anuj Kumar, ICMR-National Institute of Cancer Prevention and Research, India
Junjie Yue, Beijing Institute of Biotechnology, China

Copyright © 2023 Ali, Ali, Alamri, Baiduissenova, Dusmagambetov and Abduldayeva. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Abdulaziz Alamri, abalamri@ksu.edu.sa

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.