Skip to main content

ORIGINAL RESEARCH article

Front. Mol. Biosci., 15 October 2020
Sec. Biological Modeling and Simulation
This article is part of the Research Topic Coronavirus Disease (COVID-19): Molecular Mechanisms, Translational Approaches and Therapeutics View all 118 articles

Determine the Potential Epitope Based Peptide Vaccine Against Novel SARS-CoV-2 Targeting Structural Proteins Using Immunoinformatics Approaches

  • 1Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Faisalabad, Pakistan
  • 2Institute of Basic Medical Sciences, Khyber Medical University, Peshawar, Pakistan
  • 3Department of Bioinformatics, University of Okara, Okara, Pakistan

Coronaviruses (CoVs) belong to the Coronaviridae-family. The genus Beta-coronaviruses, are enveloped positive strand RNA viruses with club-like spikes at the surface with a unique replication process and a large RNA genome (∼25 kb). CoVs are known as one of the major pathogenic viruses causing a variety of diseases in birds and mammals including humans (lethal respiratory dysfunctions). Recently, a new strain of coronavirus has been identified and named as SARS-CoV-2. A large number of COVID-19 (disease caused by SARS-CoV-2) cases are being diagnosed all over the World especially in China (Wuhan). COVID-19 showed high mortality rate exponentially, however, not even a single effective cure is being introduced yet against COVID-19. In the current study, immunoinformatics approaches were employed to predict the antigenic epitopes against COVID-19 for the development of a coronavirus peptide vaccine. Cytotoxic T-lymphocyte (CTL) and B-cell epitopes were predicted for SARS-CoV-2 coronavirus structural proteins (Spikes, Membrane, Envelope, and Nucleocapsid). The docking complexes of the top 10 epitopes having antigenic sites were analyzed led by binding affinity and binding interactional analyses of top ranked predicted peptides with the MHC-I HLA molecule. The predicted peptides may have potential to be used as peptide vaccine against COVID-19.

Background

There are still a variety of human diseases with unknown etiology. A viral parentage has been purposed for numerous diseases which also has significance to search for new viruses (Cascella et al., 2020). However, there are various difficulties involved in scrutinizing new viruses, as some viruses do not replicate in vitro and have cytopathic effects (CPE). The viruses that are unable to replicate in vitro lead to the failure of virus discovery. The DNA Amplified Restriction Fragment Length Polymorphism (cDNA-AFLP 4) technique helps to identify new viruses, including the discovery of the new coronavirus (CoV) (Cascella et al., 2020). The SARS-CoV-2 strain from the genus Beta-coronavirus of the Coronaviridae family, are enveloped viruses with a large plus strand RNA genome, complete classification is provided in Supplementary Material. The size of the genomic RNA is 27–32 kb and poly-adenylated. There are three serologically distinct groups of CoVs. Viruses are characterized by their genomic sequence and host range (Guy et al., 2000). CoVs have been discovered in mice, turkeys, cats, horses, and humans, leading to many diseases including respiratory tract issues and gastroenteritis (International Committee on Taxonomy of Viruses, 2020). Two human viruses (HCoV-229E and HCoV-OC43) were identified in the mid-1960s and are known to cause the common cold. The recently identified SARS-CoV-2 causes a life-threatening pneumonia and is the most pathogenic human CoV identified thus far (Peiris et al., 2003). SARS-CoV-2 is likely to have been occupied in an animal source and recently initiated the pandemic in humans through zoonotic transmission (Martina et al., 2003). SARS-CoV-2 is the first member of a fourth group of CoVs (Snijder et al., 2003).

In Wuhan (Hubei Province, China), a number of patients linked with Hunan South China seafood market have the third zoonotic human CoV of the century which emerged on the 31st of December, 2019. CoV is similar to Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) and Middle East Respiratory Syndrome Coronavirus (MERS-CoV) infections including fever, lung infiltration and difficulty in breathing (de Wilde et al., 2018; Wuhan Municipal Health Commission, 2020). After an extensive speculation about the causative agent of CoV, the identification of the novel CoV was announced by the Chinese Center for Disease Control (CDS) on the 19th of January, 2020 (Kahn, 2020). The novel CoV SARS-CoV-2 was insulate from a single patient and later corroborated by 16 more patients (World Health Organization [WHO]., 2020). The viral pneumonia of COVID-19 was quickly predicted as a likely causative agent and the sequence of SARS-CoV-2 was submitted (VoNCGAohvotn-c-gaoJ, 2020). Later, five more sequences of SARS-CoV-2 were submitted on the GSAID database on 11th of January, 2020 from the Chinese institutes (GDCAohwgoCaoJ, 2020). Multiple sequence alignment of SARS-CoV, MERS-CoV, and SARS-CoV-2 was carried out and the conserved part of DNA and protein sequences was observed to be similar. Hundreds of deaths linked with this deadliest infection increase the morbidities in the age of 50 years and above. Various diseases have been discovered and associated including dry-cough, leukopenia, fever, and shortness of breath. The extracorporeal membrane oxygenation of the patients considered as severe cases need supportive care. The infection of SARS-CoV-2 in elderly patients is known to be less virulent as compared to SARS-CoV (10% mortality) and MERS-CoV (35% mortality) in the initial stage, later on SARS-CoV-2 caused a huge mortality rate in all over the world (Imai et al., 2020). For this infection, no reliable mediation is currently available. Preventative measures are urgently needed due to the significant global disease burden resultant of SARS-CoV-2 (Douglas et al., 2018). SARS-CoV-2 has a far higher mortality rate as compared to the other known members of corona virus family and researchers are trying their best to develop a successful vaccine against COVID-19. Peptide-based vaccines and multi-epitope adjuvant based vaccines approaches (Tahir ul Qamar et al., 2020) are used widely for the development of successful vaccine. Moreover, naturally occurring compounds are also employed to inhibit SARS-CoV-2 efficiently by using virtual screening approaches (Xiao et al., 2020).

The vaccine development process essentially involves the determination of effective B-cell epitopes and Cytotoxic T lymphocytes (CTL). The advanced methodology has emerged to determine the response of T-cells against numerous vaccine candidates for the process of vaccine development (Ip et al., 2015). The present effort struggles to elucidate and scrutinize the effective T-cells and B-cell (conformational and linear) epitopes act as potential candidates for vaccine by utilizing the immunoinformatics approaches. Furthermore, the crucial step for the development of a vaccine is the identification of potential peptides from the virulent pathogen proteome having interactions with the major histocompatibility complex (MHC). The efficiency of the epitopes binding to MHC molecules is linked with the T-cell immunogenicity (Lazarski et al., 2005). An immunoinformatics approach was utilized to predict the peptide-MHC complexes and comparative molecular docking analyses leads to scrutiny of the potential peptides for peptide vaccine development. Recently, similar approaches and methodology were used against Zika virus, MERS-CoV virus, and Ebola virus for peptide-based vaccine prediction (Ashfaq and Ahamed, 2016; Ahmad et al., 2019; Tahir ul Qamar et al., 2019a).

Materials and Methods

Sequence Retrieval

The primary amino acid sequences of the structural proteins of CoV were extracted from NCBI (Geer et al., 2010). The amino acid sequences of the selected structural protein of CoV have 222 residues for membrane protein (NCBI_Protein = QHQ82467.1), 75 residues for envelope protein (NCBI_Protein = QHW06051.1), 419 residues for nucleocapsid protein (NCBI_Protein = QHZ00386.1) and 1273 amino acids for spikes protein (NCBI_Protein = QHR63260.2). The physiochemical properties of the selected protein were evaluated by using Protparam and VOLPES (Wilkins et al., 1999).

Multiple Sequence Alignment (MSA)

Multiple Sequence Alignment was performed on all the three full length genomes (SARS-CoV = NC_004718, MERS-CoV = NC_019843.3 and SARS-CoV-2 = NC_045512.2) and the genomic sequences were retrieved through GenBank (Sayers et al., 2019, 2020). The genomic sequences of the selected genomes were utilized and a hierarchical approach along with a series of different pair-score matrices including sum-of-pairs and Hidden Markov Model (HMM) was employed for MSA. Clustal Omega (Sievers and Higgins, 2014, 2018) was utilized to analyze the MSA of the selected genomic sequences and the conserved domains were observed by using WebLogo3 (Crooks et al., 2004).

Conformational and Linear B-Cell Epitopes Prediction

The antigen B-cell epitope interactions against B-lymphocyte leads to the differentiation of B-lymphocytes into two different types of cells as antibody-secreting plasma and memory cells (Nair et al., 2002). The hydrophilic nature and surface accessibility of B-cell epitopes were assumed as the key characteristics of predicted B-cell epitopes as predicted B-cells epitopes should be water loving in nature for better solubility (Parker et al., 1986) by accessing the immune epitope database and analysis resource (IEDB)1 as stated by hydrophilicity prediction of Parker (Parker et al., 1986), flexibility prediction of Karplus and Schulz (1985), Emini surface accessibility prediction (Pettersen et al., 2004) and antigenicity scale of Kolaskar and Tongaonkar (Alexander et al., 2011). The conformational B-cell epitopes were predicted by employing ElliPro2 (Pettersen et al., 2004) from IEDB analysis resource having three diverse algorithms comprising protein shape approximation (Emini et al., 1985), residues Protrusion Index (PI) (Nain et al., 2019) and the adjacent residues clustering based on PI.

Potential Epitope Prediction of Cytotoxic T-Lymphocyte (CTL)

The CTL epitopes predictions were analyzed through utilizing NetCTL.1.2 server (Beijing News, 2020). The molecules of MHC behave as antigens and utilize their surface for the activation of CTLs. The NetCTL.1.2 server was utilized to integrate the MHC class I binding prediction, proteasomal C-terminal cleavage and transporter associated with antigen processing (TAP) transport efficiency. The FASTA format sequences of the organism were subjected to the server and Human leukocyte antigen (HLA) alleles and peptide lengths were observed and analyzed. Additionally, the prediction of T-cell epitopes and weight matrix algorithm was employed for the prediction of TAP transport efficiency and artificial neural network was implemented to predict the MHC class-I binding and proteasomal C-terminal cleavage.

World Population Coverage Analyses

The World population coverage analyses were performed by utilizing the IEDB server. The selected CTL epitopes were used and analyzed against the respective allele sets and major world populations were covered. The key purpose of the coverage analyses was to analyze whether the selected candidates were suitable for major populations or not. The analyses were performed against China, Iran, Japan, Korea, Pakistan, Italy, France, and other countries which are being affected by SARS-CoV-2 in the 2020 viral outbreak (Vita et al., 2019).

Molecular Docking Analyses and Peptide-MHC Protein Complex

The predicted epitopes of SARS-CoV-2 structural proteins with antigenic residues were selected for molecular docking analyses. The PEP-FOLD3 server (Lamiable et al., 2016) was utilized to predict the 3D structures of the selected peptides with 200 simulation runs to sample the conformations. The conformational models clustered by the PEP-FOLD3 server were evaluated on the basis of sOPEP energy scores (Maupetit et al., 2007). The analyzed peptides which had higher scores were selected for molecular docking experiments with MHC class I binding molecule comprising HLA-B (PDB ID: 3VCL) through PatchDock docking server (Huang et al., 2010). All the docked complexes having undesirable penetrations of the receptor’s atoms into the ligand were rejected and geometric shape complementarity score was applied to classify the other complexes. Subsequently, the FireDock server (Andrusier et al., 2007; Mashiach et al., 2008) was utilized to refine the docked complexes and also predict the score of the docking outputs.

The FireDock server was utilized to improve the flexibility and scoring errors observed during the molecular docking calculations through fast rigid-body docking tools (Kingsford et al., 2005). The molecular visualization programs PyMOL (Alexander et al., 2011), Ligplot and UCSF Chimera 1.11 (Pettersen et al., 2004) were utilized to visualize, analyze and identify the hydrogen bonding interactions of the docked complexes (Nair et al., 2002; Palatnik-de-Sousa et al., 2018; Tahir ul Qamar et al., 2019b). The schematic diagram illustrating the applied approaches and strategies along with tools and software are mentioned in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. Schematic workflow for the prediction of peptide based vaccine against SARS-CoV-2.

Results

A variety of tools and servers have resulted through recent advancements in immunological bioinformatics, which lessen the time and cost of traditional vaccine advancement. The development of an effective multiple epitope vaccine remains difficult due to problems in selection of suitable antigen candidates and immune-dominant epitopes. Thus, it is important to predict the appropriate antigen epitopes of the targeted protein by immune-informatics approaches to design a multiple epitope vaccine (Nain et al., 2019). The main target was to use the immune-informatics approaches and the prediction of peptide vaccine through recognizing MHC binding, B-cells and CTL epitopes. The discovery of effective vaccines is possible through pathogenomics analyses on a genome wide scale, though these conventional experimental methods have multiple limitations (Rodrigues et al., 2019). Immune-informatics approaches help to analyze the complete spectrum of the potential antigen, and furthermore complications regarding in vitro expression of antigen and pathogen culturing can also be evaded. By means of computational methods, the immune research groups have reported various vaccine candidates as having promising preclinical outputs (Davies and Flower, 2007). In current efforts, epitopes have been identified to design the peptide vaccine against HLA-B protein (Tahir et al., 2018). The development of epitopes based vaccines targeting the structural proteins of SARS-CoV-2 and epitopes of the target proteins were predicted to support the host’s immune response. The antigenicity and allergenicity of the predicted epitopes were observed through VaxiJen and Allergen F.P 1.0 (Dimitrov et al., 2014). The estimation of population coverage of predicted epitopes was calculated and it was observed that the coverage in China was 0.5639 with average hits of 4.0 for MHC class I, and with average 0.2462 and hits of 0.91 for MHC class II (Supplementary Table 1). The peptides were designed against ten epitopes by utilizing Pepfold-3.0. The molecular docking analyses of the selected ten peptides were performed through PatchDock and further refined through Fire Dock (Andrusier et al., 2007; Mashiach et al., 2008; Huang et al., 2010) to identify the effective binding sites (Table 1).

TABLE 1
www.frontiersin.org

Table 1. Predicted CTL epitopes from the SARS-CoV-2 structural proteins having antigenic sites.

Analysis for SARS-CoV-2 Structural Proteins Surface Properties

A peptide with surface-accessibility probability of >1.0 reflects more probable chances for a peptide to be found on the surface (Parker et al., 1986). Numerous peptides were predicted and the top ranked predicted peptides of SARS-CoV-2 structural proteins on the basis of surface probability (Y-axis) and sequence position (X-axis) were selected for further analyses (Supplementary File 1–4). The maximum surface probability scores for the membrane protein, envelope protein, nucleocapsid protein and spikes protein were analyzed as “YANRNR” 5.199, “YSRVKN: 4.136, “KKDKKK” 6.966, and “QDKNTQ” 6.051, respectively. Similarly, minimum surface probability scores for the membrane protein, envelope protein, nucleocapsid protein and spikes protein were observed as “LACFVL” 0.078, “LCAYCC” 0.088, “LALLLL” 0.05, and “VFLVLL” 0.07, respectively (Table 2).

TABLE 2
www.frontiersin.org

Table 2. The maximum and minimum values of the predicted peptides.

The Karplus and Schulz (1985) flexibility method was utilized to calculate and analyze the atomic vibrational motions in the protein structure designated through B-factor and temperature. The stability and organization of the structure depends upon the B-factor values. The quality of the predicted models depends upon the B-factor values as a lower B-factor value is considered as an effective model while higher B-factor values lead to the less organized and poorly ordered structures (Karplus and Schulz, 1985; Table 2).

The hydrophilicity scale process of Parker was carried out to observe the peptides hydrophilicity based on the peptide retention times through HPLC on reversed phase column. Immunological analyses have revealed the association of antigenic sites with the hydrophilic regions (Parker et al., 1986). The antigenicity of SARS-CoV-2 was calculated through the Kolaskar & Tongaonkar method (Table 2). The predicted facts and data for all selected four protein properties are mentioned in the Supplementary Material (Supplementary File 1–4).

Structure-Based Epitope Prediction for SARS-CoV-2 Structural Proteins

The correlation among the protein structure antigenicity, epitope prediction, accessibility and flexibility within 3D structures were determined through ElliPro (Ponomarenko et al., 2008). The significant properties including protein-antibody interactions were analyzed to differentiate the predicted epitopes. The top-ranked five conformational epitopes for SARS-CoV-2 which had a score of ≥0.6 were observed and selected for further analyses. The PI (Isoelectric Point value) (Ponomarenko et al., 2008) score was observed to analyze the percentage of the atoms which extend over the molecular bulk and are also liable for the antibody binding. The top ranked 2 conformational predicted epitopes along with the residues name, length and locations were critically analyzed (Table 3) and the score was observed 0.703 and 0.706.

TABLE 3
www.frontiersin.org

Table 3. Top ranked selected discontinues epitopes, interacting residues and scores.

Molecular Docking Analyses of SARS-CoV-2 Structural Proteins With HLA-B

The comparative molecular docking analyses were executed for the top ranked 10 selected epitopes of SARS-CoV-2 out of 87 designed peptides with MHC class I HLA-B. The effective binding affinities have been observed for all the selected CTL epitopes having van der Waals (VdW) energy values ranges from −21.80 to −27.52 kcal/mol and the observed global energy was −25.01 to −53.65 kcal/mol (Table 4). The molecular docking analyses of the selected 10 CTL predicted epitopes were carried out and effective binding affinities with HLA-B were observed (Supplementary File 5).

TABLE 4
www.frontiersin.org

Table 4. The designed peptides against SARS-CoV-2 peptides-MHC class I HLA-B interactions.

The top 10 docked complexes were visualized (Figure 2) and a similar binding pocket was observed in all the selected peptides. It was observed that Tyr9, Ile66, Gln70, Tyr99, Tyr116, and Arg156 residues were conserved in all the selected peptides (Table 3).

FIGURE 2
www.frontiersin.org

Figure 2. Peptide-MHC class I HLA-B (pink color helices denotes the conserved binding domain of HLA-B and the remaining protein structure is presented in the wire shape), binding interacting residues of the top-ranked 10 peptides represented in different colors, 6 spike peptides brown color residues, 2 membrane peptides red color residues, 1 nucleocapsid, and 1 envelope peptide with purple and blue color residues, respectively.

Population Coverage Analyses

The population coverage analyses were performed with the selected MHC class I and MHC class II epitopes and also with the associated HLA alleles. It was observed that the selected MHC class I and MHC class II epitopes have the world’s population of 58.49 and 34.71%, respectively. MHC class I epitopes showed the highest coverage in the population of Italy (90.19%) and China (56.39%). The MHC class II epitopes also showed the highest coverage in the Philippines (71.92%) (Supplementary File 6).

Multiple Sequence Alignment

Multiple sequence alignment was performed for three CoV genomes and conserved binding residues were observed. It was observed that all the selected strains of the CoV have conserved domains, which is reconciled with the latest outbreak strain SARS-CoV-2. Interestingly, it was observed that the reported binding domain of the previously reported strain has a similar region of binding with latest outbreak of CoV, 2019. The binding residues of SARS-CoV-2 showed similar binding domains with MERS and SARS (Supplementary File 7).

Discussion

The need of dealing with CoVs has been increased since its recent breakout in China (Wuhan) affecting millions of humans. This SARS-CoV-2 viral attack has become a worldwide emergency in different regions of the World, especially in China (Mcclain, 1995). As an immediate response, numerous efforts from all over the world have been made to develop a peptide based vaccine against SARS-CoV-2, and the peptide inhibitors are of great interest to develop vaccines (Chew et al., 2017; Usman Mirza et al., 2017). The peptide targets are more preferable than traditional ligand-based drugs and vaccines due to different aspects including less toxic, fewer side-effects and their ultra-fast action. Immunoinformatics approaches help by reducing the work-load of laboratory trials, additionally these approaches are less time consuming and cost efficient than traditional approaches (Vanhee et al., 2011; Heurich et al., 2013; Xu et al., 2017). In the last 10 years, there has been much progress in in silico drug designing (Sehgal, 2017). Numerous biological problems are being solved by the implementation of different bioinformatics approaches (Sehgal et al., 2013; Sehgal, 2017; Tahir et al., 2018).

Researchers are striving mutually for a successful vaccine development and cure against COVID-19. Computational approaches were employed to analyze the synergistic effect by the combination of lopinavir, oseltamivir and ritonavir through molecular docking studies (Muralidharan et al., 2020).

Recently, molecular docking analyses along with virtual screening were performed against the drug candidates in clinical trials and approved drugs. Elbasvir, lopinavir, valrubicin, and carfilzomib were identified as potential compounds (Wang, 2020). Molecular docking analyses also revealed that luteolin and chloroquine also have the potential to inhibit the SARS-CoV-2 (Yu et al., 2020).

Recently, numerous research groups have struggled to design the subunit vaccines against SARS-CoV-2; though, the utilized workflow involved in the research either employ of a single protein to design the vaccine (Abdelmageed et al., 2020; Bhattacharya et al., 2020) or only CTL epitopes was used without considering the significance of HTL or B-cell epitopes (Seema, 2019). In current research work, all of these significant factors were considered to design the vaccine. Through extensive bioinformatics analyses, four proteins were utilized to design an epitope-based vaccine against SARS-CoV-2. The selected proteins for the analyses were membrane glycoprotein (M), nucleocapsid protein (N), envelop protein (E), and surface spike glycoprotein (S). The protein M helps in immunogenicity and assembly of the virus particles. The protein N has the ability to package the viral genome into a helical ribonucleocapsid and has a key role during viral self-assembly (Chang et al., 2013). The protein S has the ability to mediate the movement of the virus to human cells. The protein S is classified into two regions as S1 for the binding of the host receptor cell and S2 for the fusion of membrane. Due to the active involvement of protein S, it is considered as a key target for vaccine development, diagnostics and therapeutic antibodies for coronavirus (Du et al., 2009; Al-Amri et al., 2017; Prompetchara et al., 2020). By keeping the importance of protein S in mind, six different peptides were designed and analyzed.

The observed findings of antigenicity analysis range from 7.6 to 6.12% which is considered as an effective antigenic ability for a potent peptide, and similar ranges were observed in both studies of immunoinformatics analyses. Moreover, the binding domain of HLA-B was observed to be conserved in both studies and reconcile with the present research efforts (Usman Mirza et al., 2017; Tahir ul Qamar et al., 2020).

The potential CTL epitopes have been predicted for structural proteins of SARS-CoV-2. The molecular docking tools were used to analyze MHC-1 and peptide binding affinities for the selected peptides (Alam et al., 2016). Other evidences including C-terminal cleavage affinities also validated the binding affinity of the peptide-MHC-I complexes. In this study, ten peptides were reported as potential targets that showed effective interactions with the MHC-I protein (HLA-B), having maximum binding affinities and antigenicity. This increases the probability of the potential vaccine targets for the observed residues to be promising targets. The surface accessibility, surface flexibility as well as hydrophobicity and antigenicity for SARS-CoV-2 structural proteins were calculated and cross-verified by using the IEDB server (Sieker et al., 2009). An extensive literature review was performed and it was observed that the selected peptides were not reported against SARS-CoV-2. The predicted peptides were modeled by PEP-FOLD3 server and docked to MHC-1 using PatchDock and FireDock was used for further refinement. PyMOL and UCSF Chimera 1.11 were used to check the interactions of docked complexes.

The design and development of a potent vaccine needs an extensive investigation and analyses of immunological correlations with SARS-CoV-2. However, the experimental techniques would not be able to serve the urgency due to the severity and emergency of the COVID-19 outbreak. Therefore, in silico and computational predictions are helpful to guide the researchers to design a potential vaccine and help to control COVID-19. The vaccine development is an expensive and lengthy procedure with a high rate of failure, and several years are required to develop an effective commercial vaccine. Computational analyses suggest that the reported epitope-based vaccine peptides may have the ability to be protective against SARS-CoV-2 infection.

Conclusion

The aim of this work was to identify the effective peptide based inhibitors against SARS-CoV-2 structural protein (Membrane, Envelope, Nucleocapsid, and Spikes). The predicted epitopes were designed leading to the molecular docking analyses against MHC-I and interactional analyses of the selected docked complexes were analyzed. In conclusion, 10 Epitopes (six from spikes protein “LTDEMIAQY, WTAGAAAYY, TSNQVAVLY, CVADYSVLY, KTSVDCTMY, and STECSNLLL,” two from membrane protein “SSDNIALLV and ATSRTLSYY,” one from nucleocapsid and one from envelope protein “LSPRWYFYY and LTALRLCAY,” respectively), were predicted which might be potential targets as peptide vaccine against deadly SARS -CoV-2.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation, to any qualified researcher.

Author Contributions

MW, AH, MS, SS, and SAS performed the analyses and drafted the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

Thanks to Department of Bioinformatics and Biotechnology, Government College University Faisalabad and Department of Bioinformatics, University of Okara for providing the platform for this work.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2020.00227/full#supplementary-material

Abbreviations

CTL, cytotoxic T-lymphocyte; HLA, human leukocyte antigen; IEDB, immune epitope database; MERS-CoV, middle east respiratory syndrome coronavirus; MHC, major histocompatibility complex; PI, isoelectric point; RBD, receptor binding domain; SARS-CoV, severe acute respiratory syndrome coronavirus.

Footnotes

  1. ^ http://www.iedb.org/
  2. ^ http://tools.immuneepitope.org/toolsElliPro/

References

Abdelmageed, M. I., Abdelmoneim, A. H., Mustafa, M. I., Elfadol, N. M., Murshed, N. S., Shantier, S. W., et al. (2020). Design of a multiepitope-based peptide vaccine against the E protein of human COVID-19: an immunoinformatics approach. Biomed Res. Int. 2020:268 3286.

Google Scholar

Ahmad, B., Ashfaq, U. A., Rahman, M. U., Masoud, M. S., and Yousaf, M. Z. (2019). Conserved B and T cell epitopes prediction of Ebola virus glycoprotein for vaccine development: an immuno-informatics approach. Microb. Pathog. 132, 243–253. doi: 10.1016/j.micpath.2019.05.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Alam, A., Ali, S., Ahamad, S., Malik, M. Z., and Ishrat, R. (2016). From ZikV genome to vaccine: in silico approach for the epitope-based peptide vaccine against Zika virus envelope glycoprotein. Immunology 149, 386–399. doi: 10.1111/imm.12656

PubMed Abstract | CrossRef Full Text | Google Scholar

Al-Amri, S. S., Abbas, A. T., Siddiq, L. A., Alghamdi, A., Sanki, M. A., Al-Muhanna, M. K., et al. (2017). Immunogenicity of Candidate MERS-CoV DNA vaccines based on the spike protein. Sci. Rep. 7:44875.

Google Scholar

Alexander, N., Woetzel, N., and Meiler, J. (2011). “bcl::Cluster : a method for clustering biological molecules coupled with visualization in the Pymol Molecular Graphics System,” in Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2011, Orlando, FL, 13–18.

Google Scholar

Andrusier, N., Nussinov, R., and Wolfson, H. J. (2007). FireDock: fast interaction refinement in molecular docking. Proteins 69, 139–159. doi: 10.1002/prot.21495

PubMed Abstract | CrossRef Full Text | Google Scholar

Ashfaq, U. A., and Ahamed, B. (2016). De novo structural modeling and conserved epitopes prediction of Zika virus envelop protein for vaccine development. Viral Immunol. 29, 436–443. doi: 10.1089/vim.2016.0033

PubMed Abstract | CrossRef Full Text | Google Scholar

Beijing News (2020). Wen XMSiWCrNCP, and Another Suspected. Available online at: http://china.qianlong.com/2020/0121/3600877.shtml (accessed January 21, 2020).

Google Scholar

Bhattacharya, M., Sharma, A. R., Patra, P., Ghosh, P., Sharma, G., Patra, B. C., et al. (2020). Development of epitope-based peptide vaccine against novel coronavirus 2019 (SARS-COV-2): immunoinformatics approach. J. Med. Virol. 92, 618–631. doi: 10.1002/jmv.25736

PubMed Abstract | CrossRef Full Text | Google Scholar

Cascella, M., Rajnik, M., Cuomo, A., Dulebohn, S. C., and Di Napoli, R. (2020). Features, Evaluation and Treatment Coronavirus (COVID-19). Treasure Island, FL: StatPearls Publishing.

Google Scholar

Chang, C. K., Chen, C. M., Chiang, M. H., Hsu, Y. L., and Huang, T. H. (2013). Transient oligomerization of the SARS-CoV N protein–implication for virus ribonucleoprotein packaging. PLoS One 8:e65045. doi: 10.1371/journal.pone.0065045

PubMed Abstract | CrossRef Full Text | Google Scholar

Chew, M. F., Poh, K. S., and Poh, C. L. (2017). Peptides as therapeutic agents for dengue virus. Int. J. Med. Sci. 14, 1342–1359. doi: 10.7150/ijms.21875

PubMed Abstract | CrossRef Full Text | Google Scholar

Crooks, G. E., Hon, G., Chandonia, J. M., and Brenner, S. E. (2004). WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190. doi: 10.1101/gr.849004

PubMed Abstract | CrossRef Full Text | Google Scholar

Davies, M. N., and Flower, D. R. (2007). Harnessing bioinformatics to discover new vaccines. Drug Discov. Today 12, 389–395. doi: 10.1016/j.drudis.2007.03.010

PubMed Abstract | CrossRef Full Text | Google Scholar

de Wilde, A. H., Snijder, E. J., Kikkert, M., and van Hemert, M. J. (2018). Host Factors in Coronavirus Replication. Curr. Top. Microbiol. Immunol. 419, 1–42. doi: 10.1007/82_2017_25

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitrov, I., Naneva, L., Doytchinova, I., and Bangov, I. (2014). AllergenFP: allergenicity prediction by descriptor fingerprints. Bioinformatics 30, 846–851. doi: 10.1093/bioinformatics/btt619

PubMed Abstract | CrossRef Full Text | Google Scholar

Douglas, M. G., Kocher, J. F., Scobey, T., Baric, R. S., and Cockrell, A. S. (2018). Adaptive evolution influences the infectious dose of MERS-CoV necessary to achieve severe respiratory disease. Virology 517, 98–107. doi: 10.1016/j.virol.2017.12.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, L., He, Y., Zhou, Y., Liu, S., Zheng, B. J., and Jiang, S. (2009). The spike protein of SARS-CoV–a target for vaccine and therapeutic development. Nat. Rev. Microbiol. 7, 226–236. doi: 10.1038/nrmicro2090

PubMed Abstract | CrossRef Full Text | Google Scholar

Emini, E. A., Hughes, J. V., Perlow, D. S., and Boger, J. (1985). Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide. J. Virol. 55, 836–839. doi: 10.1128/jvi.55.3.836-839.1985

PubMed Abstract | CrossRef Full Text | Google Scholar

GDCAohwgoCaoJ (2020). GDCAohwgoCaoJ.

Google Scholar

Geer, L. Y., Marchler-Bauer, A., Geer, R. C., Han, L., He, J., He, S., et al. (2010). The NCBI biosystems database. Nucleic Acids Res. 38(Suppl._1), D492–D496.

Google Scholar

Guy, J. S., Breslin, J. J., Breuhaus, B., Vivrette, S., and Smith, L. G. (2000). Characterization of a coronavirus isolated from a diarrheic foal. J. Clin. Microbiol. 38, 4523–4526. doi: 10.1128/jcm.38.12.4523-4526.2000

PubMed Abstract | CrossRef Full Text | Google Scholar

Heurich, M., Altintas, Z., and Tothill, I. E. (2013). Computational design of peptide ligands for ochratoxin A. Toxins 5, 1202–1218. doi: 10.3390/toxins5061202

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, P. T., Lo, P. H., Wang, C. H., Pang, C. T., and Lou, K. L. (2010). PPDock-Portal Patch Dock: a web server for drug virtual screen and visualizing the docking structure by GP and X-Score. Acta Crystallogr. A 66, S233–S234.

Google Scholar

Imai, N. D. I, Cori, A., Riley, S., and Ferguson, N. M. (2020). Estimating the Potential Total Number of Novel Coronavirus (2019-nCoV) Cases in Wuhan City, China. Available online at: https://www.imperial.ac.uk/mrcglobal-infectious-disease-analysis/news–wuhan-coronavirus/ (accessed January 19, 2020).

Google Scholar

International Committee on Taxonomy of Viruses (2020).

Google Scholar

Ip, P. P., Nijman, H. W., and Daemen, T. (2015). Epitope prediction assays combined with validation assays strongly narrows down putative cytotoxic T Lymphocyte epitopes. Vaccines 3, 203–220.

Google Scholar

Kahn, N. N. W. (2020). doi: 10.3390/vaccines3020203

PubMed Abstract | CrossRef Full Text | Google Scholar

Karplus, P. A., and Schulz, G. E. (1985). Prediction of chain flexibility in proteins - a tool for the selection of peptide antigens. Naturwissenschaften 72, 212–213. doi: 10.1007/bf01195768

CrossRef Full Text | Google Scholar

Kingsford, C. L., Chazelle, B., and Singh, M. (2005). Solving and analyzing side-chain positioning problems using linear and integer programming. Bioinformatics 21, 1028–1036. doi: 10.1093/bioinformatics/bti144

PubMed Abstract | CrossRef Full Text | Google Scholar

Lamiable, A., Thevenet, P., Rey, J., Vavrusa, M., Derreumaux, P., and Tuffery, P. (2016). PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex. Nucleic Acids Res. 44, W449–W454.

Google Scholar

Lazarski, C. A., Chaves, F. A., Jenks, S. A., Richards, K. A., Weaver, J., et al. (2005). The kinetic stability of MHC class II: peptide complexes is a key parameter that dictates immunodominance. Immunity 23, 29–40. doi: 10.1016/j.immuni.2005.05.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Martina, B. E., Haagmans, B. L., Kuiken, T., Fouchier, R. A., Rimmelzwaan, G. F., and Van Amerongen, G. (2003). Virology: SARS virus infection of cats and ferrets. Nature 425:915. doi: 10.1038/425915a

PubMed Abstract | CrossRef Full Text | Google Scholar

Mashiach, E., Schneidman-Duhovny, D., Andrusier, N., Nussinov, R., and Wolfson, H. J. (2008). FireDock: a web server for fast interaction refinement in molecular docking. Nucleic Acids Res. 36, W229–W232.

Google Scholar

Maupetit, J., Tuffery, P., and Derreumaux, P. (2007). A coarse-grained protein force field for folding and structure prediction. Proteins 69, 394–408. doi: 10.1002/prot.21505

PubMed Abstract | CrossRef Full Text | Google Scholar

Mcclain, C. S. (1995). A new look at an old disease - smallpox and biotechnology. Perspect. Biol. Med. 38, 624–639. doi: 10.1353/pbm.1995.0000

PubMed Abstract | CrossRef Full Text | Google Scholar

Muralidharan, N., Sakthivel, R., Velmurugan, D., and Gromiha, M. M. (2020). Computational studies of drug repurposing and synergism of lopinavir, oseltamivir and ritonavir binding with SARS-CoV-2 protease against COVID-19. J. Biomol. Struct. Dyn. 16, 1–6. doi: 10.1080/07391102.2020.1752802

PubMed Abstract | CrossRef Full Text | Google Scholar

Nain, Z., Abdullah, F., Rahman, M. M., Karim, M. M., Khan, M. S. A., Bin Sayed, S., et al. (2019). Proteome-wide screening for designing a multi-epitope vaccine against emerging pathogen Elizabethkingia anophelis using immunoinformatic approaches. J. Biomol. Struct. Dyn. 38, 4850–4867. doi: 10.1080/07391102.2019.1692072

PubMed Abstract | CrossRef Full Text | Google Scholar

Nair, D. T., Singh, K., Siddiqui, Z., Nayak, B. P., Rao, K. V. S., and Salunke, D. M. (2002). Epitope recognition by diverse antibodies suggests conformational convergence in an antibody response. J. Immunol. 168, 2371–2382. doi: 10.4049/jimmunol.168.5.2371

PubMed Abstract | CrossRef Full Text | Google Scholar

Palatnik-de-Sousa, C. B., Soares, I. D. S., and Rosa, D. S. (2018). Editorial: epitope discovery and synthetic vaccine design. Front. Immunol. 9:826. doi: 10.3389/fimmu.2018.00826

PubMed Abstract | CrossRef Full Text | Google Scholar

Parker, J. M. R., Guo, D., and Hodges, R. S. (1986). New hydrophilicity scale derived from high-performance liquid-chromatography peptide retention data - correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. Biochemistry 25, 5425–5432. doi: 10.1021/bi00367a013

PubMed Abstract | CrossRef Full Text | Google Scholar

Peiris, J. S., Chu, C. M., Cheng, V. C., Chan, K. S., Hung, I. F., and Poon, L. L. (2003). Clinical progression and viral load in a community outbreak of coronavirus-associated SARS pneumonia: a prospective study. Lancet 361, 1767–1772. doi: 10.1016/s0140-6736(03)13412-5

CrossRef Full Text | Google Scholar

Pettersen, E. F., Goddard, T. D., Huang, C. C., Couch, G. S., Greenblatt, D. M., Meng, E. C., et al. (2004). UCSF chimera - a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612. doi: 10.1002/jcc.20084

PubMed Abstract | CrossRef Full Text | Google Scholar

Ponomarenko, J., Bui, H. H., Li, W., Fusseder, N., Bourne, P. E., Sette, A., et al. (2008). ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics 9:514. doi: 10.1186/1471-2105-9-514

PubMed Abstract | CrossRef Full Text | Google Scholar

Prompetchara, E., Ketloy, C., and Palaga, T. (2020). Immune responses in COVID-19 and potential vaccines: lessons learned from SARS and MERS epidemic. Asian Pac. J. Allergy Immunol. 38, 1–9.

Google Scholar

Rodrigues, T. C. V., Jaiswal, A. K., de Sarom, A., Oliveira, L. D., Oliveira, C. J. F., Ghosh, P., et al. (2019). Reverse vaccinology and subtractive genomics reveal new therapeutic targets against Mycoplasma pneumoniae: a causative agent of pneumonia. R. Soc. Open Sci. 6:190907. doi: 10.1098/rsos.190907

PubMed Abstract | CrossRef Full Text | Google Scholar

Sayers, E. W., Cavanaugh, M., Clark, K., Ostell, J., Pruitt, K. D., and Karsch-Mizrachi, I. (2019). GenBank. Nucleic Acids Res. 47, D94–D99.

Google Scholar

Sayers, E. W., Cavanaugh, M., Clark, K., Ostell, J., Pruitt, K. D., and Karsch-Mizrachi, I. (2020). GenBank. Nucleic Acids Res 48, D84–D86.

Google Scholar

Seema, M. T. (2019). Cell epitope-based vaccine design for pandemic novel Coronavirus -nCoV2020. ChemRxiv [Preprint]. doi: 10.26434/chemrxiv.12029523.v2

CrossRef Full Text | Google Scholar

Sehgal, S. A. (2017). Pharmacoinformatics, adaptive evolution, and elucidation of six novel compounds for schizophrenia treatment by targeting DAOA (G72) isoforms. Biomed Res. Int. 2017:5925714.

Google Scholar

Sehgal, S. A., Khattak, N. A., and Mir, A. (2013). Structural, phylogenetic and docking studies of D-amino acid oxidase activator (DAOA), a candidate schizophrenia gene. Theor. Biol. Med. Model. 10:3. doi: 10.1186/1742-4682-10-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Sieker, F., May, A., and Zacharias, M. (2009). Predicting affinity and specificity of antigenic peptide binding to major histocompatibility class I molecules. Curr. Protein Pept. Sci. 10, 286–296. doi: 10.2174/138920309788452191

PubMed Abstract | CrossRef Full Text | Google Scholar

Sievers, F., and Higgins, D. G. (2014). Clustal omega. Curr. Protoc. Bioinformatics 48, 3.13.1–3.13.16.

Google Scholar

Sievers, F., and Higgins, D. G. (2018). Clustal Omega for making accurate alignments of many protein sequences. Protein Sci. 27, 135–145. doi: 10.1002/pro.3290

PubMed Abstract | CrossRef Full Text | Google Scholar

Snijder, E. J., Bredenbeek, P. J., Dobbe, J. C., Thiel, V., Ziebuhr, J., and Poon, L. L. (2003). Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage. J. Mol. Biol. 331, 991–1004. doi: 10.1016/s0022-2836(03)00865-9

CrossRef Full Text | Google Scholar

Tahir, R. A., Wu, H., Rizwan, M. A., Jafar, T. H., Saleem, S., and Sehgal, S. A. (2018). Immunoinformatics and molecular docking studies reveal potential epitope-based peptide vaccine against DENV-NS3 protein. J. Theor. Biol. 459, 162–170. doi: 10.1016/j.jtbi.2018.10.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Tahir ul Qamar, M. T., Rehman, A., Ashfaq, U. A., Awan, M. Q., Fatima, I., Shahid, F., et al. (2020). Designing of a next generation multiepitope based vaccine (MEV) against SARS-COV-2: immunoinformatics and in silico approaches. bioRxiv [Preprint]. doi: 10.1101/2020.02.28.970343

CrossRef Full Text | Google Scholar

Tahir ul Qamar, M., Saleem, S., Ashfaq, U. A., Bari, A., Anwar, F., and Alqahtani, S. (2019a). Epitope-based peptide vaccine design and target site depiction against Middle East Respiratory Syndrome Coronavirus: an immune-informatics study. J. Transl. Med. 17:362.

Google Scholar

Tahir ul Qamar, M., Saleem, S., Ashfaq, U. A., Bari, A., Anwar, F., and Alqahtani, S. (2019b). Epitope-based peptide vaccine design and target site depiction against Middle East Respiratory Syndrome Coronavirus: an immune-informatics study. J. Transl. Med. 17:362.

Google Scholar

Usman Mirza, M., Rafique, S., Ali, A., Munir, M., Ikram, N., Manan, A., et al. (2017). Towards peptide vaccines against Zika virus: immunoinformatics combined with molecular dynamics simulations to predict antigenic epitopes of Zika viral proteins. Sci. Rep. 6:37313.

Google Scholar

Vanhee, P., van der Sloot, A. M., Verschueren, E., Serrano, L., Rousseau, F., and Schymkowitz, J. (2011). Computational design of peptide ligands. Trends Biotechnol. 29, 231–239. doi: 10.1016/j.tibtech.2011.01.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Vita, R., Mahajan, S., Overton, J. A., Dhanda, S. K., Martini, S., Cantrell, J. R., et al. (2019). The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 47, D339–D343.

Google Scholar

VoNCGAohvotn-c-gaoJ (2020). VoNCGAohvotn-c-gaoJ.

Google Scholar

Wang, J. (2020). Fast identification of possible drug treatment of Coronavirus Disease-19 (COVID-19) through computational drug repurposing study. J. Chem. Inf. Model. 60, 3277–3286. doi: 10.1021/acs.jcim.0c00179

PubMed Abstract | CrossRef Full Text | Google Scholar

Wilkins, M. R., Gasteiger, E., Bairoch, A., Sanchez, J. C., Williams, K. L., Appel, R. D., et al. (1999). Protein identification and analysis tools in the ExPASy server. Methods Mol. Biol. 112, 531–552. doi: 10.1385/1-59259-584-7:531

CrossRef Full Text | Google Scholar

World Health Organization [WHO]. (2020). WHO Statement Regarding Cluster of Pneumonia Cases in Wuhan CAohwwicnd—w-s-r-c. Beijing: WHO.

Google Scholar

Wuhan Municipal Health Commission (2020). http://en.nhc.gov.cn/2020-04/06/c_78861_2.htm (accessed January 19, 2020).

Google Scholar

Xiao, Y., Li, Z., Wang, X., Wang, Y., Wang, Y., Wang, G., et al. (2020). Comparison of three TaqMan real-time reverse transcription-PCR assays in detecting SARS-CoV-2. BioRxiv [Preprint]. doi: 10.1101/2020.07.06.189860

CrossRef Full Text | Google Scholar

Xu, D. R., Bian, H. L., Cai, J. L., Bao, D. C., Jin, Q., Zhu, M., et al. (2017). Computational design of peptide ligands to target the intermolecular interaction between viral envelope protein and pediatric receptor. Comput. Biol. Chem. 69, 120–125. doi: 10.1016/j.compbiolchem.2017.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, R., Chen, L., Lan, R., Shen, R., and Li, P. (2020). Computational screening of antagonists against the SARS-CoV-2 (COVID-19) coronavirus by molecular docking. Int. J. Antimicrob. Agents. 56:106012. doi: 10.1016/j.ijantimicag.2020.106012

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: immunoinformatics, SARS-CoV-2, SARS-CoV, peptide vaccines, corona virus disease 2019

Citation: Waqas M, Haider A, Sufyan M, Siraj S and Sehgal SA (2020) Determine the Potential Epitope Based Peptide Vaccine Against Novel SARS-CoV-2 Targeting Structural Proteins Using Immunoinformatics Approaches. Front. Mol. Biosci. 7:227. doi: 10.3389/fmolb.2020.00227

Received: 01 April 2020; Accepted: 11 August 2020;
Published: 15 October 2020.

Edited by:

Francesco Luigi Gervasio, University College London, United Kingdom

Reviewed by:

Jan Prchal, University of Chemistry and Technology, Prague, Czechia
Valentina Tozzini, Consiglio Nazionale delle Ricerche, Italy
Luisa Di Paola, Campus Bio-Medico University, Italy
Yassmine Chebaro, Centre National de la Recherche Scientifique (CNRS), France

Copyright © 2020 Waqas, Haider, Sufyan, Siraj and Sehgal. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Sheikh Arslan Sehgal, YXJzbGFuc2VoZ2FsQHlhaG9vLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.