- 1Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Faisalabad, Pakistan
- 2Institute of Basic Medical Sciences, Khyber Medical University, Peshawar, Pakistan
- 3Department of Bioinformatics, University of Okara, Okara, Pakistan
Coronaviruses (CoVs) belong to the Coronaviridae-family. The genus Beta-coronaviruses, are enveloped positive strand RNA viruses with club-like spikes at the surface with a unique replication process and a large RNA genome (∼25 kb). CoVs are known as one of the major pathogenic viruses causing a variety of diseases in birds and mammals including humans (lethal respiratory dysfunctions). Recently, a new strain of coronavirus has been identified and named as SARS-CoV-2. A large number of COVID-19 (disease caused by SARS-CoV-2) cases are being diagnosed all over the World especially in China (Wuhan). COVID-19 showed high mortality rate exponentially, however, not even a single effective cure is being introduced yet against COVID-19. In the current study, immunoinformatics approaches were employed to predict the antigenic epitopes against COVID-19 for the development of a coronavirus peptide vaccine. Cytotoxic T-lymphocyte (CTL) and B-cell epitopes were predicted for SARS-CoV-2 coronavirus structural proteins (Spikes, Membrane, Envelope, and Nucleocapsid). The docking complexes of the top 10 epitopes having antigenic sites were analyzed led by binding affinity and binding interactional analyses of top ranked predicted peptides with the MHC-I HLA molecule. The predicted peptides may have potential to be used as peptide vaccine against COVID-19.
Background
There are still a variety of human diseases with unknown etiology. A viral parentage has been purposed for numerous diseases which also has significance to search for new viruses (Cascella et al., 2020). However, there are various difficulties involved in scrutinizing new viruses, as some viruses do not replicate in vitro and have cytopathic effects (CPE). The viruses that are unable to replicate in vitro lead to the failure of virus discovery. The DNA Amplified Restriction Fragment Length Polymorphism (cDNA-AFLP 4) technique helps to identify new viruses, including the discovery of the new coronavirus (CoV) (Cascella et al., 2020). The SARS-CoV-2 strain from the genus Beta-coronavirus of the Coronaviridae family, are enveloped viruses with a large plus strand RNA genome, complete classification is provided in Supplementary Material. The size of the genomic RNA is 27–32 kb and poly-adenylated. There are three serologically distinct groups of CoVs. Viruses are characterized by their genomic sequence and host range (Guy et al., 2000). CoVs have been discovered in mice, turkeys, cats, horses, and humans, leading to many diseases including respiratory tract issues and gastroenteritis (International Committee on Taxonomy of Viruses, 2020). Two human viruses (HCoV-229E and HCoV-OC43) were identified in the mid-1960s and are known to cause the common cold. The recently identified SARS-CoV-2 causes a life-threatening pneumonia and is the most pathogenic human CoV identified thus far (Peiris et al., 2003). SARS-CoV-2 is likely to have been occupied in an animal source and recently initiated the pandemic in humans through zoonotic transmission (Martina et al., 2003). SARS-CoV-2 is the first member of a fourth group of CoVs (Snijder et al., 2003).
In Wuhan (Hubei Province, China), a number of patients linked with Hunan South China seafood market have the third zoonotic human CoV of the century which emerged on the 31st of December, 2019. CoV is similar to Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) and Middle East Respiratory Syndrome Coronavirus (MERS-CoV) infections including fever, lung infiltration and difficulty in breathing (de Wilde et al., 2018; Wuhan Municipal Health Commission, 2020). After an extensive speculation about the causative agent of CoV, the identification of the novel CoV was announced by the Chinese Center for Disease Control (CDS) on the 19th of January, 2020 (Kahn, 2020). The novel CoV SARS-CoV-2 was insulate from a single patient and later corroborated by 16 more patients (World Health Organization [WHO]., 2020). The viral pneumonia of COVID-19 was quickly predicted as a likely causative agent and the sequence of SARS-CoV-2 was submitted (VoNCGAohvotn-c-gaoJ, 2020). Later, five more sequences of SARS-CoV-2 were submitted on the GSAID database on 11th of January, 2020 from the Chinese institutes (GDCAohwgoCaoJ, 2020). Multiple sequence alignment of SARS-CoV, MERS-CoV, and SARS-CoV-2 was carried out and the conserved part of DNA and protein sequences was observed to be similar. Hundreds of deaths linked with this deadliest infection increase the morbidities in the age of 50 years and above. Various diseases have been discovered and associated including dry-cough, leukopenia, fever, and shortness of breath. The extracorporeal membrane oxygenation of the patients considered as severe cases need supportive care. The infection of SARS-CoV-2 in elderly patients is known to be less virulent as compared to SARS-CoV (10% mortality) and MERS-CoV (35% mortality) in the initial stage, later on SARS-CoV-2 caused a huge mortality rate in all over the world (Imai et al., 2020). For this infection, no reliable mediation is currently available. Preventative measures are urgently needed due to the significant global disease burden resultant of SARS-CoV-2 (Douglas et al., 2018). SARS-CoV-2 has a far higher mortality rate as compared to the other known members of corona virus family and researchers are trying their best to develop a successful vaccine against COVID-19. Peptide-based vaccines and multi-epitope adjuvant based vaccines approaches (Tahir ul Qamar et al., 2020) are used widely for the development of successful vaccine. Moreover, naturally occurring compounds are also employed to inhibit SARS-CoV-2 efficiently by using virtual screening approaches (Xiao et al., 2020).
The vaccine development process essentially involves the determination of effective B-cell epitopes and Cytotoxic T lymphocytes (CTL). The advanced methodology has emerged to determine the response of T-cells against numerous vaccine candidates for the process of vaccine development (Ip et al., 2015). The present effort struggles to elucidate and scrutinize the effective T-cells and B-cell (conformational and linear) epitopes act as potential candidates for vaccine by utilizing the immunoinformatics approaches. Furthermore, the crucial step for the development of a vaccine is the identification of potential peptides from the virulent pathogen proteome having interactions with the major histocompatibility complex (MHC). The efficiency of the epitopes binding to MHC molecules is linked with the T-cell immunogenicity (Lazarski et al., 2005). An immunoinformatics approach was utilized to predict the peptide-MHC complexes and comparative molecular docking analyses leads to scrutiny of the potential peptides for peptide vaccine development. Recently, similar approaches and methodology were used against Zika virus, MERS-CoV virus, and Ebola virus for peptide-based vaccine prediction (Ashfaq and Ahamed, 2016; Ahmad et al., 2019; Tahir ul Qamar et al., 2019a).
Materials and Methods
Sequence Retrieval
The primary amino acid sequences of the structural proteins of CoV were extracted from NCBI (Geer et al., 2010). The amino acid sequences of the selected structural protein of CoV have 222 residues for membrane protein (NCBI_Protein = QHQ82467.1), 75 residues for envelope protein (NCBI_Protein = QHW06051.1), 419 residues for nucleocapsid protein (NCBI_Protein = QHZ00386.1) and 1273 amino acids for spikes protein (NCBI_Protein = QHR63260.2). The physiochemical properties of the selected protein were evaluated by using Protparam and VOLPES (Wilkins et al., 1999).
Multiple Sequence Alignment (MSA)
Multiple Sequence Alignment was performed on all the three full length genomes (SARS-CoV = NC_004718, MERS-CoV = NC_019843.3 and SARS-CoV-2 = NC_045512.2) and the genomic sequences were retrieved through GenBank (Sayers et al., 2019, 2020). The genomic sequences of the selected genomes were utilized and a hierarchical approach along with a series of different pair-score matrices including sum-of-pairs and Hidden Markov Model (HMM) was employed for MSA. Clustal Omega (Sievers and Higgins, 2014, 2018) was utilized to analyze the MSA of the selected genomic sequences and the conserved domains were observed by using WebLogo3 (Crooks et al., 2004).
Conformational and Linear B-Cell Epitopes Prediction
The antigen B-cell epitope interactions against B-lymphocyte leads to the differentiation of B-lymphocytes into two different types of cells as antibody-secreting plasma and memory cells (Nair et al., 2002). The hydrophilic nature and surface accessibility of B-cell epitopes were assumed as the key characteristics of predicted B-cell epitopes as predicted B-cells epitopes should be water loving in nature for better solubility (Parker et al., 1986) by accessing the immune epitope database and analysis resource (IEDB)1 as stated by hydrophilicity prediction of Parker (Parker et al., 1986), flexibility prediction of Karplus and Schulz (1985), Emini surface accessibility prediction (Pettersen et al., 2004) and antigenicity scale of Kolaskar and Tongaonkar (Alexander et al., 2011). The conformational B-cell epitopes were predicted by employing ElliPro2 (Pettersen et al., 2004) from IEDB analysis resource having three diverse algorithms comprising protein shape approximation (Emini et al., 1985), residues Protrusion Index (PI) (Nain et al., 2019) and the adjacent residues clustering based on PI.
Potential Epitope Prediction of Cytotoxic T-Lymphocyte (CTL)
The CTL epitopes predictions were analyzed through utilizing NetCTL.1.2 server (Beijing News, 2020). The molecules of MHC behave as antigens and utilize their surface for the activation of CTLs. The NetCTL.1.2 server was utilized to integrate the MHC class I binding prediction, proteasomal C-terminal cleavage and transporter associated with antigen processing (TAP) transport efficiency. The FASTA format sequences of the organism were subjected to the server and Human leukocyte antigen (HLA) alleles and peptide lengths were observed and analyzed. Additionally, the prediction of T-cell epitopes and weight matrix algorithm was employed for the prediction of TAP transport efficiency and artificial neural network was implemented to predict the MHC class-I binding and proteasomal C-terminal cleavage.
World Population Coverage Analyses
The World population coverage analyses were performed by utilizing the IEDB server. The selected CTL epitopes were used and analyzed against the respective allele sets and major world populations were covered. The key purpose of the coverage analyses was to analyze whether the selected candidates were suitable for major populations or not. The analyses were performed against China, Iran, Japan, Korea, Pakistan, Italy, France, and other countries which are being affected by SARS-CoV-2 in the 2020 viral outbreak (Vita et al., 2019).
Molecular Docking Analyses and Peptide-MHC Protein Complex
The predicted epitopes of SARS-CoV-2 structural proteins with antigenic residues were selected for molecular docking analyses. The PEP-FOLD3 server (Lamiable et al., 2016) was utilized to predict the 3D structures of the selected peptides with 200 simulation runs to sample the conformations. The conformational models clustered by the PEP-FOLD3 server were evaluated on the basis of sOPEP energy scores (Maupetit et al., 2007). The analyzed peptides which had higher scores were selected for molecular docking experiments with MHC class I binding molecule comprising HLA-B (PDB ID: 3VCL) through PatchDock docking server (Huang et al., 2010). All the docked complexes having undesirable penetrations of the receptor’s atoms into the ligand were rejected and geometric shape complementarity score was applied to classify the other complexes. Subsequently, the FireDock server (Andrusier et al., 2007; Mashiach et al., 2008) was utilized to refine the docked complexes and also predict the score of the docking outputs.
The FireDock server was utilized to improve the flexibility and scoring errors observed during the molecular docking calculations through fast rigid-body docking tools (Kingsford et al., 2005). The molecular visualization programs PyMOL (Alexander et al., 2011), Ligplot and UCSF Chimera 1.11 (Pettersen et al., 2004) were utilized to visualize, analyze and identify the hydrogen bonding interactions of the docked complexes (Nair et al., 2002; Palatnik-de-Sousa et al., 2018; Tahir ul Qamar et al., 2019b). The schematic diagram illustrating the applied approaches and strategies along with tools and software are mentioned in Figure 1.
Results
A variety of tools and servers have resulted through recent advancements in immunological bioinformatics, which lessen the time and cost of traditional vaccine advancement. The development of an effective multiple epitope vaccine remains difficult due to problems in selection of suitable antigen candidates and immune-dominant epitopes. Thus, it is important to predict the appropriate antigen epitopes of the targeted protein by immune-informatics approaches to design a multiple epitope vaccine (Nain et al., 2019). The main target was to use the immune-informatics approaches and the prediction of peptide vaccine through recognizing MHC binding, B-cells and CTL epitopes. The discovery of effective vaccines is possible through pathogenomics analyses on a genome wide scale, though these conventional experimental methods have multiple limitations (Rodrigues et al., 2019). Immune-informatics approaches help to analyze the complete spectrum of the potential antigen, and furthermore complications regarding in vitro expression of antigen and pathogen culturing can also be evaded. By means of computational methods, the immune research groups have reported various vaccine candidates as having promising preclinical outputs (Davies and Flower, 2007). In current efforts, epitopes have been identified to design the peptide vaccine against HLA-B protein (Tahir et al., 2018). The development of epitopes based vaccines targeting the structural proteins of SARS-CoV-2 and epitopes of the target proteins were predicted to support the host’s immune response. The antigenicity and allergenicity of the predicted epitopes were observed through VaxiJen and Allergen F.P 1.0 (Dimitrov et al., 2014). The estimation of population coverage of predicted epitopes was calculated and it was observed that the coverage in China was 0.5639 with average hits of 4.0 for MHC class I, and with average 0.2462 and hits of 0.91 for MHC class II (Supplementary Table 1). The peptides were designed against ten epitopes by utilizing Pepfold-3.0. The molecular docking analyses of the selected ten peptides were performed through PatchDock and further refined through Fire Dock (Andrusier et al., 2007; Mashiach et al., 2008; Huang et al., 2010) to identify the effective binding sites (Table 1).
Analysis for SARS-CoV-2 Structural Proteins Surface Properties
A peptide with surface-accessibility probability of >1.0 reflects more probable chances for a peptide to be found on the surface (Parker et al., 1986). Numerous peptides were predicted and the top ranked predicted peptides of SARS-CoV-2 structural proteins on the basis of surface probability (Y-axis) and sequence position (X-axis) were selected for further analyses (Supplementary File 1–4). The maximum surface probability scores for the membrane protein, envelope protein, nucleocapsid protein and spikes protein were analyzed as “YANRNR” 5.199, “YSRVKN: 4.136, “KKDKKK” 6.966, and “QDKNTQ” 6.051, respectively. Similarly, minimum surface probability scores for the membrane protein, envelope protein, nucleocapsid protein and spikes protein were observed as “LACFVL” 0.078, “LCAYCC” 0.088, “LALLLL” 0.05, and “VFLVLL” 0.07, respectively (Table 2).
The Karplus and Schulz (1985) flexibility method was utilized to calculate and analyze the atomic vibrational motions in the protein structure designated through B-factor and temperature. The stability and organization of the structure depends upon the B-factor values. The quality of the predicted models depends upon the B-factor values as a lower B-factor value is considered as an effective model while higher B-factor values lead to the less organized and poorly ordered structures (Karplus and Schulz, 1985; Table 2).
The hydrophilicity scale process of Parker was carried out to observe the peptides hydrophilicity based on the peptide retention times through HPLC on reversed phase column. Immunological analyses have revealed the association of antigenic sites with the hydrophilic regions (Parker et al., 1986). The antigenicity of SARS-CoV-2 was calculated through the Kolaskar & Tongaonkar method (Table 2). The predicted facts and data for all selected four protein properties are mentioned in the Supplementary Material (Supplementary File 1–4).
Structure-Based Epitope Prediction for SARS-CoV-2 Structural Proteins
The correlation among the protein structure antigenicity, epitope prediction, accessibility and flexibility within 3D structures were determined through ElliPro (Ponomarenko et al., 2008). The significant properties including protein-antibody interactions were analyzed to differentiate the predicted epitopes. The top-ranked five conformational epitopes for SARS-CoV-2 which had a score of ≥0.6 were observed and selected for further analyses. The PI (Isoelectric Point value) (Ponomarenko et al., 2008) score was observed to analyze the percentage of the atoms which extend over the molecular bulk and are also liable for the antibody binding. The top ranked 2 conformational predicted epitopes along with the residues name, length and locations were critically analyzed (Table 3) and the score was observed 0.703 and 0.706.
Molecular Docking Analyses of SARS-CoV-2 Structural Proteins With HLA-B
The comparative molecular docking analyses were executed for the top ranked 10 selected epitopes of SARS-CoV-2 out of 87 designed peptides with MHC class I HLA-B. The effective binding affinities have been observed for all the selected CTL epitopes having van der Waals (VdW) energy values ranges from −21.80 to −27.52 kcal/mol and the observed global energy was −25.01 to −53.65 kcal/mol (Table 4). The molecular docking analyses of the selected 10 CTL predicted epitopes were carried out and effective binding affinities with HLA-B were observed (Supplementary File 5).
The top 10 docked complexes were visualized (Figure 2) and a similar binding pocket was observed in all the selected peptides. It was observed that Tyr9, Ile66, Gln70, Tyr99, Tyr116, and Arg156 residues were conserved in all the selected peptides (Table 3).
Figure 2. Peptide-MHC class I HLA-B (pink color helices denotes the conserved binding domain of HLA-B and the remaining protein structure is presented in the wire shape), binding interacting residues of the top-ranked 10 peptides represented in different colors, 6 spike peptides brown color residues, 2 membrane peptides red color residues, 1 nucleocapsid, and 1 envelope peptide with purple and blue color residues, respectively.
Population Coverage Analyses
The population coverage analyses were performed with the selected MHC class I and MHC class II epitopes and also with the associated HLA alleles. It was observed that the selected MHC class I and MHC class II epitopes have the world’s population of 58.49 and 34.71%, respectively. MHC class I epitopes showed the highest coverage in the population of Italy (90.19%) and China (56.39%). The MHC class II epitopes also showed the highest coverage in the Philippines (71.92%) (Supplementary File 6).
Multiple Sequence Alignment
Multiple sequence alignment was performed for three CoV genomes and conserved binding residues were observed. It was observed that all the selected strains of the CoV have conserved domains, which is reconciled with the latest outbreak strain SARS-CoV-2. Interestingly, it was observed that the reported binding domain of the previously reported strain has a similar region of binding with latest outbreak of CoV, 2019. The binding residues of SARS-CoV-2 showed similar binding domains with MERS and SARS (Supplementary File 7).
Discussion
The need of dealing with CoVs has been increased since its recent breakout in China (Wuhan) affecting millions of humans. This SARS-CoV-2 viral attack has become a worldwide emergency in different regions of the World, especially in China (Mcclain, 1995). As an immediate response, numerous efforts from all over the world have been made to develop a peptide based vaccine against SARS-CoV-2, and the peptide inhibitors are of great interest to develop vaccines (Chew et al., 2017; Usman Mirza et al., 2017). The peptide targets are more preferable than traditional ligand-based drugs and vaccines due to different aspects including less toxic, fewer side-effects and their ultra-fast action. Immunoinformatics approaches help by reducing the work-load of laboratory trials, additionally these approaches are less time consuming and cost efficient than traditional approaches (Vanhee et al., 2011; Heurich et al., 2013; Xu et al., 2017). In the last 10 years, there has been much progress in in silico drug designing (Sehgal, 2017). Numerous biological problems are being solved by the implementation of different bioinformatics approaches (Sehgal et al., 2013; Sehgal, 2017; Tahir et al., 2018).
Researchers are striving mutually for a successful vaccine development and cure against COVID-19. Computational approaches were employed to analyze the synergistic effect by the combination of lopinavir, oseltamivir and ritonavir through molecular docking studies (Muralidharan et al., 2020).
Recently, molecular docking analyses along with virtual screening were performed against the drug candidates in clinical trials and approved drugs. Elbasvir, lopinavir, valrubicin, and carfilzomib were identified as potential compounds (Wang, 2020). Molecular docking analyses also revealed that luteolin and chloroquine also have the potential to inhibit the SARS-CoV-2 (Yu et al., 2020).
Recently, numerous research groups have struggled to design the subunit vaccines against SARS-CoV-2; though, the utilized workflow involved in the research either employ of a single protein to design the vaccine (Abdelmageed et al., 2020; Bhattacharya et al., 2020) or only CTL epitopes was used without considering the significance of HTL or B-cell epitopes (Seema, 2019). In current research work, all of these significant factors were considered to design the vaccine. Through extensive bioinformatics analyses, four proteins were utilized to design an epitope-based vaccine against SARS-CoV-2. The selected proteins for the analyses were membrane glycoprotein (M), nucleocapsid protein (N), envelop protein (E), and surface spike glycoprotein (S). The protein M helps in immunogenicity and assembly of the virus particles. The protein N has the ability to package the viral genome into a helical ribonucleocapsid and has a key role during viral self-assembly (Chang et al., 2013). The protein S has the ability to mediate the movement of the virus to human cells. The protein S is classified into two regions as S1 for the binding of the host receptor cell and S2 for the fusion of membrane. Due to the active involvement of protein S, it is considered as a key target for vaccine development, diagnostics and therapeutic antibodies for coronavirus (Du et al., 2009; Al-Amri et al., 2017; Prompetchara et al., 2020). By keeping the importance of protein S in mind, six different peptides were designed and analyzed.
The observed findings of antigenicity analysis range from 7.6 to 6.12% which is considered as an effective antigenic ability for a potent peptide, and similar ranges were observed in both studies of immunoinformatics analyses. Moreover, the binding domain of HLA-B was observed to be conserved in both studies and reconcile with the present research efforts (Usman Mirza et al., 2017; Tahir ul Qamar et al., 2020).
The potential CTL epitopes have been predicted for structural proteins of SARS-CoV-2. The molecular docking tools were used to analyze MHC-1 and peptide binding affinities for the selected peptides (Alam et al., 2016). Other evidences including C-terminal cleavage affinities also validated the binding affinity of the peptide-MHC-I complexes. In this study, ten peptides were reported as potential targets that showed effective interactions with the MHC-I protein (HLA-B), having maximum binding affinities and antigenicity. This increases the probability of the potential vaccine targets for the observed residues to be promising targets. The surface accessibility, surface flexibility as well as hydrophobicity and antigenicity for SARS-CoV-2 structural proteins were calculated and cross-verified by using the IEDB server (Sieker et al., 2009). An extensive literature review was performed and it was observed that the selected peptides were not reported against SARS-CoV-2. The predicted peptides were modeled by PEP-FOLD3 server and docked to MHC-1 using PatchDock and FireDock was used for further refinement. PyMOL and UCSF Chimera 1.11 were used to check the interactions of docked complexes.
The design and development of a potent vaccine needs an extensive investigation and analyses of immunological correlations with SARS-CoV-2. However, the experimental techniques would not be able to serve the urgency due to the severity and emergency of the COVID-19 outbreak. Therefore, in silico and computational predictions are helpful to guide the researchers to design a potential vaccine and help to control COVID-19. The vaccine development is an expensive and lengthy procedure with a high rate of failure, and several years are required to develop an effective commercial vaccine. Computational analyses suggest that the reported epitope-based vaccine peptides may have the ability to be protective against SARS-CoV-2 infection.
Conclusion
The aim of this work was to identify the effective peptide based inhibitors against SARS-CoV-2 structural protein (Membrane, Envelope, Nucleocapsid, and Spikes). The predicted epitopes were designed leading to the molecular docking analyses against MHC-I and interactional analyses of the selected docked complexes were analyzed. In conclusion, 10 Epitopes (six from spikes protein “LTDEMIAQY, WTAGAAAYY, TSNQVAVLY, CVADYSVLY, KTSVDCTMY, and STECSNLLL,” two from membrane protein “SSDNIALLV and ATSRTLSYY,” one from nucleocapsid and one from envelope protein “LSPRWYFYY and LTALRLCAY,” respectively), were predicted which might be potential targets as peptide vaccine against deadly SARS -CoV-2.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation, to any qualified researcher.
Author Contributions
MW, AH, MS, SS, and SAS performed the analyses and drafted the manuscript. All authors contributed to the article and approved the submitted version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
Thanks to Department of Bioinformatics and Biotechnology, Government College University Faisalabad and Department of Bioinformatics, University of Okara for providing the platform for this work.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2020.00227/full#supplementary-material
Abbreviations
CTL, cytotoxic T-lymphocyte; HLA, human leukocyte antigen; IEDB, immune epitope database; MERS-CoV, middle east respiratory syndrome coronavirus; MHC, major histocompatibility complex; PI, isoelectric point; RBD, receptor binding domain; SARS-CoV, severe acute respiratory syndrome coronavirus.
Footnotes
References
Abdelmageed, M. I., Abdelmoneim, A. H., Mustafa, M. I., Elfadol, N. M., Murshed, N. S., Shantier, S. W., et al. (2020). Design of a multiepitope-based peptide vaccine against the E protein of human COVID-19: an immunoinformatics approach. Biomed Res. Int. 2020:268 3286.
Ahmad, B., Ashfaq, U. A., Rahman, M. U., Masoud, M. S., and Yousaf, M. Z. (2019). Conserved B and T cell epitopes prediction of Ebola virus glycoprotein for vaccine development: an immuno-informatics approach. Microb. Pathog. 132, 243–253. doi: 10.1016/j.micpath.2019.05.010
Alam, A., Ali, S., Ahamad, S., Malik, M. Z., and Ishrat, R. (2016). From ZikV genome to vaccine: in silico approach for the epitope-based peptide vaccine against Zika virus envelope glycoprotein. Immunology 149, 386–399. doi: 10.1111/imm.12656
Al-Amri, S. S., Abbas, A. T., Siddiq, L. A., Alghamdi, A., Sanki, M. A., Al-Muhanna, M. K., et al. (2017). Immunogenicity of Candidate MERS-CoV DNA vaccines based on the spike protein. Sci. Rep. 7:44875.
Alexander, N., Woetzel, N., and Meiler, J. (2011). “bcl::Cluster : a method for clustering biological molecules coupled with visualization in the Pymol Molecular Graphics System,” in Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2011, Orlando, FL, 13–18.
Andrusier, N., Nussinov, R., and Wolfson, H. J. (2007). FireDock: fast interaction refinement in molecular docking. Proteins 69, 139–159. doi: 10.1002/prot.21495
Ashfaq, U. A., and Ahamed, B. (2016). De novo structural modeling and conserved epitopes prediction of Zika virus envelop protein for vaccine development. Viral Immunol. 29, 436–443. doi: 10.1089/vim.2016.0033
Beijing News (2020). Wen XMSiWCrNCP, and Another Suspected. Available online at: http://china.qianlong.com/2020/0121/3600877.shtml (accessed January 21, 2020).
Bhattacharya, M., Sharma, A. R., Patra, P., Ghosh, P., Sharma, G., Patra, B. C., et al. (2020). Development of epitope-based peptide vaccine against novel coronavirus 2019 (SARS-COV-2): immunoinformatics approach. J. Med. Virol. 92, 618–631. doi: 10.1002/jmv.25736
Cascella, M., Rajnik, M., Cuomo, A., Dulebohn, S. C., and Di Napoli, R. (2020). Features, Evaluation and Treatment Coronavirus (COVID-19). Treasure Island, FL: StatPearls Publishing.
Chang, C. K., Chen, C. M., Chiang, M. H., Hsu, Y. L., and Huang, T. H. (2013). Transient oligomerization of the SARS-CoV N protein–implication for virus ribonucleoprotein packaging. PLoS One 8:e65045. doi: 10.1371/journal.pone.0065045
Chew, M. F., Poh, K. S., and Poh, C. L. (2017). Peptides as therapeutic agents for dengue virus. Int. J. Med. Sci. 14, 1342–1359. doi: 10.7150/ijms.21875
Crooks, G. E., Hon, G., Chandonia, J. M., and Brenner, S. E. (2004). WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190. doi: 10.1101/gr.849004
Davies, M. N., and Flower, D. R. (2007). Harnessing bioinformatics to discover new vaccines. Drug Discov. Today 12, 389–395. doi: 10.1016/j.drudis.2007.03.010
de Wilde, A. H., Snijder, E. J., Kikkert, M., and van Hemert, M. J. (2018). Host Factors in Coronavirus Replication. Curr. Top. Microbiol. Immunol. 419, 1–42. doi: 10.1007/82_2017_25
Dimitrov, I., Naneva, L., Doytchinova, I., and Bangov, I. (2014). AllergenFP: allergenicity prediction by descriptor fingerprints. Bioinformatics 30, 846–851. doi: 10.1093/bioinformatics/btt619
Douglas, M. G., Kocher, J. F., Scobey, T., Baric, R. S., and Cockrell, A. S. (2018). Adaptive evolution influences the infectious dose of MERS-CoV necessary to achieve severe respiratory disease. Virology 517, 98–107. doi: 10.1016/j.virol.2017.12.006
Du, L., He, Y., Zhou, Y., Liu, S., Zheng, B. J., and Jiang, S. (2009). The spike protein of SARS-CoV–a target for vaccine and therapeutic development. Nat. Rev. Microbiol. 7, 226–236. doi: 10.1038/nrmicro2090
Emini, E. A., Hughes, J. V., Perlow, D. S., and Boger, J. (1985). Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide. J. Virol. 55, 836–839. doi: 10.1128/jvi.55.3.836-839.1985
Geer, L. Y., Marchler-Bauer, A., Geer, R. C., Han, L., He, J., He, S., et al. (2010). The NCBI biosystems database. Nucleic Acids Res. 38(Suppl._1), D492–D496.
Guy, J. S., Breslin, J. J., Breuhaus, B., Vivrette, S., and Smith, L. G. (2000). Characterization of a coronavirus isolated from a diarrheic foal. J. Clin. Microbiol. 38, 4523–4526. doi: 10.1128/jcm.38.12.4523-4526.2000
Heurich, M., Altintas, Z., and Tothill, I. E. (2013). Computational design of peptide ligands for ochratoxin A. Toxins 5, 1202–1218. doi: 10.3390/toxins5061202
Huang, P. T., Lo, P. H., Wang, C. H., Pang, C. T., and Lou, K. L. (2010). PPDock-Portal Patch Dock: a web server for drug virtual screen and visualizing the docking structure by GP and X-Score. Acta Crystallogr. A 66, S233–S234.
Imai, N. D. I, Cori, A., Riley, S., and Ferguson, N. M. (2020). Estimating the Potential Total Number of Novel Coronavirus (2019-nCoV) Cases in Wuhan City, China. Available online at: https://www.imperial.ac.uk/mrcglobal-infectious-disease-analysis/news–wuhan-coronavirus/ (accessed January 19, 2020).
Ip, P. P., Nijman, H. W., and Daemen, T. (2015). Epitope prediction assays combined with validation assays strongly narrows down putative cytotoxic T Lymphocyte epitopes. Vaccines 3, 203–220.
Karplus, P. A., and Schulz, G. E. (1985). Prediction of chain flexibility in proteins - a tool for the selection of peptide antigens. Naturwissenschaften 72, 212–213. doi: 10.1007/bf01195768
Kingsford, C. L., Chazelle, B., and Singh, M. (2005). Solving and analyzing side-chain positioning problems using linear and integer programming. Bioinformatics 21, 1028–1036. doi: 10.1093/bioinformatics/bti144
Lamiable, A., Thevenet, P., Rey, J., Vavrusa, M., Derreumaux, P., and Tuffery, P. (2016). PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex. Nucleic Acids Res. 44, W449–W454.
Lazarski, C. A., Chaves, F. A., Jenks, S. A., Richards, K. A., Weaver, J., et al. (2005). The kinetic stability of MHC class II: peptide complexes is a key parameter that dictates immunodominance. Immunity 23, 29–40. doi: 10.1016/j.immuni.2005.05.009
Martina, B. E., Haagmans, B. L., Kuiken, T., Fouchier, R. A., Rimmelzwaan, G. F., and Van Amerongen, G. (2003). Virology: SARS virus infection of cats and ferrets. Nature 425:915. doi: 10.1038/425915a
Mashiach, E., Schneidman-Duhovny, D., Andrusier, N., Nussinov, R., and Wolfson, H. J. (2008). FireDock: a web server for fast interaction refinement in molecular docking. Nucleic Acids Res. 36, W229–W232.
Maupetit, J., Tuffery, P., and Derreumaux, P. (2007). A coarse-grained protein force field for folding and structure prediction. Proteins 69, 394–408. doi: 10.1002/prot.21505
Mcclain, C. S. (1995). A new look at an old disease - smallpox and biotechnology. Perspect. Biol. Med. 38, 624–639. doi: 10.1353/pbm.1995.0000
Muralidharan, N., Sakthivel, R., Velmurugan, D., and Gromiha, M. M. (2020). Computational studies of drug repurposing and synergism of lopinavir, oseltamivir and ritonavir binding with SARS-CoV-2 protease against COVID-19. J. Biomol. Struct. Dyn. 16, 1–6. doi: 10.1080/07391102.2020.1752802
Nain, Z., Abdullah, F., Rahman, M. M., Karim, M. M., Khan, M. S. A., Bin Sayed, S., et al. (2019). Proteome-wide screening for designing a multi-epitope vaccine against emerging pathogen Elizabethkingia anophelis using immunoinformatic approaches. J. Biomol. Struct. Dyn. 38, 4850–4867. doi: 10.1080/07391102.2019.1692072
Nair, D. T., Singh, K., Siddiqui, Z., Nayak, B. P., Rao, K. V. S., and Salunke, D. M. (2002). Epitope recognition by diverse antibodies suggests conformational convergence in an antibody response. J. Immunol. 168, 2371–2382. doi: 10.4049/jimmunol.168.5.2371
Palatnik-de-Sousa, C. B., Soares, I. D. S., and Rosa, D. S. (2018). Editorial: epitope discovery and synthetic vaccine design. Front. Immunol. 9:826. doi: 10.3389/fimmu.2018.00826
Parker, J. M. R., Guo, D., and Hodges, R. S. (1986). New hydrophilicity scale derived from high-performance liquid-chromatography peptide retention data - correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. Biochemistry 25, 5425–5432. doi: 10.1021/bi00367a013
Peiris, J. S., Chu, C. M., Cheng, V. C., Chan, K. S., Hung, I. F., and Poon, L. L. (2003). Clinical progression and viral load in a community outbreak of coronavirus-associated SARS pneumonia: a prospective study. Lancet 361, 1767–1772. doi: 10.1016/s0140-6736(03)13412-5
Pettersen, E. F., Goddard, T. D., Huang, C. C., Couch, G. S., Greenblatt, D. M., Meng, E. C., et al. (2004). UCSF chimera - a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612. doi: 10.1002/jcc.20084
Ponomarenko, J., Bui, H. H., Li, W., Fusseder, N., Bourne, P. E., Sette, A., et al. (2008). ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics 9:514. doi: 10.1186/1471-2105-9-514
Prompetchara, E., Ketloy, C., and Palaga, T. (2020). Immune responses in COVID-19 and potential vaccines: lessons learned from SARS and MERS epidemic. Asian Pac. J. Allergy Immunol. 38, 1–9.
Rodrigues, T. C. V., Jaiswal, A. K., de Sarom, A., Oliveira, L. D., Oliveira, C. J. F., Ghosh, P., et al. (2019). Reverse vaccinology and subtractive genomics reveal new therapeutic targets against Mycoplasma pneumoniae: a causative agent of pneumonia. R. Soc. Open Sci. 6:190907. doi: 10.1098/rsos.190907
Sayers, E. W., Cavanaugh, M., Clark, K., Ostell, J., Pruitt, K. D., and Karsch-Mizrachi, I. (2019). GenBank. Nucleic Acids Res. 47, D94–D99.
Sayers, E. W., Cavanaugh, M., Clark, K., Ostell, J., Pruitt, K. D., and Karsch-Mizrachi, I. (2020). GenBank. Nucleic Acids Res 48, D84–D86.
Seema, M. T. (2019). Cell epitope-based vaccine design for pandemic novel Coronavirus -nCoV2020. ChemRxiv [Preprint]. doi: 10.26434/chemrxiv.12029523.v2
Sehgal, S. A. (2017). Pharmacoinformatics, adaptive evolution, and elucidation of six novel compounds for schizophrenia treatment by targeting DAOA (G72) isoforms. Biomed Res. Int. 2017:5925714.
Sehgal, S. A., Khattak, N. A., and Mir, A. (2013). Structural, phylogenetic and docking studies of D-amino acid oxidase activator (DAOA), a candidate schizophrenia gene. Theor. Biol. Med. Model. 10:3. doi: 10.1186/1742-4682-10-3
Sieker, F., May, A., and Zacharias, M. (2009). Predicting affinity and specificity of antigenic peptide binding to major histocompatibility class I molecules. Curr. Protein Pept. Sci. 10, 286–296. doi: 10.2174/138920309788452191
Sievers, F., and Higgins, D. G. (2014). Clustal omega. Curr. Protoc. Bioinformatics 48, 3.13.1–3.13.16.
Sievers, F., and Higgins, D. G. (2018). Clustal Omega for making accurate alignments of many protein sequences. Protein Sci. 27, 135–145. doi: 10.1002/pro.3290
Snijder, E. J., Bredenbeek, P. J., Dobbe, J. C., Thiel, V., Ziebuhr, J., and Poon, L. L. (2003). Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage. J. Mol. Biol. 331, 991–1004. doi: 10.1016/s0022-2836(03)00865-9
Tahir, R. A., Wu, H., Rizwan, M. A., Jafar, T. H., Saleem, S., and Sehgal, S. A. (2018). Immunoinformatics and molecular docking studies reveal potential epitope-based peptide vaccine against DENV-NS3 protein. J. Theor. Biol. 459, 162–170. doi: 10.1016/j.jtbi.2018.10.005
Tahir ul Qamar, M. T., Rehman, A., Ashfaq, U. A., Awan, M. Q., Fatima, I., Shahid, F., et al. (2020). Designing of a next generation multiepitope based vaccine (MEV) against SARS-COV-2: immunoinformatics and in silico approaches. bioRxiv [Preprint]. doi: 10.1101/2020.02.28.970343
Tahir ul Qamar, M., Saleem, S., Ashfaq, U. A., Bari, A., Anwar, F., and Alqahtani, S. (2019a). Epitope-based peptide vaccine design and target site depiction against Middle East Respiratory Syndrome Coronavirus: an immune-informatics study. J. Transl. Med. 17:362.
Tahir ul Qamar, M., Saleem, S., Ashfaq, U. A., Bari, A., Anwar, F., and Alqahtani, S. (2019b). Epitope-based peptide vaccine design and target site depiction against Middle East Respiratory Syndrome Coronavirus: an immune-informatics study. J. Transl. Med. 17:362.
Usman Mirza, M., Rafique, S., Ali, A., Munir, M., Ikram, N., Manan, A., et al. (2017). Towards peptide vaccines against Zika virus: immunoinformatics combined with molecular dynamics simulations to predict antigenic epitopes of Zika viral proteins. Sci. Rep. 6:37313.
Vanhee, P., van der Sloot, A. M., Verschueren, E., Serrano, L., Rousseau, F., and Schymkowitz, J. (2011). Computational design of peptide ligands. Trends Biotechnol. 29, 231–239. doi: 10.1016/j.tibtech.2011.01.004
Vita, R., Mahajan, S., Overton, J. A., Dhanda, S. K., Martini, S., Cantrell, J. R., et al. (2019). The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 47, D339–D343.
Wang, J. (2020). Fast identification of possible drug treatment of Coronavirus Disease-19 (COVID-19) through computational drug repurposing study. J. Chem. Inf. Model. 60, 3277–3286. doi: 10.1021/acs.jcim.0c00179
Wilkins, M. R., Gasteiger, E., Bairoch, A., Sanchez, J. C., Williams, K. L., Appel, R. D., et al. (1999). Protein identification and analysis tools in the ExPASy server. Methods Mol. Biol. 112, 531–552. doi: 10.1385/1-59259-584-7:531
World Health Organization [WHO]. (2020). WHO Statement Regarding Cluster of Pneumonia Cases in Wuhan CAohwwicnd—w-s-r-c. Beijing: WHO.
Wuhan Municipal Health Commission (2020). http://en.nhc.gov.cn/2020-04/06/c_78861_2.htm (accessed January 19, 2020).
Xiao, Y., Li, Z., Wang, X., Wang, Y., Wang, Y., Wang, G., et al. (2020). Comparison of three TaqMan real-time reverse transcription-PCR assays in detecting SARS-CoV-2. BioRxiv [Preprint]. doi: 10.1101/2020.07.06.189860
Xu, D. R., Bian, H. L., Cai, J. L., Bao, D. C., Jin, Q., Zhu, M., et al. (2017). Computational design of peptide ligands to target the intermolecular interaction between viral envelope protein and pediatric receptor. Comput. Biol. Chem. 69, 120–125. doi: 10.1016/j.compbiolchem.2017.06.001
Keywords: immunoinformatics, SARS-CoV-2, SARS-CoV, peptide vaccines, corona virus disease 2019
Citation: Waqas M, Haider A, Sufyan M, Siraj S and Sehgal SA (2020) Determine the Potential Epitope Based Peptide Vaccine Against Novel SARS-CoV-2 Targeting Structural Proteins Using Immunoinformatics Approaches. Front. Mol. Biosci. 7:227. doi: 10.3389/fmolb.2020.00227
Received: 01 April 2020; Accepted: 11 August 2020;
Published: 15 October 2020.
Edited by:
Francesco Luigi Gervasio, University College London, United KingdomReviewed by:
Jan Prchal, University of Chemistry and Technology, Prague, CzechiaValentina Tozzini, Consiglio Nazionale delle Ricerche, Italy
Luisa Di Paola, Campus Bio-Medico University, Italy
Yassmine Chebaro, Centre National de la Recherche Scientifique (CNRS), France
Copyright © 2020 Waqas, Haider, Sufyan, Siraj and Sehgal. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sheikh Arslan Sehgal, YXJzbGFuc2VoZ2FsQHlhaG9vLmNvbQ==