Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 12 August 2022
Sec. Virology
This article is part of the Research Topic Coronavirus: Broad-Spectrum Anti-viral Targets and Treatments View all 10 articles

B.1.1.7 (Alpha) variant is the most antigenic compared to Wuhan strain, B.1.351, B.1.1.28/triple mutant and B.1.429 variants

  • 1Department of Zoology, Fakir Mohan University, Balasore, Odisha, India
  • 2Institute for Skeletal Aging and Orthopedic Surgery, Hallym University-Chuncheon Sacred Heart Hospital, Chuncheon, Gangwon-do, South Korea
  • 3Department of Applied Science, Galgotias College of Engineering and Technology, Greater Noida, Uttar Pradesh, India
  • 4Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal, India

The rapid spread of the SARS-CoV-2 virus and its variants has created a catastrophic impact worldwide. Several variants have emerged, including B.1.351 (Beta), B.1.1.28/triple mutant (P.1), B.1.1.7 (Alpha), and B.1.429 (Epsilon). We performed comparative and comprehensive antigenicity mapping of the total S-glycoprotein using the Wuhan strain and the other variants and identified 9-mer, 15-mer, and 20-mer CTL epitopes through in silico analysis. The study found that 9-mer CTL epitope regions in the B.1.1.7 variant had the highest antigenicity and an average of the three epitope types. Cluster analysis of the 9-mer CTL epitopes depicted one significant cluster at the 70% level with two nodes (KGFNCYFPL and EGFNCYFPL). The phage-displayed peptides showed mimic 9-mer CTL epitopes with three clusters. CD spectra analysis showed the same band pattern of S-glycoprotein of Wuhan strain and all variants other than B.1.429. The developed 3D model of the superantigen (SAg)-like regions found an interaction pattern with the human TCR, indicating that the SAg-like component might interact with the TCR beta chain. The present study identified another partial SAg-like region (ANQFNSAIGKI) from the S-glycoprotein. Future research should examine the molecular mechanism of antigen processing for CD8+ T cells, especially all the variants’ antigens of S-glycoprotein.

Introduction

The COVID-19 has affected the global population, and a surge of the SARS-CoV-2 has been noted in many countries (Chakraborty et al., 2020; Dinesh et al., 2020; Li and Liu, 2020). Due to this COVID-19 wave, high peaks of daily infection and death occurred in these countries. One of the most significant causes of surges in different countries is the emergence of new variants, developing because of mutations (Brüssow, 2021).

Several significant variants of concern/interest (VOCs/VOIs) of this virus [B.1.351, (Beta), B.1.1.28/triple mutant (P.1), B.1.1.7 (Alpha), and B.1.429 (Epsilon)] have been determined in South Africa, Brazil, United Kingdom, and the United States and have mushroomed to many other countries (Chakraborty et al., 2021a,b; Focosi et al., 2021; Harvey et al., 2021). These newly evolved variants with significant mutations have developed from the original Wuhan strain in different regions worldwide. The ongoing evolution of the SARS-CoV-2 virus is due to the region-wise adaptive changes and multiple mutations during the evolutionary process (Chakraborty et al., 2021c; Rochman et al., 2021). Diverse variants of this virus have developed over time due to the several mutations (Figure 1A). However, four significant variants (B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429) have been identified in different regions worldwide. All four variants have mutations within the spike (S) protein, which change the attachment site of the virus, human cell entry, and strong binding affinity with VOC/VOI and receptor compared to the Wuhan strain. The UK variant (B.1.1.7) steadily surged in just a few months and is now found in at least 114 countries worldwide (Davies et al., 2021). Several significant mutations have been identified in the B.1.1.7 lineage. One significant mutation (N501Y) was observed in the viral S-glycoprotein of the receptor-binding domain (RBD). Similarly, four mutations were observed in the S1-domain (del144Y, del69–70 HV, D614G, and A570D), and four mutations were noted in the S2 domain (P681H, S982A, T761I, and D1118H) (Li et al., 2021; Meng et al., 2021; Shen et al., 2021). The B.1.351 lineage consists of three significant mutations in the RBD region of the S-glycoprotein. These mutations are noted as K417N, E484K, and N501Y. Five important mutations have been observed in the N-terminal domain: R246I, D215G, L18F, D80A, and a deletion mutation at amino acid positions 242–244. Another mutation (A701V) in the S2 subunit has also been observed (Cele et al., 2021; Wang et al., 2021b). The P.1 (B.1.1.28.1) lineage has three significant mutations (K417T, E484K, and N501Y) in the RBD region of the S-glycoprotein, and eight other significant amino acid changes have been found (D138Y, T20N, L18F, R190S, P26S, T1027I, H655Y, and V1176F). The B.1.429 lineage has four mutations, two mutations (L452R and D614G) reside within the RBD, and two mutations (S13I and W152C) are found in the NTD (N-terminal domain) segment of the S-glycoprotein. These mutational changes exhibit augmented neutralization resistance (Faria et al., 2021; Wang et al., 2021a). The present study has explored the antigenicity of B.1.1.28/triple mutant along with two VOCs (B.1.1.7 and B.1.351). Similarly, several mutations have been noted in the S-glycoprotein of these variants (Naveca et al., 2021).

FIGURE 1
www.frontiersin.org

Figure 1. The figure depicted the timeline and mutations of newly significant emerging variants and Wuhan strain. (A) Timeline showing the origin time of some newly significant emerging variants (B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429) and Wuhan strain. (B) The schematic diagram shows the significant mutations present in the CTL antigenic regions of newly emerging SARS-CoV-2 variants S-protein.

Antigenicity mapping is a powerful technique for understanding the antigenic properties of any virus and can be analyzed using epitope mapping (Sun et al., 2013; Anderson et al., 2018). Antigenicity mapping can help to understand different characteristics such as antigenic sites and immune escape mechanisms (Peacock et al., 2016). CD8+ T cells epitopes are also called cytotoxic T lymphocytes (CTL) epitopes and are one of the critical determinants for understanding the CD8+ T cell response and antigenic properties (Feliu et al., 2013; Tscharke et al., 2015). Recently, Gangaev et al. (2021) attempted to map the immunodominant and immunogenic epitopes of this virus to illustrate the role of CD8+ T cells in host defense (Gangaev et al., 2021). Understanding CTL epitopes with antigenic properties can assist in vaccine design (Wong et al., 2019; Bhattacharya et al., 2020a,b, 2021). Therefore, CTL epitope mapping plays a vital role in determining the antigenicity of a pathogen. Researchers have examined 9-mer, 15-mer, and 20-mer CTL epitopes to understand the antigenicity of pathogens and designed vaccines against these pathogens (Doi et al., 2009; Bhatnager et al., 2020; Rencilin et al., 2021; Shey et al., 2021). The 9-mer CTL epitopes determine the viral antigenicity (Raza et al., 2021). Several researchers have used 9-mer CTL epitopes to assess the antigenic peptides from S-glycoprotein, and this 9-mer CTL epitope has been used for vaccine development against SARS CoV-2 (Kar et al., 2020; Rencilin et al., 2021).

Epitope clustering strategies might help capture the diversity of pathogens’ epitopes and understand highly similar or overlapping sequences. Previous studies have examined CTL epitope clustering for different pathogens (Liu et al., 2010; Rist et al., 2015; Vujovic et al., 2020). In the present study, the epitope clustering method was used to understand the clustering of CTL epitopes of the S-glycoprotein of the Wuhan strain (wild type) and mutant variants of SARS-CoV-2. Circular dichroism (CD) spectra provide information on the molecular structure of proteins at a quantitative level (Micsonai et al., 2015). The CD spectra can also provide information on the molecular structure of a mutant protein (Huai et al., 2021). We analyzed the CD spectra of the Wuhan strain and mutant variants of the S-glycoprotein of this virus.

In patients with severe cases, toxic shock syndrome has been observed, which causes hyperinflammation. Cheng et al. (2020) observed a superantigenic insert, which is a unique S-glycoprotein, in SARS-CoV-2, and concluded that the insert might be responsible for causing hyper inflammation in patients with severe COVID-19 (Cheng et al., 2020). SAgs can excessively activate the immune system. Previous studies have identified SAgs from viruses (Lafon, 2000). A SAg-like region (PRRA) was recently detected in this virus S-glycoprotein of the Wuhan strain (Cheng et al., 2020). We attempted to analyze the superantigenic insert in the Wuhan strain and mutant variants further.

In this study, our objective was to perform a comprehensive in silico analysis of the total S-glycoprotein of the Wuhan strain, B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants to understand the antigenicity using the CTL epitopes or SAg-like region of S-glycoprotein and the interaction between the SAg-like area within the and the T-cell antigen receptor (TCR).

Materials and methods

Data acquisition of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

The sequences of amino acids within the S protein of the Wuhan strain and the four significant mutant variants were retrieved from the NCBI/PDB (protein database) in FASTA format. The accession numbers of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants are QHR63290.2, 7LYL_A, 7LWW_A, 7LWV_A, and 7N8H_A, respectively. Similarly, we collected PDB files from the PDB database for the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants, 6VXX 7LYL, 7LWW, 7LWV, and 7N8H, respectively. We started our work with the PDB immediately at that time of deposition of the structure for 7LWW. We found that the 7LWW was mentioned as structure S-glycoprotein of the B.1.1.28 in the PDB database.

Afterward, we found that the PDB is mentioned as a triple mutant in some pieces of literature. In this study, we have used both the two parameters (“B.1.1.28” and “triple mutant”) at the same time for describing the 7LWW and mentioning it as B.1.1.28/triple mutant.

Identification of 9-mer, 15-mer, and 20-mer CTL epitopes of total S-glycoprotein and RBD region of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We identified the 9-mer, 15-mer, and 20-mer CTL epitopes of S-glycoprotein from the Wuhan strain and all four variants (B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429) and analyzed their antigenic sequences. To identify any promising MHC-I (Major Histocompatibility Complex class I) binding epitopes, we used ProPred-I, which is an online web server for predicting 9-mer, 15-mer, and 20-mer peptide binding to MHC-I alleles (Singh and Raghava, 2003). Finally, we calculated the average number of epitopes in the five variants.

Antigenicity predictions of the identified epitopes of the S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

In the present study, we employed the VaxiJen server, which uses a unique algorithm comprising an auto cross-covariance-based machine-learning server and an alignment-based prediction process (Doytchinova and Flower, 2007). This server uses a default parameter and threshold to calculate the antigenicity of pathogens. Here, we applied 0.4 as the threshold value for the viral peptide.

Comprehensive and comparative analysis of significant 9-mer CTL epitopic landscape of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We performed a comprehensive comparative analysis of the Wuhan strain’s significant 9-mer CTL epitopic landscape and significant VOCs/VOI/others. We attempted to determine the different essential mutations present in the 9-mer CTL epitopes in the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. Finally, we hypothesized that the epitopes mutated from the Wuhan strain evolved into different significant VOCs/VOI/others.

MSA using the identified CTL epitopes (9-mer, 15-mer, and 20-mer) of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

The 9-mer, 15-mer, and 20-mer CTL epitopes of the S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants were used to perform MSA. MSA was used to undertake sequence alignment with Clustal Omega (Sievers and Higgins, 2018, 2021). The tool applied the HHalign technique to align the landscape of HMMs (Hidden Markov models).

Cluster analysis of CTL epitopes (9-mer, 15-mer, and 20-mer) of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

The clustering of peptides is a well-known problem in biological science. The analysis of the clustering of epitopes informs us of the epitope homology. It also helps us to generate a precise consensus sequence, which is essential to understanding a group of sequences sharing a defined level of identity. We performed cluster analysis at the different threshold levels (10 to 80%).

The clustering tool resolves the similarity between peptide sequences in separate and specific clusters. It is also ideal for analyzing sequences that may cluster together, even though they are derived explicitly from distinct antigenic sequences. It also specifies and identifies regions of homology between unrelated peptide sequences linked with several critical biological phenomena (allergen cross-reactivity, molecular mimicry, modulation of adaptive immune responses). Researchers also identified the epitopes in a large-scale screen of overlapping peptides that frequently share substantial sequence similarities, usually complicating the analysis of epitope-related data (Stufano et al., 2010). Therefore, clustering algorithms are often used to simplify these analyses. However, existing approaches are commonly inadequate in their capacity to define the biologically meaningful epitope clusters in the advanced framework of the immune response. Therefore, this algorithm-based IEDB epitope cluster analysis tool is applied to generate epitope clusters based on consensus or representative sequences (Vita et al., 2015). This specialized tool also allows the researchers to cluster the target peptide sequences based on a specified level of identity by selecting among changed method options (Dhanda et al., 2018). From a computational standpoint and advanced phase of immunoinformatics, the clustering tool is flexible enough to consider diverse strategies to address different immunological queries using B cell and T cell epitopes (Vivona et al., 2008).

We performed cluster analysis of the CTL epitopes (9-mer, 15-mer, and 20-mer) of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. We used the IEDB epitope cluster analysis tool (Zhang et al., 2008; Dhanda et al., 2018). This tool graph visualizes the generated network using Python networks.

Exploration of the CD spectra and different components of the secondary structure of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

CD spectroscopy is a significant parameter for understanding the protein structure. A CD spectrum is an essential component that can help rapidly determine the secondary structure of proteins (Greenfield, 2006). We can understand different secondary structure elements using the CD spectra via MD simulations (Rogers et al., 2019; Drew and Janes, 2020). The PDBMD2CD is a server that can generate CD spectra using PDB files. Using this server, we analyzed the CD spectra of the S-glycoprotein of the Wuhan strain and significant VOCs/VOIs (Drew and Janes, 2020). The analysis illustrated information on the wavelength of the CD signal. The server produces a result with an RMSD smaller or equivalent to 0.5, the utmost away from the investigational spectrum. We used this server to analyze the secondary structural features such as the helix, antiparallel sheet, parallel sheet, and turn of the S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants.

Tertiary or 3D model generation of the 9-mer CTL epitopic regions

We developed the tertiary structure of 9-mer CTL epitopes from the S-glycoprotein of the Wuhan strain and significant VOCs/VOIs/others using the DISTILL 2.0 server (Baú et al., 2006). This tool adopts an algorithm based on recursive neural network architectures. The algorithm can predict through a specific architecture of a neural network (single-or dual-layer) and directs an acyclic graph (Li et al., 2019). Different epitopes were marked in the tertiary structure, generated from the PDB file. The 3D structure of the PDB was developed using chimera visualization software (Pettersen et al., 2004).

Phage-displayed peptides that mimic SARS-CoV-2 9-mer CTL epitopic peptides

A peptide library for phage-displayed was applied for epitope mapping (Wu et al., 2016). We analyzed the SARS-CoV-2 9-mer CTL epitopic peptides that mimic phage-displayed peptides’ epitopes. We used a routine technique to map the location of the epitope using a phage-display library. We used the Pepitope server to perform this analysis, which applied the PepSurf algorithm. This algorithm is based on a specific amino acid similarity matrix (Mayrose et al., 2007).

Identification of the SAg-like region from the S-glycoprotein of SARS-CoV-2 of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants, and their tertiary or 3D model generation

We have identified the SAg-like region from the S-glycoprotein of SARS-CoV-2 of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. We developed a 3D structure model using the same SAg-like region from the S-glycoprotein of Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants using the DISTILL 2.0 server (Baú et al., 2006).

Interaction between SAg-like part of S-glycoprotein of SARS-CoV-2 and TCR

A 3D structural model of the SAg-like region of the S-glycoprotein of this virus and TCR interaction was illustrated with the HADDOCK server (PDB ID: 4WW1; De Vries et al., 2010), which was used for protein–protein docking. The docking analysis was further validated using the ClusPro server (Kozakov et al., 2017).

Identification of the partial SAg-like part of this virus S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

The SAgs used in the present study were: α-cobra toxin (Naja naja), α-bungarotoxin, rabies virus G protein (189–199), α-cobra toxin (Naja kaouthia), and HIV-1 gp120 (164–174). We utilized the entire sequence of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants during the present study. We found another SAg-like part from the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. The peptide sequence of the SAg-like part of all these SARS-CoV-2 virus sequences (both wild and mutant type) was used for cluster formation.

Finally, we have depicted a flowchart of the materials and methods to provide a bird’s eye view of the current study (Figure 2).

FIGURE 2
www.frontiersin.org

Figure 2. The flow diagram illustrates our used methodologies to test the objective of the study hypothesis. Our aim/objective of this research is to understand the antigenicity of the significant variants which have emerged from time to time, such as the Wuhan strain, the B.1.351 (Beta), B.1.1.28/triple mutant (P.1), B.1.1.7 (Alpha), and B.1.429 (Epsilon).

Results

Total number of CTL epitopes (9-mer, 15-mer, and 20-mer) in the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We identified 9-mer, 15-mer, and 20-mer CTL epitopes in the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. We found that B.1.1.7 contained a maximum number of 9-mer CTL epitopes (approximately six), whereas the B.1.429 variant had a minimum number of 9-mer CTL epitopes (approximately three; Figure 3A). We noted a maximum number of 15-mer CTL epitopes in the S-glycoprotein in B.1.1.7 and B.1.351 (approximately five), whereas the Wuhan strain and B.1.1.28/triple mutant variant contained a minimum number of 15-mer CTL epitopes in the S-glycoprotein (approximately three; Figure 3A; Supplementary Table S1). However, the 20-mer CTL epitopes in the S-glycoprotein showed slightly different results. We found a maximum number of 20-mer CTL epitopes in the S-glycoprotein in B.1.351 (approximately four). In contrast, the Wuhan strain and the B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants contained a minimum number of 20-mer CTL epitopes in the S-glycoprotein (approximately three; Figure 3A; Supplementary Table S2). Calculating the average number of epitopes found that the average number of epitopes was the same in B.1.1.7 and B.1.351 (Figure 3A).

FIGURE 3
www.frontiersin.org

Figure 3. Comparison of the number of the 9 mer, 15 mer, 20 mer CTL antigenic epitopes of total S-glycoprotein and RBD region of S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant. (A) Comparison of the number of the 9 mer, 15 mer, 20 mer, and an average number of CTL antigenic epitopes of total S-glycoprotein. (B) Comparison of the number of 9 mer, 15 mer, 20 mer, and the average number of CTL antigenic epitopes of RBD region of S-glycoprotein. We develop the polynomial (order 2) relationships for the two graphs (A; R2 = 0.1793 and B; R2 = 0.9524).

Total number of CTL epitopes (9-mer, 15-mer, and 20-mer) in the RBD region of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We identified 9-mer, 15-mer, and 20-mer CTL epitopes in the RBD region of the S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. We found that the B.1.1.7, B.1.351, and B.1.1.28/triple mutant variants contained a maximum number of 9-mer CTL epitopes (approximately two). In contrast, the Wuhan strain and B.1.429 variant had a minimum number of 9-mer CTL epitopes (approximately one; Figure 3B).

We noted a maximum number of 15-mer CTL epitopes in the RBD region of the S-glycoprotein in the B.1.1.7 variant (approximately two). In contrast, the Wuhan strain and B.1.351, B.1.1.28/triple mutant, and B.1.429 variants contained no 15-mer CTL epitopes (Figure 3B).

However, the 20-mer CTL epitopes in the RBD region showed slightly different results. We found one 20-mer CTL epitope in the RBD region in B.1.351 and B.1.1.7. The other two variants (Wuhan strain and the B.1.1.28/triple mutant) contained no 20-mer CTL epitopes in the RBD region of this virus (Figure 3B).

By calculating the average number of epitopes in the RBD region, we found that highest average number of epitopes was contained in the B.1.1.7 variant (Figure 3A).

Comprehensive analysis of significant 9-mer CTL epitopic landscape of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We performed comparative antigenicity mapping of the S-glycoprotein on the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants, with 23 epitopes identified. Four epitopes were non-antigenic based on their VaxiJen score (> 0.4). The significant mutations in the variants containing the 9-mer epitope played a crucial role in viral antigenicity (Figure 1B). Five 9-mer antigenic CTL epitopes were identified from the S-glycoprotein of the Wuhan strain. Similarly, the S-glycoprotein of B.1.351 contained five 9-mer CTL antigenic epitopes covering the mutagenic regions. Among these five epitopes, three were highly antigenic, as predicted by the VaxiJen server. The S-glycoprotein of the B.1.1.28/triple mutant variant contained four 9-mer CTL antigenic epitopes. We found the highest number of 9-mer CTL antigenic epitopes in the B.1.1.7 variant (approximately six), with all six antigenic epitopes showing a significant VaxiJen score (highest score = 1.5485 and lowest score = 0.4551). We found the lowest number of 9-mer CTL antigenic epitopes in B.1.429 (approximately three), with all showing a significant to mild antigen-related VaxiJen score (highest score = 0.4587 and lowest score = 0.6465; Figure 3A).

We performed a comparative assessment and characterization of the 9-mer CTL antigenic epitopes in the S-glycoprotein chain of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants (Table 1). However, during the comparative assessment and characterization, we found that some of the 9-mer CTL antigenic epitopes of the Wuhan strain were also found in particular parts of the S-glycoprotein in the other four variants.

TABLE 1
www.frontiersin.org

Table 1. Identification and comparative analysis of 9 mer CTL epitopes of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variant.

Our study observed that the Wuhan strain contained five 9-mer CTL epitopes. One epitope with residue positions 1,060–1,068 showed the highest antigenic score (1.5122; Figure 4A). Similarly, the B.1.351 variant showed five significant mutation 9-mer CTL epitopes. The 9-mer CTL epitope with residue positions 417–425 showed the highest antigenic score (1.5325) and contained the K417N mutation (Figure 4B). This mutation might play a crucial role in providing the highest antigenicity for this particular epitope.

FIGURE 4
www.frontiersin.org

Figure 4. A schematic diagram of significant 9 mer CTL epitopic landscape of S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant. The figure also shows the significant mutations present in the 9 mer CTL epitopic regions. (A) A schematic diagram of 9 mer CTL antigenic epitopes identified of S-glycoprotein in Wuhan strain. (B) A schematic diagram of 9 mer CTL antigenic epitopes recognized of S-glycoprotein in the B.1.351 variant with significant mutations in the epitopic regions. (C) A schematic diagram of 9 mer CTL antigenic epitopes identified of S-glycoprotein in B.1.1.28/triple mutant variant with substantial mutations in the epitopic areas. (D) A schematic diagram of 9 mer CTL antigenic epitopes recognized of S-glycoprotein in B.1.1.7 variant with significant mutations in the epitopic regions. (E) A schematic diagram of 9 mer CTL antigenic epitopes identified of S-glycoprotein in B.1.429 variant with substantial mutations in the epitopic regions.

We found four 9-mer CTL epitopes in B.1.1.28/triple mutant. Among the epitopes, the highest antigenic score was 1.5476 in the 9-mer CTL epitope, where no mutations were observed (Figure 4C). The other three epitopes had significant mutations. The second 9-mer CTL epitope (epitopic residue positions 680–688) contained the R682G mutation, and the third epitope (epitopic residue positions 497–505) contained the N501Y mutation. The last one (epitopic residue positions 417–425) had a K417N mutation.

The B.1.1.7 variant contained six 9-mer CTL epitopes. The epitope with residue positions 497–505 showed the highest antigenicity (antigenic score 1.5485). This CTL epitope contained the N501Y mutation (Figure 4D), which might substantially provide the antigenicity for this CTL epitope. Other epitopes included different mutations [G481E (epitopic residue positions 481–489), A567D (epitopic residue positions 564–572), T715F (epitopic residue positions 715–723), and N1115H (epitopic residue positions 1,111–1,119)].

The 9-mer CTL epitopes analysis found that the B.1.429 variant contained one significant antigenic CTL epitope (antigenic score 0.4587). This epitope, with residue positions 152–160, consisted of a substantial mutation (W152C). In this variant, we also found two non-antigenic epitopes: the first one with residue positions 452–460 (antigenic score−0.6465) and the second one with residue positions 132–149 (antigenic score−0.2493; Figure 4E). The CTL epitope with residue positions 452–460 contained the L452R mutation, which might help generate a negative score for the epitope. Therefore, in this variant, antigenicity loss was noted in the 9-mer CTL epitopes.

Comparative mapping of significant 9-mer CTL epitope of the S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

Comparative sequence homology analysis of the 9-mer CTL epitopes using S-glycoprotein from the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants showed some epitopic homology among significant VOCs/VOI/others. Sequence homology was observed between the epitopic regions of the S-glycoprotein in the two variants such as B.1.351 and B.1.1.7. We found epitopic homology between the 481–489 residues in epitopic part of the B.1.1.7 variant and the 484–492 residues in epitopic part of the B.1.351 variant. Similarly, we also found high sequence homology between the 497–505 epitopic part of the B.1.1.7 variant and 497–505 residues in epitopic part of the B.1.1.28/triple mutant variant. A substantial epitopic homology was observed between the 417–525 epitopic region of the B.1.351 variant and the 417–525 epitopic part of the B.1.1.28/triple mutant variant (Figure 5A). The evolutionary process is attributed to creating the epitopic homology of variants, which determines their antigenicity. However, epitopic homology was not observed in the Wuhan strain and B.1.429 variant (Figure 5B).

FIGURE 5
www.frontiersin.org

Figure 5. Comparative mapping of significant 9 mer CTL epitope of S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant. (A) Epitopes homology of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant. (B) A schematic diagram shows the similarity between the epitopes among B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant.

MSA using the identified CTL epitopes (9-mer, 15-mer, and 20-mer) of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

Sequence alignment of 9-mer, 15-mer, and 20-mer CTL epitopes of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants were examined. The 9-mer CTL epitopes showed four blocks (Figure 6A), whereas the 15-mer CTL epitopes showed two blocks (Figure 6B) and the 20-mer CTL epitopes showed one block (Figure 6C).

FIGURE 6
www.frontiersin.org

Figure 6. Epitope alignment of 9 mer, 15 mer, 20 mer CTL antigenic epitopes present in the S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant. (A) Epitope alignment shows four significant blocks from 9 mer CTL antigenic epitopes. (B) Epitope alignment shows two significant blocks from 15 mer CTL antigenic epitopes. (C) Epitope alignment shows one significant block from 20 mer CTL antigenic epitopes.

Cluster analysis of 9-mer CTL epitopes of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We performed a cluster analysis of the 9-mer CTL epitopes of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants at different threshold levels (10 to 80% level). We used 23 of the 9-mer CTL epitopes for cluster formation. We found that one dense cluster was formed at the 10% level (Figure 7A), which had a circular shape, and approximately 23 nodes were visible.

FIGURE 7
www.frontiersin.org

Figure 7. Cluster formation of 9 mer CTL epitopes of S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant using the threshold 10% level to 80% level. (A) Cluster formation at 10% level. (B) Cluster formation at 20% level. (C) Cluster formation at 30% level. (D) Cluster formation at 40% level. (E) Cluster formation at 50% level. (F) Cluster formation at 60% level. (G) Cluster formation at 70% level. (H) Cluster formation at 80% level.

We found that three clusters formed at the 20% level, containing one, seven, and 13 nodes (Figure 7B). We observed five prominent cluster formations at the 30% level, which had two, three, three, three, and four nodes (Figure 7C). We found one significant cluster at the 40% level that contained two nodes (Figure 7D), with the other nodes individually scattered. We found one crucial cluster creation at the 50% level, including two nodes (Figure 7E), and the other nodes were individually scattered. We observed one significant cluster at the 60% level, containing two nodes, with the other nodes individually scattered (Figure 7F). We evaluated the cluster formation at the 70% level, which showed one significant cluster, containing two nodes and several nodes individually scattered (Figure 7G). Finally, we evaluated the cluster formation at the 80% level, which showed one significant cluster containing two nodes and several nodes positioned in a scattered manner (Figure 7H).

Cluster analysis of 15-mer CTL epitopes of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We performed a cluster analysis of the 15-mer CTL epitopes of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants at different threshold levels (10 to 80% level). We examined 19 numbers of the 15-mer CTL epitopes for cluster formation. We found a dense cluster formation at the 10% level, with a circular shape, and approximately 16 nodes were visible (Supplementary Figure S1A).

We found one significant cluster at the 20% level, containing 15 nodes, and observed another single cluster with one node (Supplementary Figure S1B). We found three prominent clusters at the 30% level, containing three, three, and four nodes (Supplementary Figure S1C). We found four significant clusters at the 40% level, having three, three, three, and two, and the other nodes were individually scattered (Supplementary Figure S1D). At the 50% level, we identified four critical clusters containing three, three, two, and two nodes and several nodes individually scattered (Supplementary Figure S1E). We observed four major clusters at the 60% level, containing two, two, three, and three nodes and several other nodes were independently scattered (Supplementary Figure S1F). We evaluated the cluster formation at the 70% level, which showed four significant clusters containing three, two, two, and two nodes, and several nodes individually scattered (Supplementary Figure S1G). Finally, we evaluated the cluster formation at the 80% level, which showed four significant clusters, containing two, two, two, and three nodes, and numerous nodes separately situated in a scattered manner (Supplementary Figure S1H).

Cluster analysis of the 20-mer CTL epitopes of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

We performed cluster analysis of the 20-mer CTL epitopes of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants at different threshold levels (minimum sequence identity threshold 10 to 80% level). We examined 16 numbers of the 20-mer CTL epitopes for cluster formation. We found one dense cluster formation at the 10% level, which had a circular shape, and approximately 16 nodes were visible (Supplementary Figure S2A). All 20-mer CTL epitopes participated in the cluster formation.

We found two significant clusters at the 20% level, containing four and 12 nodes (Supplementary Figure S2B). We found four prominent clusters at the 30% level, including four, two, two, and three nodes, and the other nodes were individually scattered (Supplementary Figure S2C).

We found four significant clusters that were generated at the 40% level, containing two, two, two, and three nodes and some other nodes were individually scattered (Supplementary Figure S2D). We observed four significant clusters created at the 50% level, containing three, two, two, and two nodes and several nodes positioned individually in a scattered manner (Supplementary Figure S2E). We observed four major clusters at the 60% level, containing two, two, two, and three nodes and some other nodes were located independently in a scattered manner (Supplementary Figure S2F). We evaluated the cluster formation at the 70% level, which showed four significant clusters, containing two, two, two, and three nodes and several nodes were individually scattered (Supplementary Figure S2G). We evaluated the cluster formation at the 80% level, which showed four significant clusters, containing two, two, two, and two nodes and numerous single nodes were situated in a scattered manner (Supplementary Figure S2H).

Analysis of the CD spectra of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

A CD spectrum is essential for analyzing the proper secondary structure and fold detection of a protein (Micsonai et al., 2015). We generated the computer-simulated CD spectra of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. The spectrum of the viral glycoprotein of the Wuhan strain showed both positive and negative bands. The CD spectra showed a positive band at approximately 198 nm, comprising a maximum positive peak at about 190 nm, and showed a negative band from 198 nm to 225 nm with a maximum negative peak at 207–210 nm (Figure 8A). We analyzed the CD spectra of the B.1.351 variant, where we used the Wuhan strain as a control or the closest CD spectrum to the experimental protein (green spectra). Here, we used the S-glycoprotein of the B.1.351 variant as the experimental protein, and the generated spectrum is shown in red (Figure 8B). Both CD spectra showed the same pattern, with the maximum positive peak greater for the B.1.351 variant than for the Wuhan strain. The study evaluated the root-mean-square deviation (RMSD) variation of the control or closest CD spectrum and experimental CD spectrum (B.1.351 variant), which was 0.12 (Figure 9A; Supplementary Table S3). We analyzed the CD spectrum of the B.1.1.28/triple mutant variant using the same methodology as described above (Figure 8C). Both CD spectra (Wuhan strain and B.1.1.28/triple mutant variant) showed a similar pattern. The maximum positive peak was slightly more remarkable in the B.1.1.28/triple mutant variant than in the Wuhan strain, with an RMSD variation of 0.17 (Figure 9B). We generated the CD spectrum of the B.1.1.7 variant and compared it to the Wuhan strain (Figure 8D), and both CD spectra showed identical patterns. Maximum positive and maximum negative peaks were the same. The RMSD variation was 0.14 (Figure 9C). Finally, we generated the CD spectrum of the B.1.429 variant and compared it to the Wuhan strain (Figure 8E), with both CD spectra showing identical patterns. A lower negative peak was observed for the CD signal of the B.1.429 variant than the Wuhan strain, and the RMSD variation was 0.45 (Figure 9D). We also analyzed the secondary structure pattern of the Wuhan strain, B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants to calculate the beta-sheet (β-sheet) percentages, which were 26.21% (Supplementary Figure S3A), 24.84% (Supplementary Figure S3B), 24.84% (Supplementary Figure S3C), 24.46% (Supplementary Figure S3D), and 32.19%, respectively (Supplementary Figure S3E; Supplementary Table S4). Therefore, all proteins were β-sheet-rich. However, more β-sheets were found in B.1.429 than in the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, and B.1.1.7 variants (Supplementary Figure S4A). In addition, more β turns (9.71%) were found in B.1.429 than in the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, and B.1.1.7 variants (Supplementary Figure S4B).

FIGURE 8
www.frontiersin.org

Figure 8. CD spectra of S-glycoprotein of the Wuhan variant and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants. (A) CD spectra of S-glycoprotein of the Wuhan strain (B) Resemblance of CD spectra of S-glycoprotein of the Wuhan strain and B.1.351. (C) The resemblance of CD spectra of S-glycoprotein of the Wuhan strain and B.1.1.28/triple mutant. (D) The resemblance of CD spectra of S-glycoprotein of the Wuhan strain and B.1.1.7. (E) The resemblance of CD spectra of S-glycoprotein of the Wuhan strain and B.1.429.

FIGURE 9
www.frontiersin.org

Figure 9. Comparison of RMSD between the CD spectra of the S-glycoprotein Wuhan variant and other variants. (A) The resemblance of RMSD between the CD spectra of S-glycoprotein Wuhan strain and B.1.351. (B) The resemblance of RMSD between the CD spectra of S-glycoprotein Wuhan strain and B.1.1.28/triple mutant. (C) The resemblance of RMSD between the CD spectra of the S-glycoprotein Wuhan strain and B.1.1.7. (D) The resemblance of RMSD between the CD spectra of S-glycoprotein Wuhan strain and B.1.429.

The CD spectra of proteins were applied to evaluate proteins’ secondary structure pattern, binding properties, and folding shape. It is one of the rapid methods for assessing proteins’ secondary structure pattern, binding properties, and folding shape.

Parsons et al. (2019) have evaluated the RBD protein secondary structure of avian coronavirus by CD spectra analysis (Parsons et al., 2019). In this analysis, the spectral bandwidth has shown diverse values regarding nanometer scale, specific temperature, etc., of RBD proteins of glycosylation-site and non-glycosylation-site variants.

In our study, we applied the PDBMD2CD server to generate the in silico CD spectra (wavelength of the CD signal) using PDB files of S-glycoprotein of the Wuhan strain and significant variants.

The secondary structure pattern of S-glycoprotein of studied SARS-CoV-2 (Wuhan strain and other variants) features are stabilized by the amino acid residues turn tendencies as well as the cross-strand interactions among the sequences flanking the beta-turns. Researchers also showed that the specific β-turns of protein have a crucial role in interacting with the T-cell receptor (TCR), leading the MHCII-peptide-TCR complex to induce a suitable immune response (Bermudez et al., 2018). Subsequently, the beta-sheets also significantly impact on the managing the affinity of MHC class II binding sites in superantigens (Al-Shangiti et al., 2004). However, in the case of SARS-CoV-2 S-glycoprotein, no such work has been reported. But, our bioinformatics tools have shown the proper interactions of epitopic peptides (9-mer) from S-glycoprotein with TCR. These tools have established the vital role of beta-sheets and beta-turns in forming the SAg-like region.

3D model generation of the 9-mer CTL epitopic regions

3D models were created for all 9-mer CTL epitopes, and their epitopic regions were located in the S-glycoprotein. We generated four 3D models for the 9-mer CTL epitopes of the Wuhan strain: G181-L189, G311-R319, V382-L390, and V1060-1068 (Figure 10A); four 3D models for the B.1.351 variant: F79-N87, G215-L223, N417-L425, K484-L492, and V701-N709 (Figure 10B); two 3D models for the B.1.1.28/triple mutant variants: N417-L425 and F497-Y505 (Figure 10C); six 3D models for the B.1.1.7 variant: E481-L489, F497-Y505, R564-A572, E583-S591, F715-I723, and I1111-V1119 (Figure 10D); and three 3D models for the B.1.429 variant: F132-L140, C152-Y160, and R452-N460 (Figure 10E).

FIGURE 10
www.frontiersin.org

Figure 10. Generated three-dimensional (3D) model of 9 mer CTL epitopic regions of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants and their position in 3D model of S-glycoprotein. (A) 3D model of 9 mer CTL epitopic regions of the Wuhan strain and their position in 3D model of S-glycoprotein. (B) 3D model of 9 mer CTL epitopic regions of the B.1.351 variant and their position in 3D model of S-glycoprotein. (C) 3D model of 9 mer CTL epitopic regions of the B.1.1.28/triple mutant variant and their position in 3D model of S-glycoprotein. (D) 3D model of 9 mer CTL epitopic regions of the B.1.1.7 variant and their position in 3D model of S-glycoprotein. (E) 3D model of 9 mer CTL epitopic regions of the B.1.429 variant and their position in the 3D model of S-glycoprotein.

Mimic phage-displayed peptides for 9-mer CTL epitopes of SARS-CoV-2 S-glycoprotein

Previous studies have identified unique phage-displayed peptides that mimic different pathogen epitopes, such as Mycobacterium tuberculosis (Wang et al., 2016) and the hepatitis E virus (Larralde and Petrik, 2017). The present study aimed to determine the mimic 9-mer CTL epitopes from the S-glycoprotein of SARS-CoV-2 from phage-displayed peptides. A random peptide library was scanned, and the selected peptides were supposed to mimic the 9-mer CTL epitopes of S-glycoprotein of this virus, which showed a list of clusters in the Wuhan, B.1.351, B.1.1.28/triple mutant, and B.1.1.7 variants. The Wuhan strain showed three mimic clusters with four different 9-mer epitopes (Figure 11A). The first mimic cluster was formed with two 9-mer epitopes (VVFLHVTYV and GIYQTSNFR), the second mimic cluster was formed with one 9-mer epitope (VSPTKLNDL), and the third mimic cluster was formed with one 9-mer epitope (GKQGNFKNL).

FIGURE 11
www.frontiersin.org

Figure 11. Phage-displayed peptides that mimic S-glycoprotein of SARS-CoV-2 9 mer CTL epitopes of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants. (A) Phage-displayed peptides that mimic S-glycoprotein of 9 mer CTL epitopes of the Wuhan strain and the cluster epitopes. (B) Phage-displayed peptides that mimic S-glycoprotein of 9 mer CTL epitopes of the B.1.351 variant and the cluster epitopes. (C) Phage-displayed peptides that mimic S-glycoprotein of 9 mer CTL epitopes of the B.1.1.28/triple mutant variant and the cluster epitopes. (D) Phage-displayed peptides that mimic S-glycoprotein of 9 mer CTL epitopes of B.1.1.7 and the cluster epitopes.

The B.1.351 variant showed three mimic clusters with four 9-mer epitopes (Figure 11B). The first mimic cluster consisted of two 9-mer epitopes (GLPQGFSAL and KGFNCYFP), the second was formed with one 9-mer epitope (VENSVAYSN), and the third included one 9-mer epitope (FANPVLPFN).

The B.1.1.28/triple mutant variant showed three mimic clusters with three different 9-mer epitopes (Figure 11C). The first mimic cluster consisted of one 9-mer epitope (FQPTYGVGY), the second was formed with one 9-mer epitope (SQCVNLTTR), and the third was formed with one 9-mer epitope (SPGSASSVA).

Finally, the B.1.1.7 variant showed three mimic clusters with four 9-mer epitopes (Figure 11D). The first mimic cluster had two 9-mer epitopes (EILDITPCS and RDIDDTTDA), the second was formed with one 9-mer epitope (FQPTYGVGY), and the third consisted of one 9-mer epitope (IITTHNTFV).

SAg-like region from S-glycoprotein of SARS-CoV-2 of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

Following Cheng et al. (2020) study, we developed a 3D structural model of the SAg-like part of the S-glycoprotein of SARS-CoV-2 of the Wuhan strain (Figure 12A; Baú et al., 2006; Cheng et al., 2020). We also developed a 3D structure model using the same region (SAg-like region) from the S-glycoprotein of this virus of the different variants, i.e., B.1.351 (Figure 12B), B.1.1.28/triple mutant (Figure 12C), B.1.1.7 (Figure 12D), and B.1.429 (Figure 12E).

FIGURE 12
www.frontiersin.org

Figure 12. 3D model of SAg-like region from S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7 and B.1.429 variants. (A) 3D model of SAg-like region from S-glycoprotein of Wuhan strain. (B) 3D model of SAg-like region from S-glycoprotein of the B.1.351 variant which was generated same SAg-like region from S-glycoprotein of Wuhan strain. (C) 3D model of SAg-like region from S-glycoprotein of the B.1.1.28/triple mutant variant which was generated same SAg-like region from S-glycoprotein of Wuhan strain. (D) 3D model of SAg-like region from S-glycoprotein of the B.1.1.7 variant which was generated same SAg-like region from S-glycoprotein of Wuhan strain. (E) 3D model of SAg-like region from S-glycoprotein of the B.1.429 variant which was generated same SAg-like region from S-glycoprotein of Wuhan strain.

Interaction between SAg-like region of the viral glycoprotein and T cell receptor (TCR)

The present study illustrated the interaction with our developed 3D structural model of the SAg-like region of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants with human TCR. These interactions can be seen in Figures 13AE and showed that the SAg-like region interacted with the beta chain of the human TCR.

FIGURE 13
www.frontiersin.org

Figure 13. Interaction between SAg-like region from S-glycoprotein of Wuhan strain, B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants and TCR. (A) Interaction between SAg-like region of Wuhan strain and TCR. (B) Interaction between SAg-like part of B.1.351 variant and TCR. (C) Interaction between SAg-like part of the B.1.1.28/triple mutant variant and TCR. (D) Interaction between the SAg-like part of the B.1.1.7 variant and TCR. (E) Interaction between the SAg-like part of the B.1.429 variant and TCR.

Identification of the partial SAg-like part of the viral S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants

Sequence alignment was performed using significant SAgs [α-cobra toxin (N. naja), α-bungarotoxin, rabies virus G protein (189–199), and α-cobra toxin (N. kaouthia)], and HIV-1 gp120 (164–174), Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. We observed a partial SAg-like part (ANQFNSAIGKI) of the S-glycoprotein of this virus in the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants, as depicted in Figure 14A.

FIGURE 14
www.frontiersin.org

Figure 14. Another partial SAg-like region from S-glycoprotein of SARS-CoV-2 of Wuhan strain, B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants shows the sequence similarity with an α-cobra toxin (Naja naja),α-bungarotoxin, Rabies Virus G Protein (189–199), α-cobra toxin (Naja kaouthia), and HIV-1 gp120 (164–174) and their cluster formation. (A) Sequence similarity of superantigens (α-cobra toxin (Naja naja),α-bungarotoxin, Rabies Virus G Protein (189–199), α-cobra toxin (N. kaouthia), and HIV-1 gp120 (164–174)), Wuhan variant, and significant VOCs/VOI (B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429) shows a partial super SAg-like region. (B) Cluster (with these MSA sequences) at 30% level. (C) Cluster (with these MSA sequences) at 40% level. (D) Cluster (with these MSA sequences) at 70% level.

We developed a cluster with these MSA sequences at the 30, 40, and 70% levels. We found one cluster with all sequences at the 30% level (Figure 14B). However, the SAg-like region was detached from the central cluster at 40% (Figure 14C) and 70% levels (Figure 14D).

Discussion

MHC-I-restricted CTLs play a significant role in controlling viral infections. CD8+ T cells perform a crucial role in the clearance of infections of this virus (acute model) in the lungs (Schmidt and Varga, 2018) and are activated and differentiated in patients with severe COVID-19 (Chen and Wherry, 2020). Bange et al. (2021) concluded that CD8+ T cells might help with the survival of patients with COVID-19 (Bange et al., 2021). Therefore, identifying CTL epitopes is essential for understanding the T cell activation mechanism and assisting epitope-driven vaccine design.

The identification and characterization of the CTL 9-mer epitope have a substantial importance in medical immunology. The identification of peptide epitopes can be helpful in the production of vaccines against microorganisms that have very little growth in the medium, as well as microorganisms whose antigenic regions are not precisely detected during the typical immune system infection (Andreatta and Nielsen, 2018; Rencilin et al., 2021). Therefore, finding considerable numbers of CTL epitopes is a necessary process for vaccine design, and those CTL epitopes consist of the binding motifs of the known class I MHC. Still, even the epitopes do not conform precisely to consensus motifs or vary in length. To do this with conventional peptide synthesizers is costly and impractical because of the large number of peptides, and a considerable amount of testing is required. Therefore, these advanced strategies for epitope mapping offer a more precise solution than the initial CTL epitope screening experiments rather than other aspects (Soria-Guerra et al., 2015).

In the present study, CTL epitopic regions of S protein were analyzed using antigenic scores. We assessed the antigenicity of the wild type (Wuhan strain) and mutant S-glycoprotein of four variants (B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429). The analysis of 9-mer CTL epitope regions of S-glycoprotein showed that the B.1.1.7 variant had an elevated antigenicity compared to other variants and the Wuhan strain for the 9-mer CTL epitopes and average CTL epitopes (average number of three epitope types). The RBD region also showed the highest numbers of 9-mer CTL epitope and 15-mer CTL epitope and an average number of CTL epitopes for the B.1.1.7 variant.

CTL 9-mer epitope sequences in the viral protein can be identified using the IEDB server. This is a standard method (Islam et al., 2020). The selected peptide size (9 amino acids) has a potential binding capacity to MHC-I rather than the 15-mer and 20-mer epitopes because these are optimal stimulators for CD8 response and can interact with several HLA-A alleles, which has a considerable binding affinity (Li Pira et al., 2010; Shi et al., 2015). Therefore, our study performed the identification and characterization of the CTL 9-mer epitope of the Wuhan strain and four mutative variants, which has immense importance.

We have used the IEDB epitope cluster analysis tool (Dhanda et., 2018) for the cluster analysis of CTL epitopes (9-mer, 15-mer, and 20-mer) of S-glycoprotein of the Wuhan strain and the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. This tool groups epitopes into clusters based on sequence identity. A cluster is a group of sequences with a sequence similarity more incredible than the minimum sequence identity threshold specified. Our manuscript represents the CTL epitopic peptides as circles, and the solid line connecting two peptides is identified above the specified threshold. All the connected peptides are in a cluster. Here, all the peptides homologous to a certain pre-specified level are clustered together, for example, at the 20% level. In this case, any member of the cluster will be at least 20% homologous to at least one member. However, the approach’s shortcoming is that cluster members might and often are related by levels of homology much lower than 20%. As a result, the cluster provides a clear consensus sequence. Conversely, the singletons are isolated peptides, and those do not share any sequence identity with any other peptides in the given data. However, using this approach, in our work, we have analyzed and offered from a 10% threshold level of sequence identity to the 80% level of CTL epitopes (9-mer, 15-mer, and 20-mer) of mentioned SARS-CoV-2 Wuhan strain and other significant variants.

During cluster formation of 9-mer CTL epitopes, we found one significant cluster formation at the 70% level with two nodes (KGFNCYFPL and EGFNCYFPL), and the other nodes were scattered. Consequently, they did not form a dense cluster at the 70% level and showed a high level of diversity. We found four significant clusters in the 15-mer CTL epitopes and four multiple clusters formed in the 20-mer CTL epitopes at the 70% level. Considering the cluster formation of the 9-mer, 15-mer, and 20-mer CTL epitopes, the 9-mer CTL epitopes showed uniqueness in their cluster formation pattern. These epitopes, which have formed the clusters in higher percentage levels (such as 70% or 80%), might have a similar kind of nature regarding antigenicity and their antigen processing. However, we need to confirm the hypothesis through further experiments. Therefore, understanding the method of cluster formation is of enormous medical significance. Recently, Meckiff et al. (2020) used the cluster formation technique and the epitope mega pool of peptide design for this virus during the COVID-19 infection (Meckiff et al., 2020).

The CD spectra of the S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, and B.1.1.7 variants showed almost the same band pattern. On the other hand, we have found a slight difference in the CD spectra of S-glycoprotein of the B.1.429 variant compared to the Wuhan strain. This difference occurred because of the spectral contrast of the B.1.429 variant and Wuhan strain. After comparing the closest and experimental CD spectra, the RMSD variation was calculated. The Wuhan strain and the B.1.351, B.1.1.28/triple mutant, and B.1.1.7 variants had minor differences (0.12–0.17 range). However, the RMSD variation was 0.45 for the B.1.429 variant. A higher RMSD variation occurred because of more spectral differences than the control. This result is in corroboration with the results of Li and Hirst (Li and Hirst, 2020), who attempted to develop a model of SARS-CoV-2 proteins using CD spectra analysis.

The present study found three phage-displayed peptides that mimicked the 9-mer CTL epitopes from the S-glycoprotein of SARS-CoV-2 (Wuhan strain, and B.1.351, B.1.1.28/triple mutant, and B.1.1.7 variants). Previous studies have identified mimic phage-displayed peptides similar to CTL epitopes of SARS-CoV-2, which have immense importance for epitope-based immunotherapy, diagnostics, and vaccine development (Bazan et al., 2012; Wu et al., 2016). Recently, Guo et al. (2021) characterized different B cell epitopes from this virus using a peptide library (phage-displayed; Guo et al., 2021).

Finally, we attempted to understand the interaction pattern of the SAg-like region within the Wuhan strain and TCR, following Cheng et al. (Cheng et al., 2020). An identical portion of the SAg-like part of the Wuhan strain was searched from the B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. The hyperinflammatory response in COVID-19 patients is a significant area of research. We developed a 3D model of these SAg-like regions and determined their interaction pattern with TCR. A proper understanding of hyper inflammation in severe COVID-19 cases may help solve the mystery of the death of these patients (García, 2020; Tay et al., 2020). Our study allows us to illustrate the TCR and downstream regulatory pathways for the hyperinflammatory response in COVID-19 patients. Our MSA analysis found another partial super SAg-like region (ANQFNSAIGKI) from the S-glycoprotein of SARS-CoV-2 in the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants.

The study of the insight into the antigenicity of SARS-CoV-2 variants has fundamental importance. It helps us illustrate the dynamic interaction between SARS-CoV-2 variants with humans or other hosts and also allows us to understand the immunopathogenesis of SARS-CoV-2 variants. Understanding the antigenic variations has immense importance. Antigenic variation might be a significant factor in comparing the antigenicity among the virus variants. Antigenic variation can help us to understand the virus fitness. At the same time, it can also explain how a virus can reinfect hosts by escaping the immune memory. Antigenic variations, diversification, conservation, and the total number of antigens in a particular virus protein need to be studied. However, antigenic variations have been studied in different viruses from time to time (Zost et al., 2019). The antigenic variations have also been studied in all human coronaviruses and SARS-CoV-2 (Kumar et al., 2020). Researchers have also investigated antigenic variations in spike protein in SARS-CoV-2 (Harvey et al., 2021). Some researchers have tried to illustrate the antigenic variations in SARS-CoV-2 variants (Mittal et al., 2022).

However, the total number of epitopes in a spike protein might be one criterion to determine the antigenicity comparing the SARS-COV-2 and SARS-CoV (Zheng and Song, 2020). Here we compared the total number of CTL epitopes in variants of SARS-CoV-2. Our study showed that B.1.1.7 (Alpha) of spike protein displayed the highest number of CTL epitopes compared to Wuhan strain, B.1.351 (Beta), B.1.1.28/triple mutant (P.1), and B.1.429 (epsilon). However, the surface accessibility of those epitopes is also a significant factor in interacting nAb (neutralizing antibody), and a need of further study for these variants. At the same time, there is also a need to understand how antigenic epitopes are presented to the T cell by MHC (major histocompatibility complex) molecules (both class I and class II molecules; Forni et al., 2021).

Limitations of the study

The study has performed a comprehensive analysis of the CTL epitopes or SAg-like region of the total S-glycoprotein of the Wuhan strain, B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants and predicted the antigenicity through computational biology. The analyses were performed in various directions. Every data accumulated through the bioinformatics methods for Wuhan strain and emerging mutant variants is essential for future researchers and society. The arrival of emerging variants during the pandemic made the pandemic period more critical. At this point of urgency, our data predicting the potent antigenicity through bioinformatics tools which is very significant and highly beneficial for society. However, data acquired by us needs further validation through in vitro and in vivo methods, which is a limitation of this study. Therefore, we urgently urge future researchers to validate our data to end the pandemic crisis and prepare for a future pandemic.

Conclusion

Several studies have focused on the identification and characterization of immunogenic CD8+ T cell epitopes. MHC-I molecules naturally contain 8-aa to 11-aa length peptide chains, which is the main consequence of the proteasomal degradation of antigens. This antigen is derived from infection or self-peptides (Blum et al., 2013). Our 9-mer CTL epitopic study helped identify the primary mechanism of antigen processing of the S-glycoprotein of SARS-CoV-2 via the MHC-I molecules of CD8+ T cells.

This study initiated several questions regarding the antigen processing of the S-glycoprotein of this virus via MHC-I molecules for CD8+ T cells. The researcher should solve these questions. We should also know about the antigen processing of the S-glycoprotein of the Wuhan strain and other emerging VOCs/VOIs such as B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants. At the same time, it is also necessary to understand more about the peptide–MHC-I complexes formation for the Wuhan strain and other emerging VOCs/VOIs such as B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants.

Different scientists have developed immunogenicity models for CTL epitopes (Harndahl et al., 2012; Calis et al., 2013). However, specific immunogenicity models for CTL epitopes of SARS-CoV-2 S-glycoprotein remain to be developed.

We identified the highest virulent epitopes with a significant mutation within the S-glycoprotein of SARS-CoV-2 emerging VOCs/VOIs (B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variants) and the Wuhan strain. Our results showed that the mutations might be responsible for the high antigenicity identified in the B.1.1.7 variant. This variant might help activate more CD8+  T cells via the MHC-I pathway than in the South African, Brazilian, and Wuhan variants. It might help activate more CD4+ T cells via the MHC-II pathway and cytokine signaling. In the future, the molecular mechanism by which CD8+ T cells distinguish antigens from immunogenic property antigens of S-glycoprotein of wild strain and their variants should be evaluated.

Data availablity statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Author contributions

CC: conceptualization, data acquisition, and writing–original draft. MB and ARS: formal analysis, validation, and reviewing and editing. BM and S-SL: formal analysis and validation. CC and E-MS: supervision and funding. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by the Hallym University Research Fund and the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education (NRF-2020R1C1C1008694 and NRF-2020R1I1A3074575).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.895695/full#supplementary-material

SUPPLEMENTARY FIGURE S1 | Cluster formation of 15 mer CTL epitopes of S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant using the threshold 10% level to 80% level. (A) Cluster formation at 10% level. (B) Cluster formation at 20% level. (C) Cluster formation at 30% level. (D) Cluster formation at 40% level. (E) Cluster formation at 50% level. (F) Cluster formation at 60% level. (G) Cluster formation at 70% level. (H) Cluster formation at 80% level.

SUPPLEMENTARY FIGURE S2 | Cluster formation of 20 mer CTL epitopes of S-glycoprotein of Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, and B.1.429 variant using the threshold 10% level to 80% level. (A) Cluster formation at 10% level. (B) Cluster formation at 20% level. (C) Cluster formation at 30% level. (D) Cluster formation at 40% level. (E) Cluster formation at 50% level. (F) Cluster formation at 60% level. (G) Cluster formation at 70% level. (H) Cluster formation at 80% level.

SUPPLEMENTARY FIGURE S3 | Percentage of secondary structure component (α-helix and β-sheet) calculated from CD spectra of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants. (A) Percentage of α-helix and β-sheet of S-glycoprotein of the Wuhan variant. (B) Percentage of α-helix and β-sheet of S-glycoprotein of the B.1.351 variant. (C) Percentage of α-helix and β-sheet of S-glycoprotein of the B.1.1.28/triple mutant variant. (D) Percentage of α-helix and β-sheet of S-glycoprotein of the B.1.1.7 variant. (E) Percentage of α-helix and β-sheet of S-glycoprotein of the B.1.429 variant.

SUPPLEMENTARY FIGURE S4 | Comparison of α-helix and β-sheet and β turns of S-glycoprotein of the Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants. (A) Comparison of α-helix and β-sheet of S-glycoprotein among Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants. (B) Comparison of β turns of S-glycoprotein among Wuhan strain and B.1.351, B.1.1.28/triple mutant, B.1.1.7, B.1.429 variants.

Abbreviations

SARS-CoV-2, Severe acute respiratory syndrome coronavirus 2; COVID-19, Coronavirus disease 2019; RBD, Receptor-binding domain; S-protein, Spike glycoprotein; CTL, Cytotoxic T lymphocytes; CD, Circular dichroism; TCR, T-cell antigen receptor; VOC, Variants of concern.; VOI, Variants of interest; MSA, Multiple sequence alignment; SAg, Superantigen; RMSD, Root-mean-square deviation; MHC, Major histocompatibility complex; HLA, Human leukocyte antigens; PDB, Protein Data Bank; HMMs, Hidden Markov models.

References

Al-Shangiti, A. M., Naylor, C. E., Nair, S. P., Briggs, D. C., Henderson, B., and Chain, B. M. (2004). Structural relationships and cellular tropism of staphylococcal superantigen-like proteins. Infect. Immun. 72, 4261–4270. doi: 10.1128/IAI.72.7.4261-4270.2004

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, C. S., McCall, P. R., Stern, H. A., Yang, H., and Topham, D. J. (2018). Antigenic cartography of H1N1 influenza viruses using sequence-based antigenic distance calculation. BMC Bioinformatics 19, 1–11. doi: 10.1186/s12859-018-2042-4

CrossRef Full Text | Google Scholar

Andreatta, M., and Nielsen, M. (2018). Bioinformatics tools for the prediction of T-cell epitopes. Methods Mol. Biol. 1785, 269–281. doi: 10.1007/978-1-4939-7841-0_18

CrossRef Full Text | Google Scholar

Bange, E. M., Han, N. A., Wileyto, P., Kim, J. Y., Gouma, S., Robinson, J., et al. (2021). CD8+ T cells contribute to survival in patients with COVID-19 and hematologic cancer. Nat. Med. 27, 1280–1289. doi: 10.1038/s41591-021-01386-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Baú, D., Martin, A. J., Mooney, C., Vullo, A., Walsh, I., and Pollastri, G. (2006). Distill: a suite of web servers for the prediction of one-, two-and three-dimensional structural features of proteins. BMC Bioinformatics 7, 1–8. doi: 10.1186/1471-2105-7-402

CrossRef Full Text | Google Scholar

Bazan, J., Całkosiński, I., and Gamian, A. (2012). Phage display—A powerful technique for immunotherapy: 1. Introduction and potential of therapeutic applications. Hum. Vaccin. Immunother. 8, 1817–1828. doi: 10.4161/hv.21703

PubMed Abstract | CrossRef Full Text | Google Scholar

Bermudez, A., Alba, M. P., Vanegas, M., Patarroyo, M. A., and Patarroyo, M. E. (2018). Specific β-turns precede PPIIL structures binding to allele-specific HLA-DRβ1* PBRs in fully-protective malaria vaccine components. Front. Chem. 6:106. doi: 10.3389/fchem.2018.00106

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhatnager, R., Bhasin, M., Arora, J., and Dang, A. S. (2020). Epitope based peptide vaccine against SARS-COV2: an immune-informatics approach. J. Biomol. Struct. Dyn. 39, 5690–5705. doi: 10.1080/07391102.2020.1787227

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhattacharya, M., Sharma, A. R., Ghosh, P., Lee, S.-S., and Chakraborty, C. (2021). A next-generation vaccine candidate using alternative epitopes to protect against Wuhan and all significant mutant variants of SARS-CoV-2: an immunoinformatics approach. Aging Dis. 12, 2173–2195. doi: 10.14336/AD.2021.0518

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhattacharya, M., Sharma, A. R., Patra, P., Ghosh, P., Sharma, G., Patra, B. C., et al. (2020a). Development of epitope-based peptide vaccine against novel coronavirus 2019 (SARS-COV-2): immunoinformatics approach. J. Med. Virol. 92, 618–631. doi: 10.1002/jmv.25736

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhattacharya, M., Sharma, A. R., Patra, P., Ghosh, P., Sharma, G., Patra, B. C., et al. (2020b). A SARS-CoV-2 vaccine candidate: In-silico cloning and validation. Inform. Med. Unlocked 20:100394. doi: 10.1016/j.imu.2020.100394

PubMed Abstract | CrossRef Full Text | Google Scholar

Blum, J. S., Wearsch, P. A., and Cresswell, P. (2013). Pathways of antigen processing. Annu. Rev. Immunol. 31, 443–473. doi: 10.1146/annurev-immunol-032712-095910

PubMed Abstract | CrossRef Full Text | Google Scholar

Brüssow, H. (2021). COVID-19: emergence and mutational diversification of SARS-CoV-2. Microb. Biotechnol. 14, 756–768. doi: 10.1111/1751-7915.13800

PubMed Abstract | CrossRef Full Text | Google Scholar

Calis, J. J., Maybeno, M., Greenbaum, J. A., Weiskopf, D., De Silva, A. D., Sette, A., et al. (2013). Properties of MHC class I presented peptides that enhance immunogenicity. PLoS Comput. Biol. 9:e1003266. doi: 10.1371/journal.pcbi.1003266

PubMed Abstract | CrossRef Full Text | Google Scholar

Cele, S., Gazy, I., Jackson, L., Hwa, S.-H., Tegally, H., Lustig, G., et al. (2021). Escape of SARS-CoV-2 501Y. V2 from neutralization by convalescent plasma. Nature 593, 142–146. doi: 10.1038/s41586-021-03471-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Chakraborty, C., Bhattacharya, M., and Sharma, A. R. (2021a). Present variants of concern and variants of interest of severe acute respiratory syndrome coronavirus 2: their significant mutations in S-glycoprotein, infectivity, re-infectivity, immune escape and vaccines activity. Rev. Med. Virol. 32:e2270. doi: 10.1002/rmv.2270

CrossRef Full Text | Google Scholar

Chakraborty, C., Bhattacharya, M., Sharma, A. R., Lee, S.-S., and Agoramoorthy, G. (2021b). SARS-CoV-2 Brazil variant in Latin America: more serious research urgently needed on public health and vaccine protection. Ann. Med. Sur. 66:102428. doi: 10.1016/j.amsu.2021.102428

PubMed Abstract | CrossRef Full Text | Google Scholar

Chakraborty, C., Sharma, A. R., Bhattacharya, M., Agoramoorthy, G., and Lee, S.-S. (2021c). Evolution, mode of transmission, and mutational landscape of newly emerging SARS-CoV-2 variants. MBio 12, e01140–e01121. doi: 10.1128/mBio.01140-21

CrossRef Full Text | Google Scholar

Chakraborty, C., Sharma, A., Sharma, G., Bhattacharya, M., and Lee, S. (2020). SARS-CoV-2 causing pneumonia-associated respiratory disorder (COVID-19): diagnostic and proposed therapeutic options. Eur. Rev. Med. Pharmacol. Sci. 24, 4016–4026. doi: 10.26355/eurrev_202004_20871

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Z., and Wherry, E. J. (2020). T cell responses in patients with COVID-19. Nat. Rev. Immunol. 20, 529–536. doi: 10.1038/s41577-020-0402-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, M. H., Zhang, S., Porritt, R. A., Rivas, M. N., Paschold, L., Willscher, E., et al. (2020). Superantigenic character of an insert unique to SARS-CoV-2 spike supported by skewed TCR repertoire in patients with hyperinflammation. Proc. Natl. Acad. Sci. U. S. A 117, 25254–25262. doi: 10.1073/pnas.2010722117

PubMed Abstract | CrossRef Full Text | Google Scholar

Davies, N. G., Abbott, S., Barnard, R. C., Jarvis, C. I., Kucharski, A. J., Munday, J. D., et al. (2021). Estimated transmissibility and impact of SARS-CoV-2 lineage B. 1.1.7 in England. Science 372, eabg3055. doi: 10.1126/science.abg3055

PubMed Abstract | CrossRef Full Text | Google Scholar

De Vries, S. J., Van Dijk, M., and Bonvin, A. M. (2010). The HADDOCK web server for data-driven biomolecular docking. Nat. Protoc. 5, 883–897. doi: 10.1038/nprot.2010.32

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhanda, S. K., Vaughan, K., Schulten, V., Grifoni, A., Weiskopf, D., Sidney, J., et al. (2018). Development of a novel clustering tool for linear peptide sequences. Immunology 155, 331–345. doi: 10.1111/imm.12984

PubMed Abstract | CrossRef Full Text | Google Scholar

Dinesh, D. C., Chalupska, D., Silhan, J., Koutna, E., Nencka, R., Veverka, V., et al. (2020). Structural basis of RNA recognition by the SARS-CoV-2 nucleocapsid phosphoprotein. PLoS Pathog. 16:e1009100. doi: 10.1371/journal.ppat.1009100

PubMed Abstract | CrossRef Full Text | Google Scholar

Doi, H., Hiroishi, K., Shimazaki, T., Eguchi, J., Baba, T., Ito, T., et al. (2009). Magnitude of CD8+ T-cell responses against hepatitis C virus and severity of hepatitis do not necessarily determine outcomes in acute hepatitis C virus infection. Hepatol. Res. 39, 256–265. doi: 10.1111/j.1872-034X.2008.00459.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Doytchinova, I. A., and Flower, D. R. (2007). VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics 8, 1–7. doi: 10.1186/1471-2105-8-4

CrossRef Full Text | Google Scholar

Drew, E. D., and Janes, R. W. (2020). PDBMD2CD: providing predicted protein circular dichroism spectra from multiple molecular dynamics-generated protein structures. Nucleic Acids Res. 48, W17–W24. doi: 10.1093/nar/gkaa296

PubMed Abstract | CrossRef Full Text | Google Scholar

Faria, N. R., Claro, I. M., Candido, D., Moyses Franco, L., Andrade, P. S., Coletti, T. M., et al. (2021). Genomic characterisation of an emergent SARS-CoV-2 lineage in manaus: preliminary findings. Version 1. medRxiv [Preprint]. doi: 10.1101/2021.02.26.21252554

CrossRef Full Text | Google Scholar

Feliu, V., Vasseur, V., Grover, H. S., Chu, H. H., Brown, M. J., Wang, J., et al. (2013). Location of the CD8 T cell epitope within the antigenic precursor determines immunogenicity and protection against the toxoplasma gondii parasite. PLoS Pathog. 9:e1003449. doi: 10.1371/journal.ppat.1003449

PubMed Abstract | CrossRef Full Text | Google Scholar

Focosi, D., Tuccori, M., Baj, A., and Maggi, F. (2021). SARS-CoV-2 variants: a synopsis of In vitro efficacy data of convalescent plasma, currently marketed vaccines, and monoclonal antibodies. Viruses 13, 1211. doi: 10.3390/v13071211

PubMed Abstract | CrossRef Full Text | Google Scholar

Forni, D., Cagliani, R., Pontremoli, C., Mozzi, A., Pozzoli, U., Clerici, M., et al. (2021). Antigenic variation of SARS-CoV-2 in response to immune pressure. Mol. Ecol. 30, 3548–3559. doi: 10.1111/mec.15730

PubMed Abstract | CrossRef Full Text | Google Scholar

Gangaev, A., Ketelaars, S. L., Isaeva, O. I., Patiwael, S., Dopler, A., Hoefakker, K., et al. (2021). Identification and characterization of a SARS-CoV-2 specific CD8+ T cell response with immunodominant features. Nat. Commun. 12, 1–14. doi: 10.1038/s41467-021-22811-y

CrossRef Full Text | Google Scholar

García, L. F. (2020). Immune response, inflammation, and the clinical spectrum of COVID-19. Front. Immunol. 11:1441. doi: 10.3389/fimmu.2020.01441

PubMed Abstract | CrossRef Full Text | Google Scholar

Greenfield, N. J. (2006). Using circular dichroism spectra to estimate protein secondary structure. Nat. Protoc. 1, 2876–2890. doi: 10.1038/nprot.2006.202

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, J.-Y., Liu, I.-J., Lin, H.-T., Wang, M.-J., Chang, Y.-L., Lin, S.-C., et al. (2021). Identification of COVID-19 B-cell epitopes with phage-displayed peptide library. J. Biomed. Sci. 28, 1–13. doi: 10.1186/s12929-021-00740-8

CrossRef Full Text | Google Scholar

Harndahl, M., Rasmussen, M., Roder, G., Dalgaard Pedersen, I., Sørensen, M., Nielsen, M., et al. (2012). Peptide-MHC class I stability is a better predictor than peptide affinity of CTL immunogenicity. Eur. J. Immunol. 42, 1405–1416. doi: 10.1002/eji.201141774

PubMed Abstract | CrossRef Full Text | Google Scholar

Harvey, W. T., Carabelli, A. M., Jackson, B., Gupta, R. K., Thomson, E. C., Harrison, E. M., et al. (2021). SARS-CoV-2 variants, spike mutations and immune escape. Nat. Rev. Microbiol. 19, 409–424. doi: 10.1038/s41579-021-00573-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Huai, Z., Tong, Z., Mei, Y., and Mo, Y. (2021). Theoretical study of the spectral differences of the Fenna–Matthews–Olson protein from different species and their mutants. J. Phys. Chem. B 125, 8313–8324. doi: 10.1021/acs.jpcb.1c01686

PubMed Abstract | CrossRef Full Text | Google Scholar

Islam, M. S. B., Miah, M., Hossain, M. E., and Kibria, K. K. (2020). A conserved multi-epitope-based vaccine designed by targeting hemagglutinin protein of highly pathogenic avian H5 influenza viruses. 3 Biotech 10, 1–16. doi: 10.1007/s13205-020-02544-3

CrossRef Full Text | Google Scholar

Kar, T., Narsaria, U., Basak, S., Deb, D., Castiglione, F., Mueller, D. M., et al. (2020). A candidate multi-epitope vaccine against SARS-CoV-2. Sci. Rep. 10, 1–24. doi: 10.1038/s41598-020-67749-1

CrossRef Full Text | Google Scholar

Kozakov, D., Hall, D. R., Xia, B., Porter, K. A., Padhorny, D., Yueh, C., et al. (2017). The ClusPro web server for protein–protein docking. Nat. Protoc. 12, 255–278. doi: 10.1038/nprot.2016.169

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, S., Maurya, V. K., Prasad, A. K., Bhatt, M. L., and Saxena, S. K. (2020). Structural, glycosylation and antigenic variation between 2019 novel coronavirus (2019-nCoV) and SARS coronavirus (SARS-CoV). Virus 31, 13–21. doi: 10.1007/s13337-020-00571-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Lafon, M. (2000). Les superantigènes viraux. Rev. Med. Interne 21, 713–716. doi: 10.1016/S0248-8663(00)80034-X

PubMed Abstract | CrossRef Full Text | Google Scholar

Larralde, O., and Petrik, J. (2017). Phage-displayed peptides that mimic epitopes of hepatitis E virus capsid. Med. Microbiol. Immunol. 206, 301–309. doi: 10.1007/s00430-017-0507-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Z., and Hirst, J. D. (2020). Computed optical spectra of SARS-CoV-2 proteins. Chem. Phys. Lett. 758:137935. doi: 10.1016/j.cplett.2020.137935

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Li, X., and He, D. (2019). A directed acyclic graph network combined with CNN and LSTM for remaining useful life prediction. IEEE Access 7, 75464–75475. doi: 10.1109/ACCESS.2019.2919566

CrossRef Full Text | Google Scholar

Li, J., and Liu, W. (2020). Puzzle of highly pathogenic human coronaviruses (2019-nCoV). Protein Cell 11, 235–238. doi: 10.1007/s13238-020-00693-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, R., Ma, X., Deng, J., Chen, Q., Liu, W., Peng, Z., et al. (2021). Differential efficiencies to neutralize the novel mutants B. 1.1.7 and 501Y. V2 by collected sera from convalescent COVID-19 patients and RBD nanoparticle-vaccinated rhesus macaques. Cell. Mol. Immunol. 18, 1058–1060. doi: 10.1038/s41423-021-00641-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Li Pira, G., Ivaldi, F., Moretti, P., and Manca, F. (2010). High throughput T epitope mapping and vaccine development. J. Biomed. Biotechnol. 2010:325720. doi: 10.1155/2010/325720

CrossRef Full Text | Google Scholar

Liu, J., Sun, Y., Qi, J., Chu, F., Wu, H., Gao, F., et al. (2010). The membrane protein of severe acute respiratory syndrome coronavirus acts as a dominant immunogen revealed by a clustering region of novel functionally and structurally defined cytotoxic T-lymphocyte epitopes. J. Infect. Dis. 202, 1171–1180. doi: 10.1086/656315

PubMed Abstract | CrossRef Full Text | Google Scholar

Mayrose, I., Penn, O., Erez, E., Rubinstein, N. D., Shlomi, T., Freund, N. T., et al. (2007). Pepitope: epitope mapping from affinity-selected peptides. Bioinformatics 23, 3244–3246. doi: 10.1093/bioinformatics/btm493

PubMed Abstract | CrossRef Full Text | Google Scholar

Meckiff, B. J., Ramírez-Suástegui, C., Fajardo, V., Chee, S. J., Kusnadi, A., Simon, H., et al. (2020). Imbalance of regulatory and cytotoxic SARS-CoV-2-reactive CD4+ T cells in COVID-19. Cell 183, 1340–1353.e16. doi: 10.1016/j.cell.2020.10.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, B., Kemp, S. A., Papa, G., Datir, R., Ferreira, I. A., Marelli, S., et al. (2021). Recurrent emergence of SARS-CoV-2 spike deletion H69/V70 and its role in the Alpha variant B.1.1.7. Cell Reports 35:109292. doi: 10.1016/j.celrep.2021.109292

CrossRef Full Text | Google Scholar

Micsonai, A., Wien, F., Kernya, L., Lee, Y.-H., Goto, Y., Réfrégiers, M., et al. (2015). Accurate secondary structure prediction and fold recognition for circular dichroism spectroscopy. Proc. Natl. Acad. Sci. 112, E3095–E3103. doi: 10.1073/pnas.1500851112

CrossRef Full Text | Google Scholar

Mittal, A., Khattri, A., and Verma, V. (2022). Structural and antigenic variations in the spike protein of emerging SARS-CoV-2 variants. PLoS Pathog. 18:e1010260. doi: 10.1371/journal.ppat.1010260

PubMed Abstract | CrossRef Full Text | Google Scholar

Naveca, F., Nascimento, V., Souza, V., Corado, A., Nascimento, F., Silva, G., et al. (2021). Phylogenetic relationship of SARS-CoV-2 sequences from Amazonas with emerging Brazilian variants harboring mutations E484K and N501Y in the Spike protein. Virological. Org. 1, 1–8.

Google Scholar

Parsons, L.M., Bouwman, K.M., Azurmendi, H., De Vries, R.P., and Cipollo, J.F. MH Verheije. (2019). Glycosylation of the viral attachment protein of avian coronavirus is essential for host cell and receptor binding. J. Biol. Chem. 294, 7797–7809, doi: 10.1074/jbc.RA119.007532

PubMed Abstract | CrossRef Full Text | Google Scholar

Peacock, T., Reddy, K., James, J., Adamiak, B., Barclay, W., Shelton, H., et al. (2016). Antigenic mapping of an H9N2 avian influenza virus reveals two discrete antigenic sites and a novel mechanism of immune escape. Sci. Rep. 6, 1–12. doi: 10.1038/srep18745

CrossRef Full Text | Google Scholar

Pettersen, E. F., Goddard, T. D., Huang, C. C., Couch, G. S., Greenblatt, D. M., Meng, E. C., et al. (2004). UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612. doi: 10.1002/jcc.20084

PubMed Abstract | CrossRef Full Text | Google Scholar

Raza, M. T., Mizan, S., Yasmin, F., Akash, A.-S., and Shahik, S. M. (2021). Epitope-based universal vaccine for Human T-lymphotropic virus-1 (HTLV-1). PLoS One 16:e0248001. doi: 10.1371/journal.pone.0248001

PubMed Abstract | CrossRef Full Text | Google Scholar

Rencilin, C. F., Rosy, J. C., Mohan, M., Coico, R., and Sundar, K. (2021). Identification of SARS-CoV-2 CTL epitopes for development of a multivalent subunit vaccine for COVID-19. Infect. Genet. Evol. 89:104712. doi: 10.1016/j.meegid.2021.104712

PubMed Abstract | CrossRef Full Text | Google Scholar

Rist, M. J., Neller, M. A., Burrows, J. M., and Burrows, S. R. (2015). T cell epitope clustering in the highly immunogenic BZLF1 antigen of Epstein-Barr virus. J. Virol. 89, 703–712. doi: 10.1128/JVI.02642-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Rochman, N. D., Wolf, Y. I., Faure, G., Mutz, P., Zhang, F., and Koonin, E. V. (2021). Ongoing global and regional adaptive evolution of SARS-CoV-2. Proc. Natl. Acad. Sci. U. S. A. 118, –e2104241118. doi: 10.1073/pnas.2104241118

PubMed Abstract | CrossRef Full Text | Google Scholar

Rogers, D. M., Jasim, S. B., Dyer, N. T., Auvray, F., Réfrégiers, M., and Hirst, J. D. (2019). Electronic circular dichroism spectroscopy of proteins. Chem 5, 2751–2774. doi: 10.1016/j.chempr.2019.07.008

CrossRef Full Text | Google Scholar

Schmidt, M. E., and Varga, S. M. (2018). The CD8 T cell response to respiratory virus infections. Front. Immunol. 9:678. doi: 10.3389/fimmu.2018.00678

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, X., Tang, H., McDanal, C., Wagh, K., Fischer, W., Theiler, J., et al. (2021). SARS-CoV-2 variant B. 1.1.7 is susceptible to neutralizing antibodies elicited by ancestral spike vaccines. Cell Host Microbe 29, 529–539.e523. doi: 10.1016/j.chom.2021.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Shey, R. A., Ghogomu, S. M., Shintouo, C. M., Nkemngo, F. N., Nebangwa, D. N., Esoh, K., et al. (2021). Computational design and preliminary serological analysis of a novel multi-epitope vaccine candidate against Onchocerciasis and related filarial diseases. Pathogens 10, 99. doi: 10.3390/pathogens10020099

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, J., Zhang, J., Li, S., Sun, J., Teng, Y., Wu, M., et al. (2015). Epitope-based vaccine target screening against highly pathogenic MERS-CoV: an in silico approach applied to emerging infectious diseases. PLoS One 10:e0144475. doi: 10.1371/journal.pone.0144475

PubMed Abstract | CrossRef Full Text | Google Scholar

Sievers, F., and Higgins, D. G. (2018). Clustal Omega for making accurate alignments of many protein sequences. Protein Sci. 27, 135–145. doi: 10.1002/pro.3290

PubMed Abstract | CrossRef Full Text | Google Scholar

Sievers, F., and Higgins, D. G. (2021). The clustal omega multiple alignment package. Methods Mol. Biol. 2231, 3–16. doi: 10.1007/978-1-0716-1036-7_1

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, H., and Raghava, G. J. B. (2003). ProPred1: prediction of promiscuous MHC class-I binding sites. Bioinformatics 19, 1009–1014. doi: 10.1093/bioinformatics/btg108

PubMed Abstract | CrossRef Full Text | Google Scholar

Soria-Guerra, R. E., Nieto-Gomez, R., Govea-Alonso, D. O., and Rosales-Mendoza, S. (2015). An overview of bioinformatics tools for epitope prediction: implications on vaccine development. J. Biomed. Inform. 53, 405–414. doi: 10.1016/j.jbi.2014.11.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Stufano, A., Capone, G., Pesetti, B., Polimeno, L., and Kanduc, D. (2010). Clustering of rare peptide segments in the HCV immunome. Self 1, 154–162. doi: 10.4161/self.1.2.11391

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, H., Yang, J., Zhang, T., Long, L.-P., Jia, K., Yang, G., et al. (2013). Using sequence data to infer the antigenicity of influenza virus. MBio 4, e00230–e00213. doi: 10.1128/mBio.00230-13

CrossRef Full Text | Google Scholar

Tay, M. Z., Poh, C. M., Rénia, L., MacAry, P. A., and Ng, L. F. (2020). The trinity of COVID-19: immunity, inflammation and intervention. Nat. Rev. Immunol. 20, 363–374. doi: 10.1038/s41577-020-0311-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Tscharke, D. C., Croft, N. P., Doherty, P. C., and La Gruta, N. L. (2015). Sizing up the key determinants of the CD8+ T cell response. Nat. Rev. Immunol. 15, 705–716. doi: 10.1038/nri3905

PubMed Abstract | CrossRef Full Text | Google Scholar

Vita, R., Overton, J. A., Greenbaum, J. A., Ponomarenko, J., Clark, J. D., Cantrell, J. R., et al. (2015). The immune epitope database (IEDB) 3.0. Nucleic Acids Res. 43, D405–D412. doi: 10.1093/nar/gku938

PubMed Abstract | CrossRef Full Text | Google Scholar

Vivona, S., Gardy, J. L., Ramachandran, S., Brinkman, F. S., Raghava, G. P. S., Flower, D. R., et al. (2008). Computer-aided biotechnology: from immuno-informatics to reverse vaccinology. Trends Biotechnol. 26, 190–200. doi: 10.1016/j.tibtech.2007.12.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Vujovic, M., Degn, K. F., Marin, F. I., Schaap-Johansen, A.-L., Chain, B., Andresen, T. L., et al. (2020). T cell receptor sequence clustering and antigen specificity. Comput. Struct. Biotechnol. J. 18, 2166–2173. doi: 10.1016/j.csbj.2020.06.041

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, P., Casner, R. G., Nair, M. S., Wang, M., Yu, J., Cerutti, G., et al. (2021a). Increased resistance of SARS-CoV-2 variant P. 1 to antibody neutralization. Cell Host Microbe 29, 747–751.e4. doi: 10.1016/j.chom.2021.04.007

CrossRef Full Text | Google Scholar

Wang, L., Deng, X., Liu, H., Zhao, L., You, X., Dai, P., et al. (2016). The mimic epitopes of Mycobacterium tuberculosis screened by phage display peptide library have serodiagnostic potential for tuberculosis. Pathog. Dis. 74, ftw091. doi: 10.1093/femspd/ftw091

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, P., Nair, M. S., Liu, L., Iketani, S., Luo, Y., Guo, Y., et al. (2021b). Antibody resistance of SARS-CoV-2 variants B. 1.351 and B. 1.1. 7. Nature 593, 130–135. doi: 10.1038/s41586-021-03398-2

CrossRef Full Text | Google Scholar

Wong, Y. C., Croft, S., Smith, S. A., Lin, L. C., Cukalac, T., La Gruta, N. L., et al. (2019). Modified vaccinia virus Ankara can induce optimal CD8+ T cell responses to directly primed antigens depending on vaccine design. J. Virol. 93, e01154–e01119. doi: 10.1128/JVI.01154-19

CrossRef Full Text | Google Scholar

Wu, C.-H., Liu, I.-J., Lu, R.-M., and Wu, H.-C. (2016). Advancement and applications of peptide phage display technology in biomedical science. J. Biomed. Sci. 23, 1–14. doi: 10.1186/s12929-016-0223-x

CrossRef Full Text | Google Scholar

Zhang, Q., Wang, P., Kim, Y., Haste-Andersen, P., Beaver, J., Bourne, P. E., et al. (2008). Immune epitope database analysis resource (IEDB-AR). Nucleic Acids Res. 36, W513–W518. doi: 10.1093/nar/gkn254

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, M., and Song, L. (2020). Novel antibody epitopes dominate the antigenicity of spike glycoprotein in SARS-CoV-2 compared to SARS-CoV. Cell. Mol. Immunol. 17, 536–538. doi: 10.1038/s41423-020-0385-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Zost, S. J., Wu, N. C., Hensley, S. E., and Wilson, I. A. (2019). Immunodominance and antigenic variation of influenza virus hemagglutinin: implications for design of universal vaccine immunogens. J. Infect. Dis. 219, S38–S45. doi: 10.1093/infdis/jiy696

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: emerging variants, CTL epitopes, B.1.1.7, antigenicity, cluster, phage-displayed peptides, CD spectra, SAg-like region

Citation: Bhattacharya M, Sharma AR, Mallick B, Lee S-S, Seo E-M and Chakraborty C (2022) B.1.1.7 (Alpha) variant is the most antigenic compared to Wuhan strain, B.1.351, B.1.1.28/triple mutant and B.1.429 variants. Front. Microbiol. 13:895695. doi: 10.3389/fmicb.2022.895695

Received: 14 March 2022; Accepted: 25 July 2022;
Published: 12 August 2022.

Edited by:

Wanbo Tai, Shenzhen Bay Laboratory, China

Reviewed by:

Andrei Lobiuc, Ștefan cel Mare University of Suceava, Romania
J. Shawn Goodwin, Meharry Medical College, United States

Copyright © 2022 Bhattacharya, Sharma, Mallick, Lee, Seo and Chakraborty. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Eun-Min Seo, c2VvZW1AaGFsbHltLm9yLmty; Chiranjib Chakraborty, ZHJjaGlyYW5qaWJAeWFob28uY29t

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.