- 1Escuela Profesional de Genética y Biotecnología, Facultad de Ciencias Biológicas, Universidad Nacional Mayor de San Marcos, Lima, Peru
- 2Departamento de Ciencias Celulares y Moleculares, Laboratorio de Bioinformática, Biología Molecular y Desarrollos Tecnológicos, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia (UPCH), Lima, Peru
- 3Departamento de Salud Pública, Facultad de Ciencias de la Salud, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Chachapoyas, Peru
- 4Instituto de Ganadería y Biotecnología, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Chachapoyas, Peru
- 5Laboratory of Cellular Biophysics, The Rockefeller University, New York, NY, United States
Clostridium perfringens is a dangerous bacterium and known biological warfare weapon associated with several diseases, whose lethal toxins can produce necrosis in humans. However, there is no safe and fully effective vaccine against C. perfringens for humans yet. To address this problem, we computationally screened its whole proteome, identifying highly immunogenic proteins, domains, and epitopes. First, we identified that the proteins with the highest epitope density are Collagenase A, Exo-alpha-sialidase, alpha n-acetylglucosaminidase and hyaluronoglucosaminidase, representing potential recombinant vaccine candidates. Second, we further explored the toxins, finding that the non-toxic domain of Perfringolysin O is enriched in CTL and HTL epitopes. This domain could be used as a potential sub-unit vaccine to combat gas gangrene. And third, we designed a multi-epitope protein containing 24 HTL-epitopes and 34 CTL-epitopes from extracellular regions of transmembrane proteins. Also, we analyzed the structural properties of this novel protein using molecular dynamics. Altogether, we are presenting a thorough immunoinformatic exploration of the whole proteome of C. perfringens, as well as promising whole-protein, domain-based and multi-epitope vaccine candidates. These can be evaluated in preclinical trials to assess their immunogenicity and protection against C. perfringens infection.
Introduction
Clostridium perfringens is a Gram-positive bacterium frequently associated with systemic and enteric diseases (1). In humans, C. perfringens is one of the most common food-poisoning causing bacteria, responsible for 1,000,000 cases per year in the US (2). Over a thousand cases result in gas gangrene, which can take from hours to weeks to develop depending on the tissue oxygen levels (3). It has a 10-30% mortality rate when treated, but 100% when untreated (4). Moreover, it is a pathogen known as a biological warfare weapon (5, 6). Therefore, it is necessary to develop preventive tools, like the identification of proteins and domains that can be used for molecular diagnostics and vaccines.
The histotoxic infections caused by this bacterium include gas gangrene in contaminated wounds, and several symptoms of human gastrointestinal diseases by either food- or non-food-borne C. perfringens infection (7, 8). C. perfringens isolates are classified into five toxinotypes, based on the production of four major toxins: α, β, ϵ and ι (1). C. perfringens type A is the main toxinotype that infects humans, producing gas gangrene, food poisoning, and non-foodborne gastrointestinal disease (9). Its main mechanism of cell invasion depends on the formation of a pore in the host cell membrane. The phospholipase C (cpa) and perfringolysin O (pfo) are involved in histotoxic infections, while the enterotoxin (etx), the β toxin (cpb) and the β-like toxin, the epsilon toxin (cpe) are involved in intestinal diseases (9, 10).
When C. perfringens toxins enter host cells, they are cut into small peptides by the proteasomes (11). Similarly, the whole bacterium can also be phagocytized and degraded by the endolysosomes into peptides. In both scenarios, the HLA (human leukocyte antigen) molecules (class I and class II, respectively) bind to these peptides and display them on the cell surface. Then, the HLA-peptide complexes are recognized by the TCR receptor in the surface of CD8+ and CD4+ immature T-cells, respectively, triggering an adaptive immune response. They will mature into cytotoxic T (CTL) and T-helper (HTL) lymphocytes. CTLs will produce a cytotoxic response against the infected cells, whereas HTL will stimulate the proliferation of antigen-specific B cells by clonal expansion, generating specific antibodies against the pathogen (12).
Vaccination is the most cost-effective method to prevent diseases (13). Although there is a vaccine available against C. perfringens for sheep and goats (14), there is no one approved for humans (5). Traditional vaccine development approaches are based on whole attenuated or dead microorganisms, and inactivated bacterial toxins. Nevertheless, they present the risk of potential reactivation or recombination of the vaccine strain, as well as offering limited protective effectiveness and immunity versus newer technologies (13). An alternative is subunit-based vaccines, consisting in one or more domains of antigenic proteins. Generally, these protein domains should be easily accessible, such as the external region of membrane proteins. As example, the NVX-CoV2373, a protein subunit-based vaccine producing the recombinant Spike protein, have demonstrated to neutralize the virus in different organism models and in humans (15, 16).
A new kind of vaccine, called the multi-epitope vaccine, has gained popularity in recent years (17). It consists of a novel protein connecting immunogenic epitopes. This kind of vaccine provides certain advantages over classical vaccines and single-epitope vaccines, such as a broader spectrum of pathogen variants, an optimized design that elicits both CTL, HTL, and B cell responses, and reduced adverse effects (18). It relies on the identification of epitopes through computational prediction and experimental testing. Epitopes are selected based on different criteria and assembled into a construct for their delivery to the immune system machinery. Multi-epitopes vaccines have already shown good results when tested in human clinical trials. For example, EMD640744 and Reniale have shown immunologic efficacy against advanced solid tumors (19) and in reducing tumor progression (20), respectively. Also, multi-epitope vaccines developed to trigger cross-immunity against different strains of influenza have shown great immunogenicity (21). Additionally, in mice, a multi-epitope vaccine has shown protective immunity against Toxoplasma gondii (22).
Nowadays, immunoinformatics tools help to massively screen protein sequences. They allow the computational identification of antigenic proteins and epitopes, reducing development time and cost (23). They rely on experimental data, which is available in databases like the Immune Epitope Database (IEDB) (24). It collects data of antibodies and CTL-, HTL- and B-epitopes, detected or evaluated in humans and other animal species (24, 25). Furthermore, there are predictors of CTL- and HTL-epitopes based on artificial intelligence tools and trained on experimental information. Among them, we have NetMHCpan I and II (26, 27) and MHCFlurry (28), which are currently the best according to independent evaluations (29). Additionally, Epitope-Evaluator performs comparative analysis of the outputs of these predictors, allowing an easy and graphical identification of highly antigenic proteins, as well as conserved promiscuous epitopes (30). Protein modeling and molecular dynamics (MD) allow studying structural characteristics of antigenic proteins, such as flexibility, disorder degree, and solvent accessibility, among others. These features are related to the immune response elicited (31–34). This could be because flexible and disordered antigens have more different conformations available at the moment of binding to the immune system molecules, maximizing favorable interactions (33).
In the present study, we predicted immunogenic epitopes from the 2721 proteins that comprise the known proteome of C. perfringens type A, to identify vaccine candidates following 3 different approaches. First, we used epitope prediction to identify the proteins containing the highest number of epitopes that may elicit a good immunogenic response when used as recombinant vaccines. Second, we analyzed only the toxins and determined which non-toxic regions of these proteins are rich in HTL-epitopes and could be used as a vaccine, without the histotoxic damage. Third, using the best candidate epitopes, we generated synthetic constructs and studied their structural characteristics such as the flexibility and the accessibility of the epitopes through molecular dynamics. As result, we are presenting novel candidates for further testing as potential vaccines.
Material and methods
Data retrieval
Protein sequences and selection
We downloaded amino acid sequences of the 2721 proteins reported for C. perfringens Type A in the UniProt database, using the reference proteome with ID:UP000000818 (Supplementary Table 1). We explored the proteome, following three approaches (Figure 1).
Figure 1 Flowchart of the immunoinformatic exploration of pathogens for vaccine development. We present three workflows to identify: (A) immunogenic proteins for protein-based vaccines, (B) protein domains enriched in HTL epitopes for subunit vaccines, and (C) nested epitopes for the design of novel multi-epitope protein vaccines. The proteins obtained in these three approaches might be used as well in immunodiagnostic tests.
In the first approach, we sought to identify proteins with most epitopes. So, the “epitope density” of each protein was calculated, which was defined as the number of epitopes of the protein divided by the length (in aa) of the protein. This was calculated using the tool “Epitope Density” from Epitope-Evaluator (https://fuxmanlab.shinyapps.io/Epitope-Evaluator/) (30). Only proteins longer than 100aa with “evidence at protein level” according to the UniProt database were considered in this approach (Figure 1A).
In the second approach, the epitopes within toxins were analyzed, using the following protein sequences: perfringolysin O (P0C2E9), enterotoxin A (Q8XKY4), enterotoxin B (Q8XKP0), enterotoxin D (Q8XMT2), beta2-toxin (Q93MD0) and phospholipase C (P0C216), as previously reported (10, 35). To propose a subunit-based vaccine that induces an appropriate humoral response, we further studied the HTL-epitopes in the non-toxic domains of these proteins. Among them, only the non-toxic domain of Perfringolysin O is well characterized, so we further studied this region. This protein is one of the most immunogenic toxins of C. perfringens (35), and its non-toxic domain is between the amino acid position 1 and 363, while the toxic and binding domains are reported to be between 363 and 472 (35, 36). This analysis was performed using the “Epitope Location” tool of Epitope-Evaluator (30) and the tridimensional model obtained by AlphaFold2 (Figure 1B).
In the third approach, we started with all the epitopes in the proteins of C. perfringens Type A and applied consecutive filters, as described below (see Figure 1C).
Selection of HLA class-I and -II alleles
Each population has a different distribution of HLA class I and II alleles (37, 38). Therefore, to maximize the potential use of our construct across populations, the HLA “supertype” alleles were considered as they are representative of the most frequent HLAs class I and II alleles worldwide. For HLA class -I, these are: HLA-A*01:01 (A1), HLA-A*26:01 (A1), HLA-A*02:01 (A2), HLA-A*03:01 (A3), HLA-A*24:02 (A24), HLA-B*07:02 (B7), HLA-B*08:01 (B8), HLA-B*27:05 (B27), HLA-B*39:01 (B27), HLA-B*40:01 (B44), HLA-B*58:01 (B58) and HLA-B*15:01 (B62) (39). And for HLA class-II: DRB1*03:01, DRB1*04:01, DRB1*04:05, DRB1*08:02, DRB1*11:01, DRB1*13:02 and DRB3*01:01 and DRB3*02:02 (9, 10, 40).
Prediction and selection of candidate epitopes
Prediction of CTL- and HTL-epitopes
Epitopes of 8, 9, and 10 amino acids (aa) were predicted in C. perfringens proteins for the HLA class-I supertype alleles using NetMHCpan v4.1 (27) and MHCFlurry 2.0 (28), selecting based on the consensus of these predictors. For HLA class-II, epitopes of 15 aa were predicted using the NetMHCIIpan v4.0 (26). A threshold of rank ≤ 2% was used for all these software. With the prediction results, the epitope promiscuity (number of HLA alleles each epitope is predicted to bind to) was calculated, identifying epitopes that can bind to many alleles.
Discarding epitopes present in the host proteome
A previous study has shown that epitopes present in both proteins of the pathogen and proteins of the host may trigger an auto-immune response (41). To prevent this, all predicted epitopes matching human proteins with 100% identity and sequence coverage were removed.
In the first and second approaches described in 2.1.1, the whole protein was discarded. In the third approach, just the epitope was discarded. All the proteoforms of the human proteome (UniProt ID: UP000005640) were used as reference.
Multi-epitope vaccine design
Identification of nested epitopes
To keep the artificial multi-epitope protein small while maximizing the number and quality of epitopes in the construct, highly immunogenic overlapping candidate epitopes were selected, which we called “nested epitopes”. These were defined as linear HTL-epitopes containing linear CTL-epitopes in its sequence. Previous studies have also used this approach to induce both CD4+ and CD8+ T-cell responses (40, 41).
Filtering proteins by the number of transmembrane helices and UniProt evidence
To further filter the list of nested epitopes, only predicted epitopes belonging to proteins having two or more predicted external transmembrane regions with epitopes within were considered. Transmembrane helices were predicted using TMHMM (http://www.cbs.dtu.dk/services/TMHMM/) (42, 43), and the presence of epitopes in external regions was checked with a custom script in Python. Additionally, only proteins with the following evidence categories according UniProt were considered: “Experimental evidence at protein level”, “Experimental evidence at transcript level” and “Protein inferred from homology”. The annotation found in UniProt was independently verified for correctness.
Selection of nested epitopes with high immunogenicity
From the proteins filtered above, only those nested epitopes predicted to bind 5 or more HLA-II supertype alleles were selected. Adjacent nested epitopes with overlap were merged to reduce the number of peptides while extending the predicted immunogenic regions. Non-overlapping nested epitopes were discarded.
Determination of highly conserved epitopes
Pathogens frequently mutate as an adaptation mechanism to environmental and immunological pressure, generating multiple variants (44). Selecting conserved regions may extend the validity of vaccines over time and confer protection against different strains. For this purpose, we performed BLASTp of our nested epitopes against C. perfringens. Epitopes with 90% of conservation or higher in the alignment were selected as candidate epitopes.
In addition, the conservation of each candidate nested epitope among the five C. perfringens toxinotypes was calculated. For each of the proteins having any of our candidate nested epitopes, a sequence alignment of the protein and its corresponding homologous proteins in the other toxinotypes was made. First, to find the homologous proteins, a Blast alignment was performed between the C. perfringens type A protein and the whole proteome of each toxinotype (Supplementary Table 2). Next, multiple global alignments were performed by protein using ClustalX2 (45). The alignments were visualized using CLC Sequence Viewer, where the regions corresponding to nested epitopes were extracted.
Design of multi-epitope vaccine candidates
Multi-epitope constructs were designed by concatenating the candidate epitopes in different orders. However, when assembling the constructs, neoepitopes that can bind to the HLA supertype alleles may appear at the interface of two candidate epitopes joined. Therefore, we need to know which epitope connections are allowed. This problem was modeled using a directed graph, where the nested candidate epitopes were represented as nodes, connected by directed edges representing the order of precedence in which the epitopes can be concatenated. To determine which directed edges are allowed in the graph, all the candidate epitopes were concatenated in a “pyramidal order”, which is a single sequence containing all possible connections (see Figure 4A). This sequence was submitted to NetMHCIIpan to predict if there are strong binder epitopes located in the interface between two nested epitopes. Thus, the edges (connections) harboring unwanted neoepitopes were removed from the directed graph. Then, all the semi and complete Hamiltonian paths were found using EpiSorter, a python-based toolset for multi-epitope assembly (Figure 4B). Finally, those candidate epitopes either disconnected from the rest of the graph or with no edges enabling the generation of a complete path were discarded.
Additionally, neoepitopes at the interface of two candidate epitopes may be present in the host proteome, which may result in unwanted immune responses potentially leading to auto-immune reactions or tolerance. To prevent this, the multi-epitope constructs were sliced in all the possible fragments of 15 aa and we evaluated if they were present in any human protein using BLASTp against the human proteome. This was performed using BioPython and the following parameters: program = “blastp”, database = “nr”, entrez_query = “txid9606ORGN, expect = 20000, alignments = 100”. For any 15-mer, if scores of 100% of coverage and 100% of identity were obtained, the corresponding construct was discarded.
Structural analysis of the vaccine constructs
Prediction of physicochemical properties
The physicochemical properties of the multi-epitope vaccines were calculated using ProtParam (https://web.expasy.org/protparam/) (46), which computes the molecular weight, theoretical isoelectric point, grand average of hydropathicity (GRAVY), among other metrics.
Structural modeling
The structure of the multi-epitope proteins were predicted with AlphaFold v. 2.1.0 (47), using all the databases to search for templates and the model “monomer_casp14” to infer the 3D coordinates. For each multi-epitope construct, its possible 3D structures were ranked by mean pLDDT and the best five structures were refined by restrained energy minimization with AMBER99SB, as implemented in the AlphaFold2 pipeline. Thus, selecting the structure with highest confidence for each multi-epitope construct. Then, the two multi-epitope proteins with the highest mean pLDDT were selected to analyze their structural conformation through MD simulation.
Molecular dynamics
We performed 1.2 μs of MD for each multi-epitope structure. First, the structures were submitted to the PDB2PQR web server (48) to add the corresponding hydrogens at pH 7.4. Then, MD was carried out using NAMD 2.14 (49) with the CHARMM36 force field (50). Accordingly, NaCl ions were included on the surface of the protein based on its Coulombic potential using the package cIonize 2.0 (51). Next, ions located 20 Å farther from the protein were removed. Finally, the protein was solvated in a size-optimized box with 15 Å of padding and a salt concentration of 0.154 M, using the Autoionize Plugin v.1.5 (https://www.ks.uiuc.edu/Research/vmd/plugins/autoionize/).
Initially, only the water atoms were minimized for 5000 steps using the conjugate gradient algorithm. Then, MD of these atoms was performed at 0 K for 30 ps. Next, the whole system was minimized for 5000 steps, equilibrating the temperature to 298.15 K and the pressure 1 bar, as previously described (52). For this process, the Langevin thermostat and the Nosé-Hoover Langevin barostat were used in the NPT ensemble. Briefly, the system was heated from 50 K to 298 K, increasing the temperature by 4 K every 10 ps and applying harmonic restraints to the backbone with a force constant of 5 kcal/mol. Subsequently, the restraints were reduced by 10% every 0.05 ns. Finally, 1.2 μs of unrestrained MD of the NPT ensemble was performed at 298.15 K. Changes of temperature, potential energy, and density along the simulation were examined to verify convergence. All the processes described were performed in periodic boundary conditions with an integration time of 2 fs/timestep and Particle Mesh Ewald (PME) grid spacing of 1.0 Å. The cut-off for non-bonded interactions was set at 12 Å.
Assessing conformational convergence
For each trajectory, the alpha carbon root mean square deviation (Cα-RMSD) of all frames was calculated with an in-house Tcl script that uses VMD functions and the module “bigdcd”. The frame 0 was used as reference point. The structural compactness was quantified by the radius of gyration (Rg), which was calculated using an in-house Tcl script similar to the one described above. Trajectories where the RMSD and Rg did not converge were discarded. The remaining trajectories were trimmed to only include the interval where the RMSD and Rg converged.
It is known that two completely different structures could have the same Cα-RMSD when compared against the same reference. Therefore, we complemented our graphical approach with a Cα-RMSD-based clustering in Wordom v.0.22 (53). This method assigns conformations to the same cluster if every pair in the group has a Cα-RMSD less or equal to a given threshold (cluster diameter) (54), which we set as 2.5 Å. Therefore, finding a cluster much bigger than the rest suggests that a dominant conformation was produced by the simulation, suggesting conformational convergence.
Final refinement and evaluation of the structural quality
The centroid of the most populated cluster of each remaining multi-epitope MD was subjected to a two-step final refinement. First, main-chain and fast all-atom energy minimizations were conducted using the web server ModRefiner (55), without a reference structure. Second, the protein was solvated, followed by a minimization and MD of only water atoms using the methodology described above. Then, the output was minimized for 5000 steps in explicit solvent with NAMD and the CHARMM36 force field, applying harmonic restraints to all protein atoms with a force constant of 1 kcal/mol. The refined structures were uploaded to the web server MolProbity to construct Ramachandran plots. In addition, the web implementations of ProSA (56) and ERRAT (57) were used to further assess the structural quality.
Analysis of epitope flexibility and accessibility in MEP_12 structure
For each MD of the multi-epitope constructs, the centroids of the five most populated clusters were compared against each other. First, to have all the structures in the same coordinate system, all the centroids were RMS-aligned against the initial AlphaFold2 model in Pymol v.2.4.1 (58). Then, the AlphaFold2 model was removed and Pymol selection algebra was used to obtain a list of residues forming the same secondary structure in all the centroids.
To identify the most rigid and accessible epitope within the multi-epitope construct, the convergent trimmed trajectories were analyzed, identifying the most flexible protein regions by computing the Cα-RMSF of the trimmed trajectory in VMD. The trimmed trajectory was loaded with a step of 5 and aligned against frame 0, then the Cα-RMSF measured using these modified coordinates. Finally, the Solvent Accessible Surface Area (SASA) per epitope was measured during the trimmed trajectory using an in-house Tcl script. Then, the module “bigdcd” was used to load the complete trimmed trajectory and examine the changes of SASA along it, assessing the convergence of each epitope. For each epitope whose SASA converged, the mean SASA over the entire trajectory was computed and 95% confidence intervals calculated by computing the statistical inefficiency with the block averaging approach (59).
Evaluation of MEP_12 innate response potential
The potential of MEP_12 to trigger an innate response was tested based on its interaction with TLR1/TLR2 and TLR4/MD2. The representative conformation of MEP_12, from MD Ca-RMSD-based clustering, was docked against the ectodomains of TLR1/TLR2 (PDB ID: 2Z7X) and TLR4/MD-2 (TLR4/MD-2, PDB ID: 3FXI) using Haddock v.2.4 (60). The structures of these TLRs were submitted to PDBfixer to complete the missing sections and remove irrelevant heteroatoms (61). As TLR ectodomains are glycosylated in vivo (62), the covalently attached glycans of the structures were kept in order to obtain a TLR-MEP_12 interaction closer to real conditions.
As information about the possible interaction sites was not available, a blind docking approach was followed. Haddock ab-initio mode was used to scan the surface of MEP_12 and the corresponding TLR ectodomain, to find the most favorable interacting pose. This was performed during the rigid docking phase. Fifty thousand structures were computed for this step, to ensure that the whole surface of each protein will be sampled. Then, the best 500 structures were selected based on the Haddock scores, which were calculated from the semi-flexible simulated annealing and the final energy minimization refinement. The solutions of both docking steps were clustered by Fraction of Common Contacts (FCC), using 0.6 Å as threshold. Cluster-mean Haddock scores were computed using the structures with the four lowest values. Then, the clusters were ranked, where those with lower mean Haddock scores were the most favorable.
The most favorable clusters that had overlapping error bars (one standard deviation) of their Haddock-scores were selected. Protein-protein interaction analysis of the epitopes was performed using a custom Python pipeline, evaluating the plausibility of the solution based on prior structural data. After selecting one cluster, its most favorable structure was submitted to the web servers PDBsum and PRODIGY to map the interacting residues and to compute the protein-protein binding energy, respectively (63, 64).
Simulating the immune response
The immune response profile by immunization with the multi-epitope vaccine was simulated using C-IMMSIM (https://kraken.iac.rm.cnr.it/C-IMMSIM/). Three vaccinations with a dose of 1000 unit of the multi-epitope vaccine and 100 of adjuvant without LPS were administrated as described in Figure 10A. The most frequent HLA alleles were considered: HLA-A*0101, HLA-B*08:01, HLA-B*15:01, DRB1-11:01 and DRB3-02:02. One thousand timesteps of 8 hours were simulated, representing ~11 months. Vaccine doses were administered four weeks apart, as commonly recommended (65), corresponding to days 3, 30 and 60. Pathogen challenge was introduced on day 111 by inoculating 1000 units of those proteins of C. perfringens type A harboring epitopes of the vaccine, with a pathogen multiplication factor of 0.2 (Figure 10A).
Multi-epitope sequence design and codon optimization
Codon optimization was performed to improve the expression efficiency of the vaccine construct in Escherichia coli for production. The codon usage table of E. coli K12 strain, available in the Codon Usage Database (https://www.kazusa.or.jp/codon/), was used for reverse translation. The CAIcal SERVER (http://genomes.urv.es/CAIcal/) was used to calculate the Codon Adaptation Index (CAI). The cDNA sequence obtained was analyzed with NEBcutter (http://nc2.neb.com/NEBcutter2/), identifying cleavage sites of commercially available restriction enzymes.
Results
Identification of proteins with high epitope density
A total of 429809 CTL- and 121450 HTL-epitopes were predicted from the proteome of C. perfringens Type A. The number of CTL- and HTL-epitopes predicted were strongly correlated with the protein length (CTL: p < 2.2e-16, r = 0.97; HTL: p < 2.2e-16, r = 0.91, Pearson correlation) (Figure 2A). There is also a positive correlation between the number of CTL- and HTL-epitopes predicted by protein (p < 2.2e-16, r = 0.87, Pearson correlation) (Figure 2B), indicating that C. perfringens proteins containing a higher number of predicted CTL-epitopes tend to contain more HTL-predicted epitopes as well.
Figure 2 Description of the predicted epitopes in C. perfringens. (A) Correlation between the number of epitopes and protein length. Blue: CTL-epitopes. Red: HTL-epitopes. (B) Correlation between the number of HTL- and CTL-epitopes. (C) Comparison of the number of epitopes per protein, for different epitope lengths of CTL- (8, 9 and 10 aa) and HTL- (15 aa) epitopes. (D) Lasso regression of the number of CTL- (blue) and HTL- (red) epitopes across the proteome of C. perfringens. (E) Correlation between protein length and number of CTL- and HTL-epitopes in the protein (top and bottom, respectively). Proteins with more than 500 CTL- and 200 HTL-epitopes (above the dashed line) are labeled. (F) Bar plot showing the number of CTL- and HTL-epitopes (top and bottom, respectively) by HLA allele within, for the four proteins highlighted on panel (E). **** means p-value < 0.0001.
Regarding the epitope length, we found a significantly greater number of 9-mer than 8-mer or 10-mer predicted CTL-epitopes (Wilcoxon test, p value < 2.2e-16), as biologically expected. Additionally, the number of 9-mer CTL epitopes predicted by protein was higher than the number of HTL-epitopes (Figure 2C). These observations were statistically significant even when normalizing by protein length (Wilcoxon test, p < 2.2e-16) (Supplementary Figure 1A).
The distribution of predicted epitopes across the proteome was explored by randomly concatenating the sequences of all the 2721 proteins and calculating the epitope density along it. Local peaks were detected, showing that the predicted epitopes were not evenly distributed across the proteome and that not all proteins have the same epitope density (Figure 2D).
Epitopes that were either duplicated (i.e. appearing in two or more different proteins) or present in any human protein were removed, resulting in 121244 (99.83%) HTL- and 427944 (99.56%) predicted CTL-epitopes left (Supplementary Table 3). This comprises 59578, 237664 and 130702 CTL-epitopes predicted of 8, 9 and 10 aa, respectively (Supplementary Figure 1B). Regarding the epitope promiscuity (i.e. the number of alleles an epitope can bind to), we found that the majority of the CTL-epitopes predicted (312032 out of 427944) bind to only one HLA-I allele. Similarly, 67080 of the 121244 HTL-epitopes were predicted to bind just one HLA-II. Noteworthy, there were 2 promiscuous CTL-epitopes binding to 11 HLA-I alleles, and 33 HTL-epitopes binding to 8 HLA-II alleles (Supplementary Figures 1C, D).
The evidence status of the proteins with at least 1 epitope predicted was retrieved from UniProt, obtaining 32 with “evidence at protein level”, 1 with “evidence at transcript level”, 1064 with “inferred from homology”, 1623 “predicted”, and 1 “uncertain”. The set of proteins with “evidence at protein level”, when compared with the set of proteins “inferred from homology”, showed no significant difference in the number of epitopes (p = 0.33) but in epitope density (p < 0.02) (Supplementary Figures 1E, F).
Among the proteins with “evidence at protein level”, 25 show an epitope density above 0.5 (Supplementary Table 4). Notoriously, each of the proteins Collagenase A, Exo-alpha-sialidase, alpha-n-acetylglucosaminidase and hyaluronoglucosaminidase contain more than 500 CTL- and 200 HTL-epitopes (Figure 2E), and more than 15 epitopes per HLA supertype allele (Figure 2F).
Evaluation of the HTL-epitopes in C. perfringens toxins
The six toxins evaluated have a different number (45–157) but similar density (0.15-0.25) of HTL-epitopes. Notably, enterotoxin D contains the highest number of HTL-epitopes even not being the largest toxin. Contrarily, beta2-toxin showed the lowest number of HTL-epitopes (Figure 3A). In terms of HTL-epitope density, the enterotoxins A and D showed the lowest and highest values, respectively (Figure 3B).
Figure 3 Evaluation of epitopes in C. perfringens toxins. (A) Correlation between the number of HTL-epitopes and protein length. (B) Correlation between HTL-epitope density and protein length. (C) Number of HTL-epitopes within toxins binding each of the HLA-II supertype alleles. (D) Location of the HTL-epitopes along the enterotoxin (D) Epitopes are colored in a gradient from yellow to red, representing the number of HLA alleles they bind. (E) Number of HTL-epitopes in the whole Perfringolysin O toxin (in gray) and its non-toxic domain (red) that are predicted to bind each of the HLA-II supertype alleles. (F) Structure of the Perfingolysin O. The toxic-domain is represented in blue. The non-toxic domain in red, highlighting the most promiscuous epitopes in cyan.
All the toxins, except the beta2-toxin, contained several epitopes by HLA allele. Among the HLA-II alleles used, DRB3*02:02 and DRB1*11:01 were found to recognize the highest and the lowest number of epitopes from toxins (Figure 3C). Regions enriched with promiscuous HTL-epitopes within the toxins were identified. For example, the C-terminal region of the enterotoxin D contains the most promiscuous epitopes of the protein, while the region between positions 130 and 256aa has almost no epitopes (Figure 3D). Similarly, the HTL-epitopes of the enterotoxins A and B were mainly located at the N-terminal region. Additionally, the non-toxic domain of Perfringolysin O contains at least nine predicted epitopes per HLA-II allele (Figure 3E). Furthermore, among the toxins, we identified 12 HTL-epitopes binding five or more HLA-II alleles, covering the majority of HLA-II supertype alleles (Table 1). We also found that four of these promiscuous epitopes are located in the external region of the non-toxic domain of the Perfringolysin O (Figure 3F). Altogether, it suggests that this domain can be considered as a potential subunit-based vaccine.
Design of multi-epitope vaccine candidates
We identified 112714 nested epitopes from the whole proteome, comprising 112714 predicted HTL-epitopes containing 145854 predicted CTL-epitopes (Supplementary Table 5). Next, from the 2685 proteins containing nested epitopes, 266 proteins had at least two external transmembrane regions with at least one nested epitope. Then, the proteins annotated as “predicted proteins” or “uncertain proteins” were removed, resulting in 105 proteins containing 1884 nested epitopes. From this set, only nested epitopes that can bind to at least five HLA-II alleles were selected, obtaining 42. And it was possible to merge 29 of these 42 nested epitopes into 11 overlapped nested epitopes (Table 2), which represent the candidates for the design of the muti-epitope construct (Figure 1C). The conservation analysis showed that these candidate epitopes were highly conserved among the sequences of the different strains of C. perfringens (Table 2). Moreover, these epitopes are 100% conserved among the five toxinotypes (Supplementary Figure 2).
Table 2 Characteristics of the predicted epitopes selected for the construction of the multi-epitope protein.
These 11 overlapped nested epitopes were used to build the directed graph. By finding the allowed directed edges, epitopes “7” was discarded as it was disconnected from the rest of the graph. Also, epitope “2”, because there were no edges permitting the generation of a complete path (Figure 4C). From the subgraph containing the nine remaining overlapped nested epitopes, we obtained 21 different Hamiltonian paths, representing multi-epitope constructs (Supplementary Table 6). Using BLASTp, we obtained that only a 5 aa fragment of an epitope (DTNYI) matched with a human protein (ATP-dependent DNA helicase HFM1) with a 100% coverage and identity. Therefore, no construct was discarded.
Figure 4 Selection of epitopes for the design of a multi-epitope construct. (A) An example of the pyramidal order for 5 epitopes, showing how they should be concatenated into a new protein, to evaluate the presence of neoepitopes in all the connections in just a single prediction step. (B) Allowed connections (without neoepitopes) are represented in the directed graph. The Hamiltonian path (in red) exemplifies a solution containing all the nodes. (C) Graph representing the nested epitopes (nodes) and its allowed connections (edges), selected in this study for the construction of the multi-epitope construct. Epitopes discarded from the design are marked in red.
Structural analysis of the multi-epitope constructs
Physicochemical characterization
We evaluated the physicochemical properties of the constructs using ProtParam (Supplementary Table 7). Each one had 150 aa, a molecular weight of 16.83 kDa and a predicted pI of 4.61, indicating an acidic nature. The number of negatively and positively charged residues computed at pH 7 were 27 and 17, respectively. The aliphatic index (relative volume occupied by the aliphatic side chains) was 93, indicating a thermostable nature. The construct had a GRAVY value of -0.363.
Modelling the constructs
Models of each multi-epitope construct were prepared in AlphaFold2, and its mean pLDDT were computed. Values under 50 were obtained for all models (Supplementary Figure 3A), suggesting structural disorder (66). After modeling the 21 multi-epitope constructs, only MEP_6 and MEP_12 were selected, as they showed the highest mean pLDDT.
The structures of the multi-epitope (MEP) MEP_6 and MEP_12 had the highest mean pLDDT values, being the most reliable ones. Besides the similarity in their mean pLDDT, the distribution of per-residue pLDDT notably differed. MEP_6 had 94.7% of its residues with pLDDT under 50 and 5.3% between 50 and 70 (Figure 5A). In contrast, MEP_12 had 68% of residues with pLDDT under 50, 31.3% between 50 and 70, and 0.7% between 70 and 90 (Figure 5B).
Figure 5 Amino acid sequence, epitope sorting and AlphaFold2 3D model of the multi-epitope constructs MEP_6 (A) and MEP_12 (B). The confidence value (pLDDT) is categorized in 4 groups (orange, yellow, cyan, and blue), representing their percentage in pie charts.
In addition, the per-residue pLDDT seemed to be non-dependent on the epitope from which the residues belong. For instance, the residues of epitope “4” had pLDDTs under 50 in MEP_6, whereas values between 50 and 70 in MEP_12 (Figure 5).
Assessing the conformational convergence
The confidence of our best candidate models (MEP_6 and MEP_12) were improved by performing extensive MDs to refine them, and their conformational changes were evaluated. The RMSD and Rg over each trajectory were measured to assess their conformational convergence.
The RMSD of MEP_6 increased during the last 600 ns of MD, which is indicative of a conformation that is still changing (Figure 6A). The conformational instability was corroborated by an unstable Rg, which continuously decreased during all the trajectory (Figure 6B). Therefore, MEP_6 was discarded of further analysis as it did not reach conformational convergence.
Figure 6 Convergence analysis of the MD simulation. Ca-RMSD values and radius of gyration (Rg) of MEP_6 (A, B) and MEP_12 (C, D) during a simulation of 1.2 μs. Inset plots in (C, D) show the last 500 ns where the RMSD and Rg converged. (E) Coverage (%) of the five most populated clusters obtained from the RMDS-based clustering. (F) Structural alignment between the AlphaFold2 model (gray) and the centroid of the most populated cluster (in orange) of MEP_12.
In contrast, the RMSD and Rg of MEP_12 reached a steady state during the last 500 ns (Figures 6C, D) suggesting conformational convergence of the trajectory. It was verified by an RMSD-based clustering of frames of the last 500 ns, using 2.5 Å as threshold. More than 70% of the conformations adopted during this lapse were highly similar among them and were grouped in Cluster 1, confirming the convergence (see Figure 6E; Supplementary Table 8).
As the centroid of Cluster 1 represents the preferred conformation of MEP_12, it was retrieved and considered as the most probable average structure of MEP_12 in solution. Remarkably, although displaced due to the MD, the β-strands formed by the same residues are present both in the centroid and in the initial AphaFold2 model (Figure 6F).
Evaluating the structural quality
A two-steps refinement of the centroid was performed using ModRefiner and NAMD, obtaining a structure that remained close to the original conformation (Supplementary Figure 3B). The quality of the refined structure was assessed using the web servers MolProbity, ProSA and ERRAT. We observed that all the residues in the model were correctly oriented, as the Ramachandran plot indicated that 100% of residues were in allowed regions, not showing any outliers (Figure 7A). Furthermore, 96.6% of the residues were observed in favored regions. ProSA predicted a Z-score of -6.55 for our model, which is within the range of values observed in other proteins of similar length, obtained from NMR or X-ray crystallography (Figure 7B). An ERRAT overall score of 94.531 was obtained, indicating that this model has a good resolution. Notably, only windows centered at residues 96 and 127 had error values above 99% (Figure 7C). All the quality statistics were better in the refined structure than the original model (Supplementary Figure 4).
Figure 7 Quality assessment of the refined centroid of MEP_12 most populated cluster. (A) Ramachandran plot of MEP_12 centroid, indicating the percentage of residues in favored (light blue) and allowed (blue) regions. (B) Scatterplot of the Z-scores of MEP_12 centroid (black dot) and structures with experimental evidence obtained from NMR (blue) and X-ray crystallography (light blue). (C) ERRAT plot of MEP_12 centroid. Bars represent the error value (white: error < 95%, yellow: 95%, error < 99%, red: error > 99%) of a nine-residue sliding window. The overall quality factor indicates the percentage of protein residues with error values lower than 95%.
Analysis of MEP_12 trajectory
An ensemble approach was followed to assign the secondary structures of the aminoacidic residues. The centroids of the five most populated clusters were compared, as they are distinct and representative conformations of MEP_12 in solution. Then, sets of residues forming the same secondary structure in all the centroids were retrieved. We reasoned that if those sets are folded in the same way, even in different but representative conformations, there is a high chance that they might adopt that conformation when synthesized. Notably, comparing the five centroids, we found that 12 β-strands were formed by the same sets of residues (Figure 8A). The number of residues forming these β-strands represent 41.33% of the MEP_12. The other residues were mostly in loops in all the centroids evaluated. No consistent alpha helices were found.
Figure 8 Structural characteristics of MEP_12. (A) Alignment of the 3D structures of MEP_12 from all the centroids of the five most populated clusters. Cyan, purple, orange, green, and pink cartoons correspond to the centroids of clusters 1, 2, 3, 4 and 5 respectively. The twelve β-strands in the structure (left) are represented as arrows in the sequence (right). (B) Violin plots showing the distribution of Ca-RMSF values. The lowest Ca-RMSF distributions are colored in green. (C) Mean SASA by epitope. The error bars represent 95% confidence intervals. (D) Modeled structure of MEP_12, showing epitopes “3” and “6” in red and green, respectively.
By analyzing the Cα-RMSF, epitopes “3”, “8” and “10” were found to be the least flexible ones (Figure 8B). The rigidity of “3” can be explained because part of this epitope forms a parallel β-sheet with a portion of epitope “6” (Figure 8D). Next, the mean SASA over the steady state of the MD was computed, excluding epitope “4” as it was not stable over the MD (Supplementary Figure 5). Epitopes “3”, “10”, and “8” showed the lowest mean SASA values, in decreasing order (Figure 8C). Altogether, this indicates that epitope “3” is the most accessible among the rigid epitopes of MEP_12.
Evaluation of MEP_12 innate response potential
Blind docking of MEP_12 against TLR1/TLR2 resulted in four clusters containing 3.6% of the refined structures. The Haddock-scores of these clusters showed overlapping error bars (one standard deviation) (Figure 9A), not allowing to discern which cluster is the best.
Figure 9 Protein docking of MEP_12 against TLR1/TLR2 and TLR4/MD-2. Bar plots summarizing the dockings (A) MEP_12-TLR1/TLR2 and (C) MEP_12-TLR4/MD2. The number of structures and the mean Haddock-score by cluster are shown in orange and blue, respectively. Whiskers in the mean Haddock-score bars represent one standard deviation. Structural representation of the most favorable binding mode of the cluster selected, of dockings (B) MEP_12-TLR1/TLR2, and (D) MEP_12-TLR4/MD2; showing the binding energy computed in PRODIGY. MEP_12 is colored in cyan; TLR1 and TLR4 in green; and TLR2 and MD2 in orange. Glycosylated residues and attached glycans (cyan) are shown as sticks, and non-carbon atoms are colored following the CPK convention. The inset plots show a closeup of the residues involved in polar interactions (as cyan and green sticks). The hydrogen bonds and salt bridges are represented by blue and red lines, respectively, connecting the interacting atoms which are labeled indicating amino acid and position.
The TLR1/TLR2 crystal structure had missing residues in the N-termini of both TLRs. As the cluster “1” contained binding modes of MEP_12 interacting with TLR2 in a region that might be occupied by those missing residues, it was discarded (Supplementary Figures 6A, B). All the other clusters showed MEP_12 docked to similar regions of TLR1 (Supplementary Figure 6A). The best binding mode of the cluster “2” showed contacts with epitopes “3”, “4” and “6”. It showed four hydrogen bonds and one salt bridge that stabilized the interaction (Figure 9B). The best binding modes of the cluster “3” and “4” showed contacts were with epitopes “6” and “0”; and “0”, “1” and “9”, respectively (Supplementary Table 9).
Blind docking of MEP_12 against TLR4/MD-2 resulted in six clusters containing 5.2% of the refined structures. The Haddock-scores of the best four clusters had overlapping error bars (one standard deviation) (Figure 9C). From this set, only cluster “5” contained solutions in which MEP_12 interacted exclusively with the co-receptor MD-2, while the others showed MEP_12 interacting with any of the two chains of TLR4 (Supplementary Figure 6C). Therefore, the solutions of cluster “5” were discarded.
Noteworthy, unlike TLR1/TLR2, the TL4/MD-2 complex is symmetric (Supplementary Figure 6D). In that sense, we noticed that the binding modes of clusters “3” are the same but reflected along the symmetry axis (Supplementary Figure 6D). Thus, these two can be seen as the same cluster. The best binding mode of the cluster “3-6” showed contacts with epitopes “0”, “1”, “3” and “10”. It showed 3 hydrogen bonds and 2 salt bridges and had a binding energy of -9.9 kcal/mol (Figure 9D). The best binding mode of the cluster “1” showed contacts with epitopes “1”, “5”, “8 and “9” (Supplementary Table 9).
Simulating the immune response
The simulation of immunizing with the multi-epitope protein showed that the second and third doses generated significantly higher responses than the first one, as expected. HTL populations were higher in the second and third dose than in the first one, suggesting activation of the memory cells (Figure 10B). However, the CTL population was higher during the first dose, indicating an early immune response (Figure 10C). The B-cell subpopulations, including memory B-cells and Plasma B lymphocytes (PLB cells), showed considerable expansion after each dose reaching the highest peak at day 60 (Figure 10D). After the challenge, the response generated by B-cells was the highest in the simulation, indicating an appropriate production of antibodies (Figure 10E). Moreover, an early production of IgM was detected, which changed to IgG after the antigen administration (fourth response) (Figure 10F). Regarding the innate system, NK (natural killer) activity was found to be constant during the three doses and showed increased activity during the challenge (Figure 10G).
Figure 10 Results of the C-IMMSIM simulation for 350 days. (A) Schematic illustration of the vaccination trial, with three doses of the multi-epitope vaccine (green box) at days 3, 30, 60; and the challenge (red box) at day 111. Dynamics of (B) HTL and (C) CTL populations. Memory and not memory cells are represented with light-blue and green lines, respectively. (D) B cell populations, grouped by immunoglobulin isotype production. (E) Population of Plasma B lymphocytes producing IgM, IgG1 and IgG2. (F) Antigen concentration and relative antibodies responses. (G) Total population of NK cells. The first, second and third doses were inoculated.
Codon optimization of MEP_12 DNA sequence
The optimized sequence of the 450 nt cDNA vaccine construct had a Codon Adaptation Index (CAI) of 0.772, and a %GC of 48.9%. This sequence contains 111 restriction sites for 55 commercial enzymes (Supplementary Table 10).
Discussion
C. perfringens is one of the most common food-poisoning causing bacteria, causing a major impact on human health worldwide. It is also a reported biological warfare agent. These reasons make developing a vaccine an urgent matter. Most experimental studies have focused on C. perfringens toxins to find vaccines protective against gas gangrene. It has been shown that the alpha toxin protect against C. perfringens type A (67, 68). This toxin has been produced in endospores of recombinant Bacillus subtilis and tested as vaccine in mice, resulting in protection against gas gangrene (69). The β-toxin has also been evaluated as a possible vaccine against C. perfringens type C in piglets (70). However, none of them have been tested in humans. There is still no approved vaccine for humans nowadays (5).
While other studies had focused only on specific proteins of C. perfringens (71, 72), we consider that analyzing the whole proteome could reveal novel more immunogenic proteins. We followed three approaches to propose vaccine candidates: a whole-protein based vaccine, a protein-subunit based vaccine, and a multi-epitope protein vaccine. The first approach consist in identifying natural proteins of the pathogen enriched with both CTL and HTL epitopes to be used as recombinant vaccines. With the second approach, we further studied the sequence and structure of the main toxins, identifying regions exposed to the extra-cellular medium and rich in HTL epitopes that could generate immune responses against gas gangrene. Lastly, in the third approach, we identified the best CTL and HTL epitopes of the whole proteome and performed several immunoinformatics and structural analyses to assemble them into a multi-epitope construct, aiming to combat C. perfringens infection. The immunogenic proteins obtained by computational methods in these three approaches may represent good vaccine candidates, and they could be used as well for the development of immunodiagnostic tests (73, 74).
The immunoinformatic exploration of the whole proteome resulted in 429809 CTL- and 121450 HTL- predicted epitopes (Supplementary Table 3), which could be further used in both computational and experimental studies seeking a better understanding of the immunogenic characteristics of C. perfringens proteins, the developing of diagnostic tests, and the design of peptide-based vaccines and therapies. A strong positive correlation between the number of CTL- and HTL-epitopes predicted with the protein length was found, indicating that longer proteins contain a higher number of epitopes. This has been previously reported by an independent study in viral proteins, showing that protein length is positively correlated with its number of CD8+ T-cell epitopes within (75), supporting our observation. Another observation is that the 9-mer CTL-epitopes were more promiscuous, binding more HLA-I supertype alleles (Supplementary Figure 1B). These two observations might be correlated with the fact that most of the experimental data, and consequently most software training data, is based on 9-mer CTL-epitopes. A similar pattern was observed for 15 aa HTL-epitopes. These observations match with was is biologically expected, as they are the canonical epitope sizes which have been commonly observed in MHC-epitope experimental data (29, 76, 77).
The Collagenase A (UniProt ID: P43153), Exo-alpha-sialidase (UniProt ID: Q8XMY5), alpha n-acetylglucosaminidase (UniProt ID: Q8XM24) and hyaluronoglucosaminidase (UniProt ID: P26831) were identified as the top 4 proteins with more CTL- and HTL-epitopes. The Collagenase A, encoded by the gene colA, is an extracellular proteolytic enzyme that degrades extracellular matrix and plays a role in the pathogenesis of gangrene (78). This enzyme has hemorrhagic and dermonecrotic activities, and intravenous inoculation of Collagenase A has shown to be lethal for mice (79). However, little is known about its role as immunogen. The Exo-alpha-sialidase, encoded by the gene nanJ, is the largest of the three sialidases produced by C. perfringens. It is involved in the intestinal virulence by increasing the binding affinity of the sialidases to their targets, enhancing pathogen adherence to intestinal cells. Sialidase inhibitors, such as Siastatin B or N-acetyl-2,3-dehydro-2-deoxyneuraminic acid (NADNA), have been tested in vitro and proposed as possible therapeutics against intestinal infection of C. perfringens; however, in vivo validation is still needed (80, 81). The hyaluronoglucosaminidase, encoded by the gene nagH, is a carbohydrate-active enzyme that acts on the connective tissue during the gas gangrene (82, 83). The alpha n-acetylglucosaminidase is an enzyme with strong preferences for carbohydrate motifs, found on the class III mucins within the gastric mucosa (81). Although there is no study testing alpha-acetylglucosaminidase, vaccination with beta-acetylglucosaminidase has shown to generate protection against necrotic enteritis in chickens (84). Similarly, a peptide-based vaccine comprising of several epitopes from these mucolytic enzymes produced reduction of lesions caused by necrotic enteritis in chicks. Altogether, these studies suggest that these four enzymes can be used as potential protein-based vaccines (85). The use of epitope-enriched proteins has been previously suggested as a strategy for the rational selection of immunogens, considering B-cell epitopes (86) and T-cell epitopes (73). Proteins with high epitope density are expected to maximize the probability of immune cells presentation and activation. Protein-based vaccines have been widely studied and used, with some advantages like having relatively low production costs, and not causing severe side effects unlike attenuated vaccines (87, 88).
Next, we studied the six most studied C. perfringens type A toxins in more detail. All of them, except for the beta2 toxin, showed at least 10 HTL-epitopes by HLA-II supertype allele (Figure 2C). These observations suggest that toxins are generally good candidates for protein-based vaccines, as shown in previous experimental studies (89–93). Our results support the idea that toxins could generate an appropriate humoral response to protect, mainly, against gangrene or histotoxic damage before infection (67, 94). Noteworthy, we show the relevance of not only predicting promiscuous epitopes but studying their location within the proteins as well. For instance, we found that the most promiscuous epitopes of enterotoxin D are in SH3B domains, which are correlated with promoting pathogen survival and invasion by binding to host receptors. This indicates that not all the extension, but specific regions of the protein are rich in epitopes. The use of non-toxic domains of toxins has shown to be advantageous for vaccine development as they present similar immunogenicity than the entire toxins without undesired toxicity (95–97). For example, the use of the non-toxic domain HC50 of the Botulinum neurotoxin type A induces a strong anti-HC50 IgG antibody response, neutralizing the circulating neurotoxin in mice (98). And, immunization with the non-toxic fragment of the C-domain of phospholipase C produces antibodies against this toxin, providing protection against gas gangrene in mice (5). Similarly, our analysis suggests that the non-toxic domain of Perfringolysin O is a promissory candidate for experimental testing as a protein-subunit based vaccine.
Numerous studies have used bioinformatics software to propose epitope-based vaccines against C. perfringens. A previous study predicted B-cell epitopes in the epsilon toxin of C. perfringens types B and D. However, they are not the main toxinotypes of C. perfringens affecting human health (99) and the epsilon toxin is not present in C. perfringens type A. Another study predicted 15 unique epitopes in the toxin NetF using mouse rather than human MHC alleles. Although the authors suggest that NetF could be a good vaccine candidate and the epitopes found can be used in multi-epitope vaccines, this needs to be evaluated in humans (100). Furthermore, the NetF protein has been associated with gastroenteritis and enterocolitis in canine and foals (100), but its role in humans is not well studied. Another study has predicted T- and B-cell epitopes in the fructose 1,6-bisphosphate aldolase (101). Nonetheless, they did not design a multi-epitope construct, which might be more effective than single-epitope vaccines (18, 102). Other bioinformatic studies of C. perfringens aimed to find only candidate epitopes in toxins or few proteins (71, 103). However, exploring the whole proteome, as done in the present study, allows the computational identification of a broader set of potential epitopes that may trigger better immune responses, as seen in other pathogens like SARS-CoV-2 (104).
A rational workflow was elaborated to design a multi-epitope vaccine, consisting of (1) using different immunoinformatics tools to predict epitopes from the whole proteome of C. perfringens (2), considering the epitope location in the structure of their native proteins, and (3) merging and assembling nested epitopes in potential constructs, and (4) evaluating the structure and dynamics of the constructs by MD. Of the programs available for T-cell epitope prediction, we opted to use NetMHCpan and MHCFlurry to predict CTL-epitopes, and NetMHCIIpan for HTL-epitopes, as they have shown the best performance against competitors (27, 28). Proteasome cleavage and TAP transport are also important processes of the intracellular presentation pathway. Predictors of proteasome developed, being the most popular netChop and ProteaSMM. However, benchmarking studies have shown that these predictors still need to improve their sensitivity and specificity (105). Moreover, it has been reported that current cleavage predictions based on in vitro data do not correlate with in vivo data (106). Therefore, we decided not to discard constructs based on these predictors. TAP transport predictors face the same limitations, with very few predictors available and the lack of unbiased benchmarking studies (107).
Targeting HLA supertype alleles maximize the potential usability of our multiepitope in different populations. This choice was a tradeoff between wider usability and higher local specificity, and we decided to prioritize the first one as currently C. perfringens infection is a widely spread pathogen without universal vaccine. Higher local specificity would be more relevant in other scenarios, such as if broad-range vaccines become available, or if the disease becomes endemic, or if there is a high incidence of cases in a specific geographic location, among others. This will imply a change in the HLA allele selection strategy, giving more importance to certain HLA alleles abundant in the specific geographic areas affected by the disease.
Keeping the synthetic protein small is important to reduce synthesis and production costs (i.e. facilitate its purification in inclusion bodies), as well as to prevent toxicity in the organism used for production (98, 108). Using nested epitopes allows to maximize the number of epitopes in the multi-epitope construct without increasing the construct length. Recent studies have used nested epitopes to construct multi-epitope proteins, but experimental testing is still needed to verify the advantage provided by this strategy (104, 109). From the set of 112714 nested epitopes, several filters were applied to select the best candidates. One of these filters is to prioritize epitope promiscuity, which has been associated with contributing to epitope immunodominance, as promiscuous epitopes are recognized by multiple HLA alleles (110). The use of promiscuous epitopes allows to cover a larger number of HLA-alleles (i.e. a larger proportion of the target population) without increasing the number of epitopes in the vaccine construct (111). We also filtered by epitope conservation among all the reported protein variants of C. perfringens. This may result in covering a broader range of current, and potentially future, pathogen variants. Thus, a high epitope conservation could lead to a better protection as the immune response tends to focus on conserved epitopes when individuals are exposed to different strains (112, 113). In our construct, the conservation analysis of the overlapped nested epitopes showed 100% conservation. Thus, even though C. perfringens type A is the most common cause of gas gangrene, our construct may confer extended protection against other toxinotypes as well. Our construct of 150 aa is made of 9 overlapped HTL-CTL epitopes, comprising 24 HTL-epitopes containing 34 CTL-epitopes. This suggests a better cellular immune response than previous multiepitope constructs, which were longer and had less epitopes in its sequence (114). Whilst the study of Aldakheel et al. (114) attempted to target all HLA-I and -II alleles, we opted to focus on the HLA supertype alleles. This allowed us to need less epitopes in our design to match all the target alleles. Additionally, having less HLAs to target allows epitopes with better immunogenicity to be selected from the prediction. Thus, we successfully covered all the HLA supertype alleles, while the previous study just covered 10 of the 12 HLA-I and 5 of the 8 HLA-II supertypes alleles. Additionally, our design is about 1/3 in length (150 versus 415 aa), due to our strategy of using nested HTL-CTL epitopes in extracellularly exposed regions.
The 9 overlapped nested epitopes in our construct belong to the FTsX domain-containing protein (UniProt ID: Q8XM39), the cardiolipin synthase (UniProt ID: P0C2E2), the Polysaccharide synthase 2 domain-containing protein (UniProt ID: Q8XN75), the probable hemolysin-related protein (UniProt ID: Q8XPD3), the TraG-D C domain-containing protein (UniProt ID: Q93M96), the spore germination protein KA (UniProt ID: Q8XMP0) and CPE0011 (UniProt ID: Q8XPF2). Our prediction indicates that all of them are membrane proteins with a high content of HTL- and CTL-epitopes. Moreover, these proteins present functions related with the cell membrane structure (Cardiolipin synthase), survival (Spore germination protein KA) and host colonization (Hemolysin-related protein) (115–117). External regions of transmembrane proteins involved in the infective process are frequently considered vaccine targets, as antibodies can efficiently neutralize them (118).
Assembling epitopes to make a new protein can be performed with or without linkers, or adding adjuvant sequences (119). Multiple linker and adjuvant sequences have been reported in the literature (120–122). Linkers are used to reduce the occurrence of neoepitopes (123), with the downside of increasing the construct length and the cost of protein synthesis. There are very few software to optimize the use of linkers (112, 124, 125), and this approach is frequently used because finding an appropriate epitope sorting without linkers is time-consuming and computationally demanding. Here, we used a novel algorithm and software based on graph theory (EpiSorter), assembling all the epitopes without the need of linkers and without unwanted neoepitopes shared with human proteins, avoiding autoimmune or tolerance responses. Thus, we believe that using linkers can be avoided and, instead, we recommend prioritizing the number and promiscuity of the epitopes selected.
Several immunoinformatics studies have reported structural models of their multi-epitope constructs using only one or two software, complemented with refinement steps (126–128). Nonetheless, modeling novel proteins is a difficult task as they usually do not have close homologues. The situation becomes even more complex if the novel protein to model consists of highly flexible linear epitopes. To tackle this problem, we employed the state-of-the-art protein-structure prediction software AlphaFold2 (47). Nowadays, this is the first time this software has been used to predict the 3D structure of multi-epitope constructs.
The high proportion of residues of the multi-epitope structures with low pLDDT suggests that the models obtained by AlphaFold were not completely reliable. However, it also says that the multi-epitope protein is highly disordered and flexible, characteristics that may favor its binding to immune proteins. Disordered regions in proteins are characterized by a lack of a stable tertiary structure and high flexibility (129). Moreover, it has been pinpointed that flexibility in protein antigens positively affects their binding affinity (130).
The multi-epitope construct selected did not present Ramachandran outliers and shows an ERRAT score comparable to scores of high-resolution structures (131). Moreover, these quality statistics are better than the ones obtained in various similar studies designing multi-epitope vaccines (119, 127, 132–135). This reaffirms the quality and consistency of our structural approach, although better modeling software is still needed for multi-epitope proteins.
Docking was performed against TLR1/TLR2 and TLR4/MD2, as they have been widely reported to be important in the innate defense against C. perfringens infection in chickens and mice (136), although little is known in humans. The lipopeptide and lipopolysaccharide binding sites of TLR1/TLR2 and TLR4/MD-2, respectively, have been structurally characterized (137, 138). As there is neither prior experimental information about the binding sites for MEPs, nor which epitope of our novel MEP might interact with the TLRs, we considered blind docking the most appropriate approach. The docking simulation showed binding between our MEP_12 and the TLR1/TLR2, suggesting an innate immune response in addition to the adaptive immune response generated by the nested epitopes. The docking analyses also showed that epitopes “3” and “10” interact with TLR1/TLR2. These epitopes are relatively rigid, which may result in a more favorable binding as shown in previous studies. Epitopes tend to be more rigid than the rest of the protein (31). And, from a thermodynamic point of view, rigid surfaces have less entropic penalty when interacting with other proteins, resulting in tighter bindings.
The binding region of MEP_12 in TLR4/MD-2 is similar to the one observed in a previous study, where a SARS-CoV-2 candidate vaccine was docked against this receptor (119), even though different protocols were followed. This suggest that this region of TLR4/MD-2 is where antigenic proteins bind. However, experimental studies are needed to validate this observation. Noteworthy, the MEP_12 is smaller and establishes fewer interactions with TLR4/MD-2 than the SARS-CoV-2 candidate vaccine but achieves a similar binding energy, differing in just in 0.4 kcal/mol. This indicates that our design is a more efficient TLR4/MD-2 binder and correlates with the finding that MEP_12 is flexible and disordered. These are two desired characteristics, as they make antigens more efficient at binding immune proteins (33).
Immunization with MEP_12 was simulated, showing that our vaccine candidate can elicit immune responses to clear the antigen on secondary exposure. The challenge with C. perfringens proteins after three vaccine injections induced higher levels of IgG than IgM. IgM is the principal isotype in the first response, while IgG is predominant in secondary responses, representing specific pathogen recognition (139). Also, the increased production of immunoglobulins by plasma B cells indicated that memory of C. perfringens proteins is present in the immune system (140). Finally, a gene encoding MEP_12 was designed optimizing its sequence for expression and production in E. coli, as this organism has been widely used to produce recombinant vaccines (141–143). Additionally, the design can be adapted to other organisms following the methodology we described.
In summary, in the current study, we have performed a thorough immunoinformatic exploration of the whole proteome to generate vaccine candidates against C. perfringens. Three approaches were followed to identify (1) the most immunogenic proteins (2), immunogenic non-toxin domains of toxins, and (3) the design a novel protein with the best HTL-CTL nested epitopes, expected to trigger both adaptive and cellular immune responses. These resulted in promising candidates for further experimental in vitro and in vivo studies. These candidates may help in the prevention of necrotic enteritis, as well as other human diseases caused by C. perfringens.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Author contributions
Conceptualization, LFS and DR. Methodology, LFS, DR, GJ-A, and DR. Software, LFS, DR, and GJ-A. Validation, LS and DR. Formal Analysis, LFS and DR. Investigation, LFS, AR, GJ-A, and DR. Resources, DR. Data curation, DR. Writing – original draft preparation, LFS, DR, GJ-A, and DR. Writing-review and editing, LFS, AR, and DR. Visualization, LFS and GJ-A. Supervision, DR. Project administration, DR, RLL, YS, and CO-R. Funding acquisition, DR, RLL, YS, and CO-R. All authors contributed to the article and approved the submitted version.
Funding
The publication of this work was funded by the grant N°2264962 SNIP N°317435 “Creación del Centro de Promoción de la Investigación y Transferencia Tecnológica” of the University “Toribio Rodriguez de Mendoza de Amazonas”.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2022.942907/full#supplementary-material
Supplementary Figure 1 | Density and promiscuity of the CTL- and HTL-epitopes. (A) Epitope density in proteins by epitope length (CTL-epitopes: 8, 9 and 10 aa, HTL-epitopes: 15 aa). (B) The bar plot represents in the X-axis the number of epitopes identified, for each epitope length (Y-axis). The color indicates how many HLA supertype alleles each epitope binds. In the bar plots (C) (for CTL-epitopes) and D (for HTL-epitopes), the Y-axis represents the number of epitopes that can bind a certain number of HLA supertype alleles (X-axis). Panels (E, F) show the distribution of the number and density, respectively, of HTL-epitopes in the proteins of each of the levels of evidence indicated in the X-axis.
Supplementary Figure 2 | Conservation of the nested candidate epitopes across C. perfringens toxinotypes. Each subfigure shows the alignment of the nested epitope, the epitope ID and the protein name.
Supplementary Figure 3 | (A) Mean pLDDT of the 22 multi-epitope (MEP) candidates. (B) Structural alignment between the centroid of the most populated cluster of MEP_12, before (orange) and after (cyan) refinement.
Supplementary Figure 4 | Quality assessment of the centroid of the most populated cluster of MEP_12 before refinement. (A) Ramachandran plot for the general case. The percentage of residues in preferred (light blue) and allowed (blue) regions are shown. (B) Scatterplot of the Z-scores of the centroid (black dot) and protein structures with experimental evidence (NMR: blue, X-ray crystallography: light blue). (C) ERRAT plot of the centroid. Bars represent the error value (white: error < 95%, yellow: 95% error < 99%) of a nine-residue sliding window. The overall quality factor indicates the percentage of protein residues with error values lower than 95%.
Supplementary Figure 5 | SASA values per epitope. Values over the last 500 ns of convergent simulation are shown for epitopes “0” to “10” (excluding “4”) in panels A to I, respectively. The SASA of all epitopes shown but “5” and “9” achieved convergence. Epitope “8” only achieved convergence during the last 400 ns (enclosed in a red box, close-up in inset plot). So, only this region was considered for its mean SASA computations.
Supplementary Figure 6 | Comparison of all plausible clusters of docking of MEP_12 against TLR1/TLR2 and TLR4/MD-2. The best binding modes of clusters having Haddock-scores with overlapping error bars (standard deviations) are shown for the dockings (A) MEP_12-TLR1/TLR2, and (C) MEP_12-TLR4/MD2. TLR1 and TLR2 chains are colored in green and orange, respectively. Attached glycans are shown as sticks in cyan. In (A), the best binding modes of MEP_12 from clusters “1”, “2”, “3”, and “4” are colored in red, cyan, gray, and blue, respectively. In (C), the best binding modes of MEP_12 from clusters “1”, “3”, “5”, and “6” are colored in gray, blue, red, and cyan, respectively. (B) Enlarged view of the best binding mode of cluster “1” (red cartoon with transparent surface) in the docking MEP_12-TLR1/TLR2. The N-terminal residues missing in TLR1/TLR2 structure (in gray) were completed by structural alignment to a TLR2 model from the AlphaFold Protein 1300 Database. These interact with MEP_12 cluster “1”. (D) Best binding modes from cluster “3” and “6” from the docking MEP_12-TLR4/MD2. The black line separating the two sides of TLR4/MD-2 represents a symmetry axis. The binding modes of left side become equivalent to the right side after a 180° right-handed rotation, as shown in (E).
References
1. Hassan KA, Elbourne LDH, Tetu SG, Melville SB, Rood JI, Paulsen IT. Genomic analyses of clostridium perfringens isolates from five toxinotypes. Res Microbiol (2015) 166(4):255–63. doi: doi: 10.1016/j.resmic.2014.10.003
2. Grass JE, Gould LH, Mahon BE. Epidemiology of foodborne disease outbreaks caused by clostridium perfringens, United States, 1998–2010. Foodborne Pathog Dis (2013) 10(2):131–6. doi: 10.1089/fpd.2012.1316
3. Janik E, Ceremuga M, Saluk-Bijak J, Bijak M. Biological toxins as the potential tools for bioterrorism. Int J Mol Sci (2019) 20(5):E1181. doi: 10.3390/ijms20051181
4. Buboltz JB, Murphy-Lavoie HM. Gas gangrene. Treasure Island (FL: StatPearls Publishing (2022). Available at: http://www.ncbi.nlm.nih.gov/books/NBK537030/.
5. Titball RW. Clostridium perfringens vaccines. Vaccine (2009) 27 Suppl 4:D44–47. doi: 10.1016/j.vaccine.2009.07.047
6. Barras V, Greub G. History of biological warfare and bioterrorism. Clin Microbiol Infect Off Publ Eur Soc Clin Microbiol Infect Dis (2014) 20(6):497–502. doi: 10.1111/1469-0691.12706
7. Sim K, Shaw AG, Randell P, Cox MJ, McClure ZE, Li MS, et al. Dysbiosis anticipating necrotizing enterocolitis in very premature infants. Clin Infect Dis Off Publ Infect Dis Soc Am (2015) 60(3):389–97. doi: 10.1093/cid/ciu822
8. Heida FH, van Zoonen AGJF, Hulscher JBF, te Kiefte BJC, Wessels R, Kooi EMW, et al. A necrotizing enterocolitis-associated gut microbiota is present in the meconium: Results of a prospective study. Clin Infect Dis (2016) 62(7):863–70. doi: 10.1093/cid/ciw016
9. Uzal FA, Freedman JC, Shrestha A, Theoret JR, Garcia J, Awad MM, et al. Towards an understanding of the role of clostridium perfringens toxins in human and animal disease. Future Microbiol (2014) 9(3):361–77. doi: 10.2217/fmb.13.168
10. Navarro MA, McClane BA, Uzal FA. Mechanisms of action and cell death associated with clostridium perfringens toxins. Toxins (2018) 10(5):212. doi: 10.3390/toxins10050212
11. Kammerl IE, Meiners S. Proteasome function shapes innate and adaptive immune responses. Am J Physiol Lung Cell Mol Physiol (2016) 311(2):L328–336. doi: 10.1152/ajplung.00156.2016
12. Yao Y, Yang H, Shi L, Liu S, Li C, Chen J, et al. HLA class II genes HLA-DRB1, HLA-DPB1, and HLA-DQB1 are associated with the antibody response to inactivated Japanese encephalitis vaccine. Front Immunol (2019) 10. doi: 10.3389/fimmu.2019.00428
13. Moyle PM, Toth I. Modern subunit vaccines: Development, components, and research opportunities. ChemMedChem (2013) 8(3):360–76. doi: 10.1002/cmdc.201200487
14. Uzal FA, Wong JP, Kelly WR, Priest J. Antibody response in goats vaccinated with liposome-adjuvanted clostridium perfringens type d epsilon toxoid. Vet Res Commun (1999) 23(3):143–50. doi: 10.1023/A:1006206216220
15. Tian JH, Patel N, Haupt R, Zhou H, Weston S, Hammond H, et al. SARS-CoV-2 spike glycoprotein vaccine candidate NVX-CoV2373 immunogenicity in baboons and protection in mice. Nat Commun (2021) 12(1):372. doi: 10.1038/s41467-020-20653-8
16. Heath PT, Galiza EP, Baxter DN, Boffito M, Browne D, Burns F, et al. Safety and efficacy of NVX-CoV2373 covid-19 vaccine. N Engl J Med (2021) 385(13):1172–83. doi: 10.1056/NEJMoa2107659
17. Koff WC, Schenkelberg T. The future of vaccine development. Vaccine (2020) 38(28):4485–6. doi: 10.1016/j.vaccine.2019.07.101
18. Zhang L. Multi-epitope vaccines: a promising strategy against tumors and viral infections. Cell Mol Immunol (2018) 15(2):182–4. doi: 10.1038/cmi.2017.92
19. Lennerz V, Gross S, Gallerani E, Sessa C, Mach N, Boehm S, et al. Immunologic response to the survivin-derived multi-epitope vaccine EMD640744 in patients with advanced solid tumors. Cancer Immunol Immunother (2014) 63(4):381–94. doi: 10.1007/s00262-013-1516-5
20. Doehn C, Esser N, Pauels HG, Kießig ST, Stelljes M, Grossmann A, et al. Mode-of-Action, efficacy, and safety of a homologous multi-epitope vaccine in a murine model for adjuvant treatment of renal cell carcinoma. Eur Urol (2009) 56(1):123–33. doi: 10.1016/j.eururo.2008.05.034
21. Romeli S, Hassan SS, Yap WB. Multi-epitope peptide-based and vaccinia-based universal influenza vaccine candidates subjected to clinical trials. Malays J Med Sci MJMS (2020) 27(2):10–20. doi: 10.21315/mjms2020.27.2.2
22. Majidiani H, Dalimi A, Ghaffarifar F, Pirestani M. Multi-epitope vaccine expressed in leishmania tarentolae confers protective immunity to toxoplasma gondii in BALB/c mice. Microb Pathog (2021) 155:104925. doi: 10.1016/j.micpath.2021.104925
23. Romano P, Giugno R, Pulvirenti A. Tools and collaborative environments for bioinformatics research. Brief Bioinform (2011) 12(6):549–61. doi: 10.1093/bib/bbr055
24. Fleri W, Paul S, Dhanda SK, Mahajan S, Xu X, Peters B, et al. The immune epitope database and analysis resource in epitope discovery and synthetic vaccine design. Front Immunol (2017) 8:278. doi: 10.3389/fimmu.2017.00278
25. Dhanda SK, Mahajan S, Paul S, Yan Z, Kim H, Jespersen MC, et al. IEDB-AR: immune epitope database-analysis resource in 2019. Nucleic Acids Res (2019) 47(W1):W502–6. doi: 10.1093/nar/gkz452
26. Reynisson B, Barra C, Kaabinejadian S, Hildebrand WH, Peters B, Nielsen M. Improved prediction of MHC II antigen presentation through integration and motif deconvolution of mass spectrometry MHC eluted ligand data. J Proteome Res (2020) 19(6):2304–15. doi: 10.1021/acs.jproteome.9b00874
27. Reynisson B, Alvarez B, Paul S, Peters B, Nielsen M. NetMHCpan-4.1 and NetMHCIIpan-4.0: Improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res (2020) 48(1):449–54. doi: 10.1093/nar/gkaa379
28. O’Donnell TJ, Rubinsteyn A, Laserson U. MHCflurry 2.0: Improved pan-allele prediction of MHC class I-presented peptides by incorporating antigen processing. Cell Syst (2020) 11(1):42–48.e7. doi: 10.1016/j.cels.2020.06.010
29. Zhao W, Sher X. Systematically benchmarking peptide-MHC binding predictors: From synthetic to naturally processed epitopes. PloS Comput Biol (2018) 14(11):e1006457. doi: 10.1371/journal.pcbi.1006457
30. Soto LF, Requena D, Bass JIF. Epitope-evaluator: an interactive web application to study predicted T-cell epitopes. PLoS One (2022)17(8):e0273577. doi: 10.1371/journal.pone.0273577
31. Kim DG, Choi Y, Kim HS. Epitopes of protein binders are related to the structural flexibility of a target protein surface. J Chem Inf Model (2021) 61(4):2099–107. doi: 10.1021/acs.jcim.0c01397
32. Alaofi AL. Probing the flexibility of zika virus envelope protein DIII epitopes using molecular dynamics simulations. Mol Simul (2020) 46(7):541–7. doi: 10.1080/08927022.2020.1738424
33. MacRaild CA, Richards JS, Anders RF, Norton RS. Antibody recognition of disordered antigens. Struct Lond Engl (2016) 24(1):148–57. 1993. doi: 10.1016/j.str.2015.10.028
34. Yuan X, Qu Z, Wu X, Wang Y, Liu L, Wei F, et al. Molecular modeling and epitopes mapping of human adenovirus type 3 hexon protein. Vaccine (2009) 27(37):5103–10. doi: 10.1016/j.vaccine.2009.06.041
35. Verherstraeten S, Goossens E, Valgaeren B, Pardon B, Timbermont L, Haesebrouck F, et al. Perfringolysin O: The underrated clostridium perfringens toxin? Toxins (2015) 7(5):1702–21. doi: 10.3390/toxins7051702
36. Goossens E, Verherstraeten S, Valgaeren BR, Pardon B, Timbermont L, Schauvliege S, et al. Toxin-neutralizing antibodies protect against clostridium perfringens-induced necrosis in an intestinal loop model for bovine necrohemorrhagic enteritis. BMC Vet Res (2016) 12(1):101. doi: 10.1186/s12917-016-0730-8
37. Gonzalez-Galarza FF, McCabe A, Melo dos Santos EJ, Jones AR, Middleton D. A snapshot of human leukocyte antigen (HLA) diversity using data from the allele frequency net database. Hum Immunol (2021) 82(7):496–504. doi: 10.1016/j.humimm.2020.10.004
38. Requena D, Médico A, Chacón RD, Ramírez M, Marín-Sánchez O. Identification of novel candidate epitopes on SARS-CoV-2 proteins for South America: A review of HLA frequencies by country. Front Immunol (2020) 11:2008. doi: 10.3389/fimmu.2020.02008
39. Sidney J, Peters B, Frahm N, Brander C, Sette A. HLA class I supertypes: A revised and updated classification. BMC Immunol (2008) 9:1. doi: 10.1186/1471-2172-9-1
40. Greenbaum J, Sidney J, Chung J, Brander C, Peters B, Sette A. Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes. Immunogenetics (2011) 63(6):325–35. doi: 10.1007/s00251-011-0513-0
41. Berglund L, Andrade J, Odeberg J, Uhlén M. The epitope space of the human proteome. Protein Sci Publ Protein Soc (2008) 17(4):606–13. doi: 10.1110/ps.073347208
42. Sonnhammer EL, von Heijne G, Krogh A. A hidden Markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol (1998) 6:175–82.
43. Krogh A, Larsson B, von Heijne G, Sonnhammer ELL. Predicting transmembrane protein topology with a hidden markov model: Application to complete genomes11Edited by f. Cohen. J Mol Biol (2001) 305(3):567–80. doi: 10.1006/jmbi.2000.4315
44. Finlay BB, McFadden G. Anti-immunology: Evasion of the host immune system by bacterial and viral pathogens. Cell (2006) 124(4):767–82. doi: 10.1016/j.cell.2006.01.034
45. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, et al. Clustal W and clustal X version 2.0. Bioinforma Oxf Engl (2007) 23(21):2947–8. doi: 10.1093/bioinformatics/btm404
46. Wilkins MR, Gasteiger E, Bairoch A, Sanchez JC, Williams KL, Appel RD, et al. Protein identification and analysis tools in the ExPASy server. Methods Mol Biol Clifton NJ (1999) 112:531–52. doi: 10.1385/1-59259-584-7:531
47. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature (2021) 596(7873):583–9. doi: 10.1038/s41586-021-03819-2
48. Dolinsky TJ, Czodrowski P, Li H, Nielsen JE, Jensen JH, Klebe G, et al. PDB2PQR: Expanding and upgrading automated preparation of biomolecular structures for molecular simulations. Nucleic Acids Res (2007) 35(suppl_2):W522–5. doi: 10.1093/nar/gkm276
49. Phillips JC, Hardy DJ, Maia JDC, Stone JE, Ribeiro JV, Bernardi RC, et al. Scalable molecular dynamics on CPU and GPU architectures with NAMD. J Chem Phys (2020) 153(4):044130. doi: 10.1063/5.0014475
50. Huang J, Rauscher S, Nawrocki G, Ran T, Feig M, de Groot BL, et al. CHARMM36m: An improved force field for folded and intrinsically disordered proteins. Nat Methods (2017) 14(1):71–3. doi: 10.1038/nmeth.4067
51. Stone JE, Phillips JC, Freddolino PL, Hardy DJ, Trabuco LG, Schulten K. Accelerating molecular modeling applications with graphics processors. J Comput Chem (2007) 28(16):2618–40. doi: 10.1002/jcc.20829
52. Hadden JA, Perilla JR. Molecular dynamics simulations of protein-drug complexes: A computational protocol for investigating the interactions of small-molecule therapeutics with biological targets and biosensors. Methods Mol Biol Clifton NJ (2018) 1762:245–70. doi: 10.1007/978-1-4939-7756-7_13
53. Seeber M, Felline A, Raimondi F, Muff S, Friedman R, Rao F, et al. Wordom: A user-friendly program for the analysis of molecular structures, trajectories, and free energy surfaces. J Comput Chem (2011) 32(6):1183. doi: 10.1002/jcc.21688
54. Daura X, Gademann K, Jaun B, Seebach D, van Gunsteren WF, Mark AE. Peptide folding: When simulation meets experiment. Angew Chem Int Ed (1999) 38(1–2):236–40. doi: 10.1002/(SICI)1521-3773(19990115)38:1/2<236::AID-ANIE236>3.0.CO;2-M
55. Xu D, Zhang Y. Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization. Biophys J (2011) 101(10):2525–34. doi: 10.1016/j.bpj.2011.10.024
56. Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res (2007) 35:W407–10. doi: 10.1093/nar/gkm290
57. Colovos C, Yeates TO. Verification of protein structures: Patterns of nonbonded atomic interactions. Protein Sci Publ Protein Soc (1993) 2(9):1511–9. doi: 10.1002/pro.5560020916
58. Schrödinger L, DeLano WM. PyMOL [Internet]. (2020). Available from: http://www.pymol.org/pymol
59. Molecular modelling: Principles and applications. Available at: https://www.pearson.com/content/one-dot-com/one-dot-com/us/en/higher-education/program.html.
60. Dominguez C, Boelens R, Bonvin AMJJ. HADDOCK: a protein–protein docking approach based on biochemical or biophysical information. J Am Chem Soc (2003) 125(7):1731–7. doi: 10.1021/ja026939x
61. Eastman P, Friedrichs MS, Chodera JD, Radmer RJ, Bruns CM, Ku JP, et al. OpenMM 4: A reusable, extensible, hardware independent library for high performance molecular simulation. J Chem Theory Comput (2013) 9(1):461–9. doi: 10.1021/ct300857j
62. Weber ANR, Morse MA, Gay NJ. Four n-linked glycosylation sites in human toll-like receptor 2 cooperate to direct efficient biosynthesis and secretion *. J Biol Chem (2004) 279(33):34589–94. doi: 10.1074/jbc.M403830200
63. Laskowski RA, Jabłońska J, Pravda L, Vařeková RS, Thornton JM. PDBsum: Structural summaries of PDB entries. Protein Sci Publ Protein Soc (2018) 27(1):129–34. doi: 10.1002/pro.3289
64. Xue LC, Rodrigues JP, Kastritis PL, Bonvin AM, Vangone A. PRODIGY: a web server for predicting the binding affinity of protein–protein complexes. Bioinformatics (2016) 32(23):3676–8. doi: 10.1093/bioinformatics/btw514
65. Castiglione F, Mantile F, De Berardinis P, Prisco A. How the interval between prime and boost injection affects the immune response in a computational model of the immune system. Comput Math Methods Med (2012) 2012:842329. doi: 10.1155/2012/842329
66. Tunyasuvunakool K, Adler J, Wu Z, Green T, Zielinski M, Žídek A, et al. Highly accurate protein structure prediction for the human proteome. Nature (2021) 596(7873):590–6. doi: 10.1038/s41586-021-03828-1
67. Williamson ED, Titball RW. A genetically engineered vaccine against the alpha-toxin of clostridium perfringens protects mice against experimental gas gangrene. Vaccine (1993) 11(12):1253–8. doi: 10.1016/0264-410X(93)90051-X
68. Nagahama M. Vaccines against clostridium perfringens alpha-toxin. Curr Pharm Biotechnol (2013) 14(10):913–7. doi: 10.2174/1389201014666131226124348
69. Hoang TH, Hong HA, Clark GC, Titball RW, Cutting SM. Recombinant bacillus subtilis expressing the clostridium perfringens alpha toxoid is a candidate orally delivered vaccine against necrotic enteritis. Infect Immun (2008) 76(11):5257–65. doi: 10.1128/IAI.00686-08
70. Springer S, Selbitz H -J. The control of necrotic enteritis in sucking piglets by means of a clostridium perfringens toxoid vaccine. FEMS Immunol Med Microbiol (1999) 24(3):333–6. doi: 10.1111/j.1574-695X.1999.tb01302.x
71. Wang Yh. Bioinformatics analysis of NetF proteins for designing a multi-epitope vaccine against clostridium perfringens infection. Infect Genet Evol J Mol Epidemiol Evol Genet Infect Dis (2020) 85:104461. doi: 10.1016/j.meegid.2020.104461
72. Elhag M, Abubaker M, Ahmad NM, Haroon EM, Alaagib RM, Albagi SOA, et al. Immunoinformatics prediction of epitope based peptide vaccine against listeria monocytogenes fructose bisphosphate aldolase protein. bioRxiv (2019), 649111. doi: 10.1101/649111
73. Morales Ruiz S, Bendezu J, Choque Guevara R, Montesinos R, Requena D, Choque Moreau L, et al. Development of a lateral flow test for the rapid detection of avibacterium paragallinarum in chickens suspected of having infectious coryza. BMC Vet Res (2018) 14(1):411. doi: 10.1186/s12917-018-1729-0
74. Valdivia-Olarte H, Requena D, Ramirez M, Saravia LE, Izquierdo R, Falconi-Agapito F, et al. Design of a predicted MHC restricted short peptide immunodiagnostic and vaccine candidate for fowl adenovirus c in chicken infection. Bioinformation (2015) 11(10):460–5. doi: 10.6026/97320630011460
75. Agranovich A, Maman Y, Louzoun Y. Viral proteome size and CD8+ T cell epitope density are correlated: The effect of complexity on selection. Infect Genet Evol J Mol Epidemiol Evol Genet Infect Dis (2013) 20:71–7. doi: 10.1016/j.meegid.2013.07.030
76. Trolle T, McMurtrey CP, Sidney J, Bardet W, Osborn SC, Kaever T, et al. The length distribution of class I-restricted T cell epitopes is determined by both peptide supply and MHC allele-specific binding preference. J Immunol Baltim Md 1950 (2016) 196(4):1480–7. doi: 10.4049/jimmunol.1501721
77. Paul S, Weiskopf D, Angelo MA, Sidney J, Peters B, Sette A. HLA class I alleles are associated with peptide-binding repertoires of different size, affinity, and immunogenicity. J Immunol (2013) 191(12):5831–9. doi: 10.4049/jimmunol.1302101
78. Van Damme L, Cox N, Callens C, Dargatz M, Flügel M, Hark S, et al. Protein truncating variants of colA in clostridium perfringens type G strains. Front Cell Infect Microbiol (2021) 11:645248. doi: 10.3389/fcimb.2021.645248
79. Kameyama S, Akama K. Purification and some properties of kappa toxin of clostridium perfringens. Jpn J Med Sci Biol (1971) 24(1):9–23. doi: 10.7883/yoken1952.24.9
80. Li J, Uzal FA, McClane BA. Clostridium perfringens sialidases: Potential contributors to intestinal pathogenesis and therapeutic targets. Toxins (2016) 8(11):E341. doi: 10.3390/toxins8110341
81. Boraston AB, Ficko-Blean E, Healey M. Carbohydrate recognition by a large sialidase toxin from clostridium perfringens. Biochemistry (2007) 46(40):11352–60. doi: 10.1021/bi701317g
82. Ficko-Blean E, Boraston AB. N-acetylglucosamine recognition by a family 32 carbohydrate-binding module from clostridium perfringens NagH. J Mol Biol (2009) 390(2):208–20. doi: 10.1016/j.jmb.2009.04.066
83. Canard B, Garnier T, Saint-Joanis B, Cole ST. Molecular genetic analysis of the nagH gene encoding a hyaluronidase of clostridium perfringens. Mol Gen Genet MGG (1994) 243(2):215–24. doi: 10.1007/BF00280319
84. Jiang Y, Kulkarni RR, Parreira VR, Prescott JF. Immunization of broiler chickens against clostridium perfringens-induced necrotic enteritis using purified recombinant immunogenic proteins. Avian Dis (2009) 53(3):409–15. doi: 10.1637/8656-021109-Reg.1
85. Duff AF, Vuong CN, Searer KL, Briggs WN, Wilson KM, Hargis BM, et al. Preliminary studies on development of a novel subunit vaccine targeting clostridium perfringens mucolytic enzymes for the control of necrotic enteritis in broilers. Poult Sci (2019) 98(12):6319–25. doi: 10.3382/ps/pez448
86. Caradonna TM, Schmidt AG. Protein engineering strategies for rational immunogen design. NPJ Vaccines (2021) 6(1):154. doi: 10.1038/s41541-021-00417-1
87. Merlin M, Gecchele E, Capaldi S, Pezzotti M, Avesani L. Comparative evaluation of recombinant protein production in different biofactories: The green perspective. BioMed Res Int (2014) 2014:136419. doi: 10.1155/2014/136419
88. Pollet J, Chen WH, Strych U. Recombinant protein vaccines, a proven approach against coronavirus pandemics. Adv Drug Deliv Rev (2021) 170:71–82. doi: 10.1016/j.addr.2021.01.001
89. Gao X, Ma Y, Wang Z, Bai J, Jia S, Feng B, et al. Oral immunization of mice with a probiotic lactobacillus casei constitutively expressing the α-toxoid induces protective immunity against clostridium perfringens α-toxin. Virulence (2019) 10(1):166–79. doi: 10.1080/21505594.2019.1582975
90. Pilehchian Langroudi R, Shamsara M, Aghaiypour K. Expression of clostridium perfringens epsilon-beta fusion toxin gene in e. coli and its immunologic studies in mouse. Vaccine (2013) 31(32):3295–9. doi: 10.1016/j.vaccine.2013.04.061
91. Cooper KK, Trinh HT, Songer JG. Immunization with recombinant alpha toxin partially protects broiler chicks against experimental challenge with clostridium perfringens. Vet Microbiol (2009) 133(1):92–7. doi: 10.1016/j.vetmic.2008.06.001
92. Salvarani FM, Conceição FR, Cunha CEP, Moreira GMSG, Pires PS, Silva ROS, et al. Vaccination with recombinant clostridium perfringens toxoids α and β promotes elevated antepartum and passive humoral immunity in swine. Vaccine (2013) 31(38):4152–5. doi: 10.1016/j.vaccine.2013.06.094
93. Zichel R, Mimran A, Keren A, Barnea A, Steinberger-Levy I, Marcus D, et al. Efficacy of a potential trivalent vaccine based on hc fragments of botulinum toxins a, b, and e produced in a cell-free expression system. Clin Vaccine Immunol CVI (2010) 17(5):784–92. doi: 10.1128/CVI.00496-09
94. Flores-Díaz M, Alape-Girón A. Role of clostridium perfringens phospholipase c in the pathogenesis of gas gangrene. Toxicon (2003) 42(8):979–86. doi: 10.1016/j.toxicon.2003.11.013
95. Verherstraeten S, Goossens E, Valgaeren B, Pardon B, Timbermont L, Haesebrouck F, et al. Non-toxic perfringolysin O and α-toxin derivatives as potential vaccine candidates against bovine necrohaemorrhagic enteritis. Vet J Lond Engl 1997 (2016) 217:89–94. doi: 10.1016/j.tvjl.2016.09.008
96. Liao CM, Huang C, Hsuan SL, Chen ZW, Lee WC, Liu CI, et al. Immunogenicity and efficacy of three recombinant subunit pasteurella multocida toxin vaccines against progressive atrophic rhinitis in pigs. Vaccine (2006) 24(1):27–35. doi: 10.1016/j.vaccine.2005.07.079
97. Tinker JK, Yan J, Knippel RJ, Panayiotou P, Cornell KA. Immunogenicity of a West Nile virus DIII-cholera toxin A2/B chimera after intranasal delivery. Toxins (2014) 6(4):1397–418. doi: 10.3390/toxins6041397
98. Mustafa W, Al-Saleem FH, Nasser Z, Olson RM, Mattis JA, Simpson LL, et al. Immunization of mice with the non-toxic HC50 domain of botulinum neurotoxin presented by rabies virus particles induces a strong immune response affording protection against high-dose botulinum neurotoxin challenge. Vaccine (2011) 29(28):4638–45. doi: 10.1016/j.vaccine.2011.04.045
99. Kaushik H, Deshmukh S, Mathur DD, Tiwari A, Garg LC. Recombinant expression of in silico identified bcell epitope of epsilon toxin of clostridium perfringens in translational fusion with a carrier protein. Bioinformation (2013) 9(12):617. doi: 10.6026/97320630009617
100. Mehdizadeh Gohari I, Parreira VR, Nowell VJ, Nicholson VM, Oliphant K, Prescott JF. A novel pore-forming toxin in type a clostridium perfringens is associated with both fatal canine hemorrhagic gastroenteritis and fatal foal necrotizing enterocolitis. PloS One (2015) 10(4):e0122684. doi: 10.1371/journal.pone.0122684
101. Goumari MM, Farhani I, Nezafat N, Mahmoodi S. Multi-epitope vaccines (MEVs), as a novel strategy against infectious diseases. Curr Proteomics (2020) 17(5):354–64. doi: 10.2174/1570164617666190919120140
102. Sette A, Livingston B, McKinney D, Appella E, Fikes J, Sidney J, et al. The development of multi-epitope vaccines: Epitope identification, vaccine design and clinical evaluation. Biologicals (2001) 29(3):271–6. doi: 10.1006/biol.2001.0297
103. Rodrigues RR, Ferreira MRA, Kremer FS, Donassolo RA, Júnior CM, Alves MLF, et al. Recombinant vaccine design against clostridium spp. toxins using immunoinformatics tools. In: Thomas S, editor. Vaccine design: Methods and protocols, volume 3 resources for vaccine development, vol. p . New York, NY: Springer US (2022). p. 457–70.
104. Smith CC, Entwistle S, Willis C, Vensko S, Beck W, Garness J, et al. Landscape and selection of vaccine epitopes in SARS-CoV-2. BioRxiv Prepr Serv Biol (2020) 13(1):101. doi: 10.1101/2020.06.04.135004
105. Saxová P, Buus S, Brunak S, Keşmir C. Predicting proteasomal cleavage sites: A comparison of available methods. Int Immunol (2003) 15(7):781–7. doi: 10.1093/intimm/dxg084
106. Calis JJA, Reinink P, Keller C, Kloetzel PM, Keşmir C. Role of peptide processing predictions in T cell epitope identification: Contribution of different prediction programs. Immunogenetics (2015) 67(2):85–93. doi: 10.1007/s00251-014-0815-0
107. Backert L, Kohlbacher O. Immunoinformatics and epitope prediction in the age of genomic medicine. Genome Med (2015) 7:119. doi: 10.1186/s13073-015-0245-0
108. Seligmann H. Cost-minimization of amino acid usage. J Mol Evol (2003) 56(2):151–61. doi: 10.1007/s00239-002-2388-z
109. Carrasco Pro S, Sidney J, Paul S, Lindestam Arlehamn C, Weiskopf D, Peters B, et al. Automatic generation of validated specific epitope sets. J Immunol Res (2015) 2015:763461. doi: 10.1155/2015/763461
110. Tian Y, da Silva Antunes R, Sidney J, Lindestam Arlehamn CS, Grifoni A, Dhanda SK, et al. A review on T cell epitopes identified using prediction and cell-mediated immune models for mycobacterium tuberculosis and bordetella pertussis. Front Immunol (2018) 9:2778. doi: 10.3389/fimmu.2018.02778
111. Longmate J, York J, La Rosa C, Krishnan R, Zhang M, Senitzer D, et al. Population coverage by HLA class-I restricted cytotoxic T-lymphocyte epitopes. Immunogenetics (2001) 52(3–4):165–73. doi: 10.1007/s002510000271
112. Westernberg L, Schulten V, Greenbaum JA, Natali S, Tripple V, McKinney DM, et al. T-Cell epitope conservation across allergen species is a major determinant of immunogenicity. J Allergy Clin Immunol (2016) 138(2):571–578.e7. doi: 10.1016/j.jaci.2015.11.034
113. Ikram A, Anjum S, Tahir M. In silico identification and conservation analysis of b-cell and T-cell epitopes of hepatitis c virus 3a genotype enveloped glycoprotein 2 from Pakistan: A step towards heterologous vaccine design. Hepat Mon (2014) 14(6):e9832. doi: 10.5812/hepatmon.9832
114. Aldakheel FM, Abrar A, Munir S, Aslam S, Allemailem KS, Khurshid M, et al. Proteome-wide mapping and reverse vaccinology approaches to design a multi-epitope vaccine against clostridium perfringens. Vaccines (2021) 9(10):1079. doi: 10.3390/vaccines9101079
115. Tropp BE. Cardiolipin synthase from escherichia coli. Biochim Biophys Acta (1997) 1348(1–2):192–200. doi: 10.1016/S0005-2760(97)00100-8
116. Provoda CJ, Lee KD. Bacterial pore-forming hemolysins and their use in the cytosolic delivery of macromolecules. Adv Drug Deliv Rev (2000) 41(2):209–21. doi: 10.1016/S0169-409X(99)00067-8
117. Paredes-Sabja D, Setlow P, Sarker MR. Germination of spores of bacillales and clostridiales species: Mechanisms and proteins involved. Trends Microbiol (2011) 19(2):85–94. doi: 10.1016/j.tim.2010.10.004
118. Malik JA, Mulla AH, Farooqi T, Pottoo FH, Anwar S, Rengasamy KRR. Targets and strategies for vaccine development against SARS-CoV-2. BioMed Pharmacother (2021) 137:111254. doi: 10.1016/j.biopha.2021.111254
119. Kar T, Narsaria U, Basak S, Deb D, Castiglione F, Mueller DM, et al. A candidate multi-epitope vaccine against SARS-CoV-2. Sci Rep (2020) 10(1):10895. doi: 10.1038/s41598-020-67749-1
120. Hou J, Liu Y, Hsi J, Wang H, Tao R, Shao Y. Cholera toxin b subunit acts as a potent systemic adjuvant for HIV-1 DNA vaccination intramuscularly in mice. Hum Vaccines Immunother (2014) 10(5):1274–83. doi: 10.4161/hv.28371
121. Kim HJ, Kim JK, Seo SB, Lee HJ, Kim HJ. Intranasal vaccination with peptides and cholera toxin subunit b as adjuvant to enhance mucosal and systemic immunity to respiratory syncytial virus. Arch Pharm Res (2007) 30(3):366–71. doi: 10.1007/BF02977620
122. Tamura S, Funato H, Nagamine T, Aizawa C, Kurata T. Effectiveness of cholera toxin b subunit as an adjuvant for nasal influenza vaccination despite pre-existing immunity to CTB. Vaccine (1989) 7(6):503–5. doi: 10.1016/0264-410X(89)90273-9
123. Livingston B, Crimi C, Newman M, Higashimoto Y, Appella E, Sidney J, et al. A rational strategy to design multiepitope immunogens based on multiple Th lymphocyte epitopes. J Immunol Baltim Md (2002) 168(11):5499–506. doi: 10.4049/jimmunol.168.11.5499
124. Liu C, Chin JX, Lee DY. SynLinker: An integrated system for designing linkers and synthetic fusion proteins. Bioinforma Oxf Engl (2015) 31(22):3700–2. doi: 10.1093/bioinformatics/btv447
125. Li G, Huang Z, Zhang C, Dong BJ, Guo RH, Yue HW, et al. Construction of a linker library with widely controllable flexibility for fusion protein design. Appl Microbiol Biotechnol (2016) 100(1):215–25. doi: 10.1007/s00253-015-6985-3
126. Kalita P, Lyngdoh DL, Padhi AK, Shukla H, Tripathi T. Development of multi-epitope driven subunit vaccine against fasciola gigantica using immunoinformatics approach. Int J Biol Macromol (2019) 138:224–33. doi: 10.1016/j.ijbiomac.2019.07.024
127. Vakili B, Eslami M, Hatam GR, Zare B, Erfani N, Nezafat N, et al. Immunoinformatics-aided design of a potential multi-epitope peptide vaccine against leishmania infantum. Int J Biol Macromol (2018) 120:1127–39. doi: 10.1016/j.ijbiomac.2018.08.125
128. Enayatkhani M, Hasaniazad M, Faezi S, Gouklani H, Davoodian P, Ahmadi N, et al. Reverse vaccinology approach to design a novel multi-epitope vaccine candidate against COVID-19: An in silico study. J Biomol Struct Dyn (2021) 39(8):2857–72. doi: 10.1080/07391102.2020.1756411
129. Bhattarai A, Emerson IA. Dynamic conformational flexibility and molecular interactions of intrinsically disordered proteins. J Biosci (2020) 45(1):29. doi: 10.1007/s12038-020-0010-4
130. Fieser TM, Tainer JA, Geysen HM, Houghten RA, Lerner RA. Influence of protein flexibility and peptide conformation on reactivity of monoclonal anti-peptide antibodies with a protein alpha-helix. Proc Natl Acad Sci U S A (1987) 84(23):8568–72. doi: 10.1073/pnas.84.23.8568
131. Dym O, Eisenberg D, Yeates TO. Detection of errors in protein models. International Tables for Crystallography (2006), F. ch. 21.3:520–30. doi: 10.1107/97809553602060000709
132. Khatoon N, Pandey RK, Prajapati VK. Exploring leishmania secretory proteins to design b and T cell multi-epitope subunit vaccine using immunoinformatics approach. Sci Rep (2017) 7(1):8285. doi: 10.1038/s41598-017-08842-w
133. Tahir Ul Qamar M, Rehman A, Tusleem K, Ashfaq UA, Qasim M, Zhu X, et al. Designing of a next generation multiepitope based vaccine (MEV) against SARS-COV-2: Immunoinformatics and in silico approaches. PloS One (2020) 15(12):e0244176. doi: 10.1371/journal.pone.0244176
134. Tahir ul Qamar M, Shahid F, Aslam S, Ashfaq UA, Aslam S, Fatima I, et al. Reverse vaccinology assisted designing of multiepitope-based subunit vaccine against SARS-CoV-2. Infect Dis Poverty (2020) 9:132. doi: 10.1186/s40249-020-00752-w
135. Tariq MH, Bhatti R, Ali NF, Ashfaq UA, Shahid F, Almatroudi A, et al. Rational design of chimeric multiepitope based vaccine (MEBV) against human T-cell lymphotropic virus type 1: An integrated vaccine informatics and molecular docking based approach. PloS One (2021) 16(10):e0258443. doi: 10.1371/journal.pone.0258443
136. Takehara M, Kobayashi K, Nagahama M. Toll-like receptor 4 protects against clostridium perfringens infection in mice. Front Cell Infect Microbiol (2021) 11:633440. doi: 10.3389/fcimb.2021.633440
137. Jin MS, Kim SE, Heo JY, Lee ME, Kim HM, Paik SG, et al. Crystal structure of the TLR1-TLR2 heterodimer induced by binding of a tri-acylated lipopeptide. Cell (2007) 130(6):1071–82. doi: 10.1016/j.cell.2007.09.008
138. Park BS, Song DH, Kim HM, Choi BS, Lee H, Lee JO. The structural basis of lipopolysaccharide recognition by the TLR4–MD-2 complex. Nature (2009) 458(7242):1191–5. doi: 10.1038/nature07830
139. Frank SA. Immunology and evolution of infectious disease. Princeton University Press (2020). doi: 10.1515/9780691220161
140. Crotty S, Ahmed R. Immunological memory in humans. Semin Immunol (2004) 16(3):197–203. doi: 10.1016/j.smim.2004.02.008
141. Ihssen J, Kowarik M, Dilettoso S, Tanner C, Wacker M, Thöny-Meyer L. Production of glycoprotein vaccines in escherichia coli. Microb Cell Factories (2010) 9(1):61. doi: 10.1186/1475-2859-9-61
142. Tripathi NK, Shrivastava A. Recent developments in recombinant protein-based dengue vaccines. Front Immunol (2018) 9:1919. doi: 10.3389/fimmu.2018.01919
Keywords: immunoinformatics, Clostridium perfringens, epitope, toxin, vaccine, molecular dynamics
Citation: Soto LF, Romaní AC, Jiménez-Avalos G, Silva Y, Ordinola-Ramirez CM, Lopez Lapa RM and Requena D (2022) Immunoinformatic analysis of the whole proteome for vaccine design: An application to Clostridium perfringens. Front. Immunol. 13:942907. doi: 10.3389/fimmu.2022.942907
Received: 13 May 2022; Accepted: 02 August 2022;
Published: 30 August 2022.
Edited by:
Adriana Harbuzariu, Emory University, United StatesReviewed by:
Sajjad Ahmad, Abasyn University, PakistanHamidreza Majidiani, Neyshabur University of Medical Sciences, Iran
Copyright © 2022 Soto, Romaní, Jiménez-Avalos, Silva, Ordinola-Ramirez, Lopez Lapa and Requena. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: David Requena, drequena@rockefeller.edu