Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 28 June 2022
Sec. Infectious Agents and Disease
This article is part of the Research Topic Innovative Tools to Support the Elimination of Neglected Tropical Diseases (NTDs) View all 14 articles

In silico Analysis of Peptide-Based Biomarkers for the Diagnosis and Prevention of Latent Tuberculosis Infection

\r\nPeng Cheng,Peng Cheng1,2Liang Wang*Liang Wang3*Wenping Gong*Wenping Gong1*
  • 1Tuberculosis Prevention and Control Key Laboratory/Beijing Key Laboratory of New Techniques of Tuberculosis Diagnosis and Treatment, Senior Department of Tuberculosis, The 8th Medical Center of PLA General Hospital, Beijing, China
  • 2Hebei North University, Zhangjiakou, China
  • 3Department of Geriatrics, The 8th Medical Center of PLA General Hospital, Beijing, China

Background: Latent tuberculosis infection (LTBI) is the primary source of active tuberculosis (ATB), but there are no specific methods for diagnosing and preventing LTBI.

Methods: Dominant T and B cell epitopes predicted from five antigens related to LTBI and Mycobacterium tuberculosis region of difference (LTBI-RD) were used to construct a novel polypeptide molecule (PPM). Then, the physicochemical properties, secondary structure, tertiary structure of the PPM, and its binding ability to toll-like receptor 2 (TLR2) and TLR4 were analyzed by bioinformatics tools. Finally, immune stimulation and expression optimization of the PPM were carried out.

Results: Four helper T lymphocytes (HTL) epitopes, five cytotoxic T lymphocytes (CTL) epitopes, and three B cell epitopes were predicted and screened from five LTBI-RD related antigens. These epitopes were connected in series with linkers and adjuvants to construct a novel PPM termed C543P. The results indicated that antigenicity and immunogenicity scores of the C543P candidate were 0.936399 and 1.36469, respectively. The structural analysis results showed that the C543P candidate had good stability. Its secondary structure contained 43.6% α-helix, the Z-score after tertiary structure optimization was −7.9, and the Ramachandran diagram showed that 88.77% amino acid residues of the C543P candidate were in the allowable region. Furthermore, the C543P candidate showed an excellent affinity to TLR2 (−1091.7kcal/mol) and TLR4 (−1102.7kcal/mol). In addition, we also analyzed the immunological characteristics of the C543P candidate. Immune stimulation prediction showed that the C543P candidate could effectively activate T and B lymphocytes and produce high levels of Th1 cytokines such as IFN-γ and IL-2.

Conclusion: We constructed a novel PPM with acceptable antigenicity, immunogenicity, stability, and ability to induce robust immune responses. This study provides a new diagnostic biomarker or peptides-based vaccine for LTBI diagnosis and prevention.

Introduction

Tuberculosis (TB), a respiratory infectious disease caused by Mycobacterium tuberculosis (MTB) infection, is the second leading cause of death from a single infectious agent after coronavirus disease 2019 (COVID-19). The Global Tuberculosis Report released by the World Health Organization (WHO) pointed out that there were 10 million new TB cases and 1.3 million deaths globally in 2020 (WHO, 2021). TB prevention and control are facing severe challenges, such as latent TB infection (LTBI), multidrug-resistant TB (MDR-TB), rifampicin-resistant TB (RR-TB), and co-infection with SARS-CoV-2 or human immunodeficiency virus (HIV) (WHO, 2021). Previous studies have shown that more than 85% of ATB patients originate from LTBI (Gong and Wu, 2021). LTBI is a subclinical mycobacterial infection defined on the basis of cellular immune response to mycobacterial antigens without clinical manifestations in patients with ATB (Carranza et al., 2020). Previous studies reported that about 23% of the global population had been infected with MTB, but only 10% developed ATB, and nearly 90% of these people are LTBI (Houben and Dodd, 2016; Gong et al., 2018). However, there are no specific diagnostic methods for LTBI and ATB differential diagnosis. Only two methods were approved by the WHO for auxiliary diagnosis of LTBI, including tuberculin skin test (TST) and interferon-gamma (IFN-γ) release assay (IGRA) (Zellweger et al., 2020; Mosavari et al., 2021).

Traditional TST uses purified protein derivative (PPD) as an antigen stimulant. Although this method has the advantages of low price, simplicity, and no need for special equipment, it can not eliminate the interference caused by Bacillus Calmette–Guérin (BCG) vaccination. Therefore, some new TST and IGRAs diagnosis methods have been developed in recent years, including Diaskintest, C-Tb skin test, EC-Test, T-cell spot of TB assay (T-SPOT.TB), QuantiFERON-TB Gold In-Tube (QFT-GIT), QuantiFERON-TB Gold-Plus (QFT-Plus), LIAISON QFT-Plus, and LIOFeron TB/LTBI (Gong and Wu, 2021). These new technologies coincidentally replace the PPD antigen used in the traditional TST with early secreted antigenic target 6 (EAST-6) and culture filtrate protein 10 (CFP-10) antigens. EAST-6 and CFP10 are encoded by Rv3874 and Rv3875 genes, located in the MTB Region of Difference 1 (RD1) (Tang et al., 2014). Both antigens can eliminate BCG vaccination’s interference in LTBI diagnosis and reduce the false-positive diagnosis rate. Unfortunately, EAST-6 and CFP10 antigens are not expressed in the latent period of MTB, so it is still not possible to distinguish the LTBI population from ATB patients.

Our previous study found that antigens related to LTBI and RD (LTBI-RD) were promising candidates for LTBI differential diagnosis (Gong and Wu, 2021). Among these 21 LTBI-RD related antigens, five antigens (Rv1737c, Rv2659c, Rv2660c, Rv1981c, and Rv3879c) have been widely reported to be candidate biomarkers for discriminating diagnosis of LTBI. It has been reported that, compared with patients with ATB, the Rv1737c antigen located in the DosR region of MTB could induce significantly higher levels of IFN-γ and TNF-α and promote the expression of CD4+ T cells and CD8+ T cells with CD45RO+CD27+ phenotype in peripheral blood in individuals with LTBI (Arroyo et al., 2016, 2018). Furthermore, Bai et al. (2015) compared the ability of Rv2659c to stimulate the release of IFN-γ in TB patients, non-TB patients, and LTBI individuals, and they found that the IFN-γ level in LTBI individuals was significantly higher than that in ATB patients. In addition, a study in China using Rv2659c to screen LTBI in recruits also proved that Rv2659c can identify individuals with LTBI with an accuracy rate of 70%, so this antigen may be one of the candidate antigens for discriminating LTBI (Bai et al., 2017). He and colleagues used recombinant protein Rv2660c to evaluate its diagnostic value by enzyme-linked immunosorbent assay (ELISA) (He et al., 2015). The results showed that the number of plaques formed in the LTBI individuals stimulated by the Rv2660c antigen was significantly higher than in ATB patients and healthy populations (He et al., 2015). Furthermore, cytokines such as IL-2, IL-10, IFN-γ, and MIP-1a induced by Rv2660c antigen in LTBI individuals were significantly higher than those in TB patients. These results indicated that Rv2660 could induce more robust immune responses in individuals with LTBI (He et al., 2015).

Based on the evidence mentioned above, in this study, we predicted and screened helper T lymphocytes (HTL), cytotoxic T lymphocytes (CTL), and B cell epitopes from five antigens related to LTBI-RD to construct a polypeptide molecule (PPM) for diagnosing and preventing LTBI. We predicted the antigenicity, immunity, sensitization, physicochemical properties, and secondary structure of the PPM. We also performed the tertiary structure modeling and model validation of the PPM and docked the model with the human toll-like receptor (TLR) to understand its binding affinity. Subsequently, the thermodynamic stability of the PPM was analyzed, and the immune response process of immune cells in vivo after antigen injection was predicted by the C-ImmSim server. Finally, electronic cloning and codon optimization were carried out to better express this PPM. This study aimed to design a PPM covering HTL, CTL and B cell epitopes simultaneously, which could provide a novel candidate target for the diagnosis and prevention of LTBI.

Materials and Methods

Antigens Selection

In our previous study, we screened 21 most potential LTBI-RD related antigens from 133 RD antigens and 124 latency-associated antigens, including Rv1736c, Rv1737c, Rv2031c, Rv2626c, Rv2653c, Rv2654c, Rv2656c, Rv2657c, Rv2658c, Rv2659c, Rv2660c, Rv1511, Rv1978, Rv1980c, Rv1981c, Rv3872, Rv3873, Rv3878, Rv3879c, Rv3425, and Rv3429 (Gong and Wu, 2021). Through extensive literature reading, we found that five antigens have great potential in identifying LTBI, including Rv1737c (NarK2), Rv2659c, Rv2660c, Rv1981c (nrdf1), and Rv3879c (espk) (Table 1). The amino acid sequences of these candidate antigens were downloaded through the Mycobrowser server1 (Kapopoulou et al., 2011).

TABLE 1
www.frontiersin.org

Table 1. Characteristics of five selected LTBI-RD-associated antigens.

Prediction of HTL Epitopes

Immune Epitope Database (IEDB) is a free resource funded by the National Institute of Allergy and Infectious Diseases (NIAID). This database documents experimental data on antibodies and T-cell epitopes studied in humans, non-human primates, and other animal species in infectious diseases, allergy, autoimmunity, and transplantation. This study used IEDB’s major histocompatibility complex (MHC) II server2 to predict HTL epitopes (Zhang et al., 2008). Parameter settings are as follows: Prediction Method = IEDB recommended 2.22, MHC source species = Human, MHC allele(s) = total human leukocyte antigen (HLA) reference set (HLA-DR, HLA-DP, HLA-DQ), epitope length = 15. Potential HTL epitopes were selected using the percentile rank method. The percentile rank is obtained by comparing the peptide score with five million 15-mers in the SWISSPROT database. The lower the percentile rank score represents the higher binding of the epitope to MHC II. We choose the epitopes whose percentile rank is less than 0.5 for further exploration in this study. Then, the VaxiJen v2.03 was used to predict the antigenicity of the epitopes, and the threshold was 0.5 (Doytchinova and Flower, 2007). VaxJen v2.0 lists selected targets by auto-cross-covariance (ACC) transformation and gives predicted probabilities and protective antigen or non-protective claims. Next, the IFN-gamma epitope server4 was used to predict the IFN-γ inducibility of epitopes (Dhanda et al., 2013). The prediction method was Motif and SVM hybrid, and the model was IFN-γ versus Non-IFN-gamma. Then, epitopes with positive predictions were selected. Finally, the epitopes that passed the above screening were determined as the immunodominant HTL epitopes for constructing the PPM.

Prediction of CTL Epitopes

Cytotoxic T lymphocytes plays a vital role in controlling the occurrence and development of TB. Therefore, we predicted the CTL epitopes of five LTBI-RD related antigens using the IEDB MHC I server.5 This server predicts epitopes that bind to MHC I molecules and uses Artificial Neural Network (ANN) 4.0 for 36 HLA-A alleles, 34 HLA-B alleles, and 10 HLA-C alleles (Kim et al., 2012). Parameter settings: Prediction Method = IEDB recommended 2020.09 (NetMHCpan EL4.1), MHC source species = Human, MHC allele(s) = HLA allele reference set, and epitope length(s) = All lengths. We selected CTL epitopes with a percentile rank of <0.5 for subsequent analysis. Subsequently, the Class I Immunogenicity server6 was selected to analyze the immunogenicity of these epitopes. The CTL epitopes with a percentile rank of <0.5 and immune scores >0 were selected for further analysis. Finally, the VaxiJen v2.0 server was used to predict the antigenicity of the above epitopes, and the selection threshold was set at 0.5 (Doytchinova and Flower, 2007). The epitopes that passed the above screening were determined as the immunodominant CTL epitopes for constructing PPM.

Prediction of Linear B-cell Epitopes

Recent studies have shown that B cells play an essential role in antigen presentation and killing MTB, but they are often overlooked. Herein, the ABCpred server7 was used to predict linear B-cell epitopes of the five LTBI-RD antigens. The ABCpred server uses artificial neural networks to predict linear B-cell epitope regions in antigen sequences and is widely used in candidate diagnostic antigen and vaccine research (Saha and Raghava, 2006). The server predicted epitopes with an accuracy of 65.93%. Parameter setting: epitope length = 20, the screening threshold = 0.51. Thresholds range from +0.1 to +1.0, and an increased threshold will enhance the specificity but decrease the sensitivity. Finally, the epitopes that passed the above screening were determined to be the immunodominant B cell epitopes for constructing the PPM.

Construction of PPM for LTBI Diagnosis and Prevention

We linked the predicted and screened immunodominant HTL, CTL, and B cell epitopes by using linkers to construct a novel PPM for LTBI diagnosis and prevention. Adjuvant and pan HLA DR-binding epitope (PADRE) were added at the amino terminal of PPM. Besides, adjuvant and 6 × His tag (HHHHHH) were added at the carboxyl-terminal of PPM. The adjuvants used in the PPM included TLR4 receptor agonist CTB (AIE88420.1) (Meza et al., 2017) and TLR2 receptor agonist Pam2Cys (Jackson et al., 2004), and the linkers chosen in this study included GGPPG, AAY, and KK.

Immunogenicity, Antigenicity, Allergenicity, and Toxicity Prediction of PPM

The immunogenicity of the PPM was predicted by using the IEDB Immunogenicity server (see footnote 6). The antigenicity of PPM was predicted by using VaxiJen v2.0 (see footnote 3) and ANTIGENpro server,8 respectively. The parameters in both servers were set as default values. The ANTIGENpro server is based on cross-validation experiments with 76% accuracy on the combined dataset (Magnan et al., 2010). Identifying allergens is of great significance to developing diagnostic candidates and vaccines. Herein, two servers, AllerTOP v.2.0 server9 and Allergen FP v.1.0 server,10 were used to predict the sensitization of the PPM. AllerTOP v.2.0 develops the k nearest neighbors (kNN), ACC transformation, and amino acid E-descriptors machine learning techniques for classifying allergens by exploring the physiochemical properties of proteins. The accuracy of the AllerTOP v.2.0 was stated as 85.3% at fivefold cross-validation (Dimitrov et al., 2014a). AllerFP v.1.0 uses an alignment-free, descriptor-based fingerprint method to identify known allergens and non-allergens. AllerFP v.1.0 correctly recognized 88% of them with a Matthews correlation coefficient of 0.75945 (Dimitrov et al., 2014b). Finally, the ToxinPred server (http://crdd.osdd.net/raghava/toxinpred/) was used to predict the toxicity of the PPM.

Physiochemical Properties and Solubility Prediction

The physicochemical properties of the PPM were predicted using the Expasy Protparam server.11 This server can indicate molecular protein weight, theoretical isoelectric point (pI), amino acid composition, in vitro and in vivo half-life, instability, aliphatic index, and grand average of hydrophilicity (GRAVY) (Gasteiger, 2005). Then, the Protein-Sol server12 was used to predict the solubility of the PPM. The population average for the experimental dataset (PopAvrSol) is 0.45. Therefore, if the prediction result is >0.45, the candidate protein shows a higher solubility than the average soluble Escherichia coli (E. coli) protein from the experimental solubility dataset (Hebditch et al., 2017).

Secondary Structure and Three-Dimensional Structure Prediction and Refinement

The PSIPRED13 and Prabi servers14 were used to predict the secondary structure of the PPM. PSIPRED is a secondary structure generation tool that provides an excellent prediction of transmembrane topology, transmembrane helix, folds, domain recognition, etc. (McGuffin et al., 2000). Prabi severs use GOR4 to predict the secondary structure of peptide molecules, and the average accuracy of this method is 64.4% (Garnier et al., 1996). In addition, the I-TASSER server15 was used to predict the three-dimensional (3D) spatial structure of polypeptide molecules. The I-TASSER server is a platform for computerized protein structure and function prediction based on sequence-to-structure-to-function paradigms (Yang et al., 2015). At the same time, the I-TASSER server generates 3D atomic models through several thread alignments and iterative structural assembly simulations, and the estimated accuracy of the predictions is based on the modeled confidence score (C-score) of the modeling. In general, the value of the C-score is between −5 and 2. Therefore, the higher the value, the higher the accuracy (Roy et al., 2010). Then, the 3D model of the PPM model was optimized using the GalaxyRefine web server16 to improve the quality of its partial or complete structure (Heo et al., 2013).

Validation of 3D Structure

The ProSA-web server17 and the ERRAT-web server18 were used to evaluate the possible errors in the 3D structure of the PPM. The ProSA-web server accurately enters the frame by estimating the total quality score exact input structure and displays it in the form of a Z score. When the Z score exceeds the properties of the native protein, it indicates that the structure contains errors (Wiederstein and Sippl, 2007). The ERRAT-web server predicts the high-resolution crystallography structure of the PPM from non-bonded atom–atom interactions (Colovos and Yeates, 1993). Then, the SWISS-MODEL server19 was used to draw a Ramachandran diagram for the PPM (Waterhouse et al., 2018). The Ramachandran diagram shows the favorable region of the dihedral angle of the main chain relative to the amino acid residues in the protein structure (Waterhouse et al., 2018). The structure evaluation page shows the most relevant scores provided by MolProbity and helps us quickly identify the location of low-quality residues in its model or structure (Waterhouse et al., 2018).

Prediction of Discontinuous B-cell Epitopes

Most B cell epitopes are discontinuous, and the constructed proteins are folded in different ways to form discontinuous B cell epitopes. So we predict their discontinuous epitopes by using the ElliPro server20 following a previous study (Ponomarenko et al., 2008).

Molecular Docking With Immune Receptors

Molecular docking is the most basic and promising method to describe the interaction and binding affinity between peptides and human TLR2 and TLR4. We obtained PDB files for TLR2 (PDB ID: 6NIG) and TLR4 (PDB ID: 4G8A) from the NCBI Molecular Modeling Database (MMDB).21 Firstly, the ClusPro 2.0 online server22 was used for ligand-receptor docking analysis (Kozakov et al., 2017). Secondly, the protein-protein docking complexes were refined and re-scored using the PatchDock server23 and FireDock server24 (Schneidman-Duhovny et al., 2005; Andrusier et al., 2007; Mashiach et al., 2008). Finally, the HawDOCK server25 was used to measure the Molecular Mechanics/Generalized Born Surface Area (MM-GBSA) score. The lower the prediction score, the better the prediction result (Weng et al., 2019).

Molecular Dynamics Simulation and Immune Simulation

Molecular dynamics simulation of the PPM was explored using the iMODS web-server,26 through which the flexibility of the PPM can be defined and calculated (Lopéz-Blanco et al., 2011). Then, the C-ImmSim server27 was used to predict the PPM’s ability to induce immune cells to produce specific antibodies and various cytokines. In addition, this server can also estimate the immune response of B lymphocytes and T lymphocytes (including Th1 and Th2 lymphocytes) (Rapin et al., 2010). The parameters are set as follows: random seed, simulation volume, and simulation steps were selected default values; HLA selection: A0101, A0201, B0702, B0801, DRB10101, and DRB1501. Finally, one injection of the PPM was conducted in the human host.

Codon Optimization and in silico Cloning

Codon optimization can improve the expression of recombinant proteins. We used the Java Codon Adaptation Tool (JCAT) server28 to provide a feasible method for constructing a multi-epitopes-based recombinant plasmid. This server offers a codon-optimized version of the interest DNA sequence based on a chosen organism (Grote et al., 2005). The expression vector E. coli (K12) was selected. The output includes two parameters: codon adaptation index (CAI) and GC content percentage. In general, the ideal value of CAI is 1, and the GC content is 30–70%. The PPM sequence was inserted between EcoR I and Xbn I restriction sites of the pET30a (+) plasmid. SnapGene software was used to finalize in silico cloning.

Results

Selection of Immunodominant Epitopes and Construction of the PPM

The flow chart of prediction, screening, construction, and physicochemical properties of immunodominant epitopes was shown in Figure 1. One hundred and fifty-seven HTL epitopes (Supplementary Table 1) and 1184 CTL epitopes (Supplementary Table 2) with a percentile rank score of <0.5 were identified by using IEDB MHC II and MHC I web servers, respectively. Then, 18 epitopes with antigenicity score >0.7 and positive IFN-γ inducibility were screened from 157 HTL epitopes. In addition, 60 epitopes with immunogenicity > 0 and antigenicity > 1 were screened from 1184 CTL epitopes (Supplementary Table 3). Finally, we selected 8 epitopes with a truncated binding score > 0.9 from 154 B cell epitopes using the ABCpred server (Supplementary Table 4).

FIGURE 1
www.frontiersin.org

Figure 1. Flow chart of epitope prediction, screening, construction, and PPM analysis.

The potential immunodominant epitopes were selected following the below principles: (1) HTL epitopes with antigenicity > 0.7 and the highest IFN prediction score in each antigen were selected; (2) CTL epitopes with the highest immunogenicity score in each antigen were selected; (3) B cell epitopes with the highest ABCpred prediction score (>0.9) were selected. Thus, four HTL, five CTL, and three B cell epitopes were finally selected to construct a new PPM (Table 2). The GGPPG, AAY, or KK linker was chosen to connect HTL, CTL, or B cell epitopes, respectively. Adjuvant or agonist CTB (MTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAII TFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKV EKLCVWNNKTPHAIAAISMAN), PADRE (AGLFQRHGEGT KATVGEPV), and Pam2Cys (FNNFTVSFWLRVPKVSASHLE) were linked with EAAAK linkers. The above-constructed PPM was named C543P, and its construction pattern and amino acid sequence were shown in Figures 2A,B, respectively.

TABLE 2
www.frontiersin.org

Table 2. Overall the HTL, CTL, and B-cell epitopes selected to construct the C543P candidate.

FIGURE 2
www.frontiersin.org

Figure 2. (A) Schematic diagram of C543P polypeptide molecular construction. (B) Amino acid sequence. (C) Solubility analysis predicted by the Protein-Sol server.

Antigenicity, Immunogenicity, Sensitization, and Toxicity of the C543P Candidate

An ideal LTBI diagnostic and prophylactic candidate should be immunogenic, antigenic, non-toxic, and non-allergenic. Herein, to clarify these characteristics of the C543P candidate, we used multiple servers or databases for analysis. The results showed that the antigenicity predicted by VaxiJen v2.0 and ANTIGENpro were 0.7501 and 0.936399, respectively. The immunogenicity of the C543P candidate predicted by the IEDB Immunogenicity server was 1.36469. These data suggested that the C543P candidate had excellent antigenicity and immunogenicity, and showed a potential ability to induce strong immune responses. Furthermore, the prediction results of AllerTOP v.2.0 and AllergenFP v.1.0 servers showed that the C543P candidate was non-allergenic, and the prediction results of Toxin Pred sever revealed that the C543P candidate was non-toxic. These results indicate that the C543P candidate is immunogenic, antigenic, non-toxic, and non-allergenic, highlighting that the C543P candidate may be a promising biomarker or vaccine for LTBI diagnosis and prevention.

Physicochemical Properties and Solubility of C543P Candidate

The physicochemical properties of a diagnostic biomarker or a vaccine significantly impact their immunological functions (Ikai, 1980). Therefore, various physical and chemical properties of the C543P candidate were analyzed by the Expasy Protparam server, and the results showed that the C543P candidate is a protein with 367 amino acids, the weight of 39188.54 Da, the theoretical PI value of 9.74, instability index of 27.36, and fat index of 69.35 (Table 3). In addition, we also found that the estimated half-life of the C543P candidate in mammalian reticulocytes (in vitro), yeast (in vivo), and E. coli (in vivo) were 30 hours, > 20 hours, and > 10 hours, respectively (Table 3). These data suggest that the C543P candidate is very stable and has thermal stability (Table 3). Furthermore, the solubility of C543P predicted by Protein-Sol server was 0.437 (Figure 2C), which indicates that the C543P candidate had an acceptable solubility.

TABLE 3
www.frontiersin.org

Table 3. Physicochemical property of C543P predicted by the Expasy Protparam.

Secondary Structure, Tertiary Structure, and Tertiary Structure Validation of C543P Candidate

Based on the PSIPRED server and Prabi server, we presented the secondary structure of the C543P candidate (Figure 3A). The C543P candidate contains 43.6% α-helix, 15.8% β-strand, and 40.6% random coil. Subsequently, we predicted the 3D structures of the five 3D models designed by the I-TASSER server. The results showed that the Z scores of the five 3D models were between 0.82 and 9.12. Their confidence scores (C-score) were −1.55, −2.36, −2.33, −2.30, and −2.42, respectively. We selected the optimal 3D model with C-score = −2.36 to simulate the 3D structure of the C543P candidate (Figure 3B). We found that under this 3D model, the TM score of the C543P candidate was 0.52 ± 0.15, and the expected root-mean-square deviation (RMSD) was 10.2 ± 4.6A. The TM-value has been recommended as a calculating scale for the structural resemblance among the structures. The TM-value was suggested to address the issue of RMSD, which is delicate to native errors. Then, the GalaxyRefine web server was used to optimize the 3D model of the C543P candidate. GalaxyRefine web server can improve the consistency of modeled proteins to obtain higher quality prediction results. Through loop refinement and energy minimization on the rough model, we again obtained five optimized 3D models, from which model 2 was selected for further study (Figure 3C). The GDT-HA value of this model was 0.9469, RMSD value was 0.429, MolProbity value was 2.13, Clash score was 10.7, low Rotamers was 0.7, and Rama favored was 88.8.

FIGURE 3
www.frontiersin.org

Figure 3. Secondary and tertiary structure prediction of the C543P candidate. (A) The C543P candidate was predicted to contain 43.6% α-helices, 15.8% β-strand, and 40.6% random coils by the PSIPRED server. (B) The 3D model of the C543P candidate was obtained on the I-TASSER server. (C) The 3D model of the peptide molecule after optimization by the Galaxy refining server, and the colored structure was superimposed on the rough gray model.

ProSA-web server and ERRAT-web server were used to verify the quality and potential errors in the 3D model of the C543P candidate. The results showed that the Z score of the optimized C543P candidate was −7.9 (Figure 4A), and the energy map was shown in Figure 4B. The Overall Quality Factor of the C543P candidate was improved from 86.9436 to 91% after being optimized by the ERRAT-web server. The Ramachandran diagram showed that, before optimizing the C543P candidate, the Favored region was 75.39%, the outliers region was 7.85%, and the rotamer region was 9.70% (Figure 4C). After optimization, the Favored region was 88.77%, the outliers region was 1.92%, and the rotamer region was 0.37% (Figure 4D). Therefore, the Favored region of the C543P candidate was increased from 75.39 to 88.77%.

FIGURE 4
www.frontiersin.org

Figure 4. Evaluation and validation of the tertiary structure model of the C543P candidate. (A) Z-score predicted by ProSA-web server: −7.9. (B) Energy map of C543P peptide molecule validation. (C) Ramachandran diagram analysis of C543P peptide molecule before optimization shows Favored region: 75.39%, outliers region: 7.85%, rotamer region: 9.70%. (D) Ramachandran diagram of C543P peptide molecule optimization: Favored region: 88.77%, outliers region: 1.92%, rotamer region: 0.37%.

Conformational B-cell Epitopes

According to data obtained from the ElliPro server, 193 residues were distributed on 10 putative B-cell epitopes with values ranging from 0.501 to 0.992 (Supplementary Table 5). For discontinuous epitopes predicted by the ElliPro server, epitopes with a score > 0.69 are usually selected. The conformational B-cell epitopes of the C543P candidate were estimated to have 74 residues with scores ranging from 0.709 to 0.992 (Figure 5 and Table 4).

FIGURE 5
www.frontiersin.org

Figure 5. The conformation or discontinuous B cell epitope of the C543P candidate. The yellow balls in panels (A–D) represented 4 conformations or discontinuous B cell epitopes, respectively, and the rest were shown in gray.

TABLE 4
www.frontiersin.org

Table 4. The conformational B cell epitopes residues of the C543P candidate predicted by the ElliPro.

Molecular Docking

This study used several servers to predict protein-protein docking to improve prediction accuracy. Firstly, the docking exploration of TLR2 and TLR4 was performed using the ClusPro 2.0 server, which generated 30 predicted complexes. Secondly, two models (1091.7 and 1102.7 kcal/mol models) with lower complex binding energies were selected for further optimization. Then, the C543P candidate was evaluated using the PatchDock server, which generated different score sheets by identifying different models. Finally, the top 10 complexes identified by the above methods were refined through the FireDock server, from which the model with the lowest required binding energy was selected. The optimization results of the TLR2-C543P complex showed that the global energy was 1.34, the attractive van der Waals energy (VdW) was −5.62, the repulsive (VdW) was 2.38, and the atomic contact energy was 6.04 (Figure 6A). The optimization results of the TLR4-C543P complex showed that the global energy was −0.35, the attractive (VdW) was −3.49, the repulsive (VdW) was 1.66, and the atomic contact energy was −1.96 (Figure 6B). The HawDock server predicted the relative binding free energies of the C543P candidate to TLR2 and TLR4 in the MM-GBSA were −4.86 and −22.58 kcal/mol, respectively.

FIGURE 6
www.frontiersin.org

Figure 6. Ligand–receptor interaction. C543P was shown in purple and TLR2 (A) and TLR4 (B) were shown in blue.

Molecular Dynamics Simulation Between C543P Candidate and TLRs

The molecular dynamics simulation and normal mode analysis (NMA) of the docking complex of the C543P candidate with TLR2 and TLR4 were shown in Figures 7A, 8A, respectively. Simulation prediction was performed to determine the movement of molecules and atoms in the antigenic structure. The deformability of TLR2-C543P (Figure 7B) or TLR4-C543P (Figure 8B) complex was shown with peaks in the deformable regions of the C543P candidate. The simulation results showed that the TLR2-C543P complex and TLR4-C543P complex eigenvalues were 8.659505e-06 (Figure 7C) and 9.469126e-06 (Figure 8C), respectively. Variance plots showed a cumulative or individual variance of TLR2-C543P complex (Figure 7D) and TLR4-C543P complex (Figure 8D) with green or purple, respectively. The relation of the docked TLR2-C543P complex (Figure 7E) and TLR4-C543P complex (Figure 8E) between the NMA and the PDB sector was shown by the B-factor graph. The co-variance map of the complex where the correlated motion between a pair of residues was indicated by red color, uncorrelated by white color, and anti-correlated by blue color (Figures 7F, 8F). The elastic network model showed that the docked protein molecule (C-α) atoms were interconnected with some degree of “spring” (harder springs were shown in darker gray) (Figures 7G, 8G).

FIGURE 7
www.frontiersin.org

Figure 7. Molecular dynamics simulation of the C543P candidate and TLR2. (A) NMA mobility. The direction of the arrow indicates the direction of molecular motion. Blue represents the C543P molecule and green expresses the TLR2 receptor. (B) variability; (C) eigenvalues; (D) variance (Purple for individual variance, green for cumulative variance); (E) B-factor; (F) covariance plot (red for correlated motion, white for uncorrelated motion, the blue color indicates anti-correlated motion); (G) elastic network (dark gray areas indicate harder areas).

FIGURE 8
www.frontiersin.org

Figure 8. Molecular dynamics simulation of the C543P candidate and TLR4. (A) NMA mobility. The direction of the arrow indicates the direction of molecular motion. Blue represents the C543P molecule and green expresses the TLR2 receptor. (B) variability; (C) eigenvalues; (D) variance (Purple for individual variance, green for cumulative variance); (E) B-factor; (F) covariance plot (red for correlated motion, white for uncorrelated motion, the blue color indicates anti-correlated motion); (G) elastic network (dark gray areas indicate harder areas).

Immune Responses Induced by the Candidate C543P

Mycobacterium tuberculosis is an intracellular pathogen, and cellular and humoral immunity play essential roles in killing and eliminating MTB. We found that the C543P candidate could effectively stimulate the innate immune system to induce immune responses in the immune simulation prediction. The results showed that after C543P stimulation, the population per state of NK cells increased in a staged and fluctuating manner, reaching a peak on day 9 (Figure 9A). The population per state of dendritic cells (DCs) reached a peak on day 23, mainly resting DCs. Interestingly, we found that presenting-2 DCs populations per state peaked on day 2 after C543P stimulation and gradually decreased (Figure 9B). In addition to DCs, macrophages can clear infected cells. Our study showed that the presenting-2 macrophages population per state started to proliferate after stimulation with C543P, peaked on day 2, and gradually decreased (Figure 9C). When presenting-2 macrophages reached their peak, the active macrophages population per state began to proliferate rapidly, reaching a peak on day 7 and maintaining a high level of activation (Figure 9C). Finally, considering that MTB first interacts with alveolar epithelial cells after entering the lungs, we also analyzed the effect of C543P on epithelial cells. The results showed that the population per state of epithelial cells would maintain a significantly high secretion level after being stimulated by C543P (Figure 9D).

FIGURE 9
www.frontiersin.org

Figure 9. The innate immune responses induced by C543P in C-ImmSim server. (A) Expression of NK cells after antigen stimulation. (B) Expression of DC cells (black) after antigen stimulation. (C) Expression in macrophages (black) after antigen stimulation. (D) Secretion from active epithelial cells (purple) after antigen stimulation.

Innate immunity plays a vital role in the initial stage of MTB infection, but the clearance and killing of MTB mainly depend on the adaptive immune responses. Recent studies suggested that B lymphocyte-mediated humoral immune responses also play a role in MTB clearance (Gong et al., 2021, 2022). C-ImmSim server prediction showed that after the C543P candidate stimulation, B lymphocytes mainly performed an antigen-presenting function in the initial stage. The presenting-2 B lymphocytes population per state reached 450 cells/mm3 on day 2 after C543P stimulation. Then, the active B lymphocytes population per state proliferated in large numbers, reaching a peak on day 7 after C543P stimulation (Figure 10A). C543P also induced B lymphocytes to produce high levels of IgG and IgM antibodies, peaking on day 6 (Figure 10B). CD4+ T lymphocytes are the central effector cells that kill and clear MTB. The C-ImmSim server showed that C543P could stimulate the proliferation of memory and non-memory CD4+ T lymphocytes significantly, reaching a peak on day 10 (>3,500 cells/mm3) and then decreasing slowly (Figure 10C). The population per state of active and resting HTL peaked on day 7 and then gradually declined (Figure 10D). Regulatory T cells can regulate the immune response of helper T cells to avoid the overreaction of Th1 and Th2 cells. The results showed that the regulatory T cell population per state reached as high as 140 cells/mm3 on day 5 after C543P stimulation, followed by a decrease (Figure 10E). Furthermore, it is reported that CTL plays an essential role in killing the host cells infected by MTB via producing granular enzymes and nitric oxide. Herein, we found that the population per state of non-memory cytotoxic T cells gradually increased and peaked (1,150 cells/mm3) on day 13, and that of memory cytotoxic T cells remained at the level of 1,125 cells/mm3 (Figure 10F). Moreover, we also found that the population per state of active cytotoxic T lymphocytes gradually increased and reached a peak on the 30th day after stimulation. In contrast, resting cytotoxic T lymphocytes showed an opposite trend (Figure 10G). Interestingly, our results indicated that C543P could induce significantly high levels of IFN-γ and IL-2 (Figure 10H).

FIGURE 10
www.frontiersin.org

Figure 10. The adaptive immune responses induced by C543P in C-ImmSim server. (A) Active B cell (purple) secretion after antigen stimulation. (B) The primary B cell antibody produced after antigen stimulation is IgM + IgG (yellow) changes over time. (C,D) Changes in the secretion level of helper T lymphocytes and the secretion levels of helper T lymphocytes of different memory types. (E) Changes in Treg T cell secretion after antigen stimulation. (F,G) Changes in the level of CD8+ T secretion after antigen stimulation and the secretion of different types of CD8+ T cells. (H). Changes in secretion levels of cytokines, mainly IFN-γ (purple) and IL-2 (yellow).

Codon Optimization and in-silico Cloning

To predict the molecular cloning and expression levels of the C543P candidate in vitro, we used the Java Adaptation Tool (JACT) server to predict maximum protein expression in E. coli (K12). The data showed that the CAI value of the C543P candidate was 1, and the average GC content of the adapted sequence was 52.6%, indicating that the C543P candidate could be highly expressed in E. coli. Finally, using SnapGene software, the gene sequence of the optimized C543P candidate was inserted into the pET30a(+) plasmid to construct a recombinant plasmid (Supplementary Figure 1).

Discussion

Rapid advances in bioinformatics, structural biology, and computational tools have revolutionized the development of diagnostic biomarkers and vaccines. The use of these tools has greatly aided in the processing and analyzing basic data on microorganisms (Kardani et al., 2020). The methods for predicting and designing diagnostic biomarkers and vaccines in-silico have greatly improved, and they have been applied to bacteria, viruses, fungi, and even cancers (Raoufi et al., 2020). Previous studies have designed several peptides-based vaccine candidates against MTB infection (Bibi et al., 2021; Sharma et al., 2021; Shiraz et al., 2021). The bioinformatics technologies used in these studies promote the development of TB vaccines and reduce cost and save time. However, the epitopes of these TB vaccines were predicted and selected from antigens secreted by MTB during proliferation, which means that these TB vaccines may not fight against LTBI. Thus, we designed a PPM named C543P based on epitopes selected from five LTBI-RD-associated antigens, including Rv1737c, Rv1981c, Rv2659c, Rv2660c, and Rv3879c. The C543P candidate might be a promising candidate biomarker for the diagnosis and prevention of LTBI.

LTBI is a dynamic equilibrium state achieved by mutual antagonism between host immune function and invasiveness of MTB (Gong and Wu, 2021). The activation of CD4+ T cells and CD8+ T cells depends on recognizing the MHC-II and MHC-I molecules on the surface of antigen-presenting cells. Therefore, the C543P candidate has the following advantages: (1) As a preventive vaccine for LTBI, the C543P can activate CD4+ T cells and CD8+ T cells to produce high levels of cytokines such as IFN-γ and IL-2, enhancing the anti-MTB activity of macrophages (Bertholet et al., 1950), and inhibit the transformation of LTBI to ATB. (2) As a biomarker for the differential diagnosis of LTBI, C543P could stimulate the peripheral blood mononuclear cells (PBMCs) of the LTBI population to produce significantly high levels of IFN-γ in a short time, thereby separating the LTBI population from ATB patients.

The enzyme-linked immunosorbent spot (ELISpot) assay is a commonly used method to detect human-specific T cells, accurately assessing the number of T cells with IFN-γ releasing ability. Therefore, the C543P was constructed based on immunodominant epitopes inducing a high level of IFN-γ in individuals with LTBI. However, differences in racial genetic background constitute a significant obstacle to developing epitope-based diagnostic markers and vaccines. HLA is highly polymorphic, thus, selecting immunodominant epitopes that can be recognized by numerous HLAs is beneficial to improving the coverage of the C543P candidate and avoiding racial bias (Bui et al., 2006). Therefore, when predicting HTL and CTL epitopes, we choose the default HLA-I and HLA-II alleles most frequently distributed in the population from the IEDB database. On this basis, the immunodominant epitopes with better antigenicity and immunogenicity were selected. In addition, the addition of B cell epitopes allows C543P to activate B lymphocytes more efficiently to induce humoral immunity.

The C543P candidate also contains several vital adjuvants and TLR agonists to enhance its ability to induce immune responses. These TLR agonists used in C543P can be recognized by cognate TLRs on different immune cells and polarize naive CD4+ T cells through the Th1 pathway to induce an immune response (Kawai and Akira, 2010). Our prediction data showed that the antigenicity of C543P was 0.7501, and the immunogenicity was 1.36469. These data indicate that the C543P has a good antigenicity and can induce a high level of immune response. Furthermore, we found that C543P was a non-toxic, non-allergenic, and stable candidate for LTBI diagnosis and prevention. The limitation of the C543P is that the solubility of C543P was 0.437, so it is necessary to improve its solubility in the subsequent optimization for better cloning and expression in E. coli.

The secondary structure of C543P mainly contains 43.6% α-helix, 15.8% β-strand, and 40.6% random coil. Naturally unfolded protein regions and α-helices are considered essential types of “structural antigens.” Thus, the increase of these two structures contributes to the recognition of antibodies induced after infection (Corradin et al., 2007). It was found that 88.77% of the amino acid residues of C543P were in the Favored region, indicating that the entire model’s prediction quality was acceptable. The main targets of host recognition of MTB are TLR2 and TLR4, so adding TLR2 and TLR4 agonists as adjuvants in the design of C543P can significantly improve the presentation efficiency of C543P. We tested the binding ability of C543P to TLR2 and TLR4, and the results showed that the C543P has a high binding affinity, suggesting that the C543P can be efficiently recognized by TLR2 and TLR4 and induce a more robust immune response.

Whether as a biomarker for differential diagnosis of LTBI or as a vaccine for LTBI prevention, the ability of C543P to induce immune responses in T and B lymphocytes determines its potential value. Previous studies have shown that macrophages clear MTB through oxygen, chloride, cytokine production, phagosome acidification, and intracellular MTB autophagy (Liu et al., 2017). In addition, some macrophages phagocytose MTB present antigens or epitopes to T cells for recognition and activate adaptive immune responses (Gong et al., 2022). Our immune prediction found that the C543P could induce effective innate and adaptive immune responses in humans. As the first line of defense against MTB infection, alveolar epithelial cells can use MHC-II molecules to influence the lungs’ immune system. Furthermore, the high level of alveolar epithelial cells can promote the secretion of cytokines by HTL to kill MTB (Shenoy, 2021). Our study found that macrophages play an essential role in C543P antigen presentation. The population per state of macrophages exceeded 100 cells/mm3 on the second day after C543P stimulation. The Th1 subset of CD4+ T lymphocytes plays a central role in the host killing of MTB infection, mainly secreting IFN-γ, IL-2, IL-12, and TNF-α (Boom et al., 2003). The results of C-ImmSim server prediction found that the C543P induced T lymphocytes to secrete high levels of Th1-type cytokines such as IFN-γ and IL-2, indicating that the C543P mainly induced the secretion of Th1-type cytokines. Apart from this, the C543P can also effectively activate CD8+ T lymphocytes, and the population per state of CD8+ T cells reached 1155 cells/mm3 on the 13th day after stimulation. These massively proliferated and activated CD8+ T cells can effectively lyse and kill cells infected with MTB by secreting a large amount of granzyme B, perforin, and other cytotoxic factors (Kumar and Babu, 2017).

Conclusion

In summary, a novel diagnostic biomarker or vaccine candidate named C543P was designed in this study. The C543P candidate consists of four HTL, five CTL, and three B cell epitopes as well as adjuvant or agonists. Our results showed that C543P was highly antigenic and immunogenic while non-sensitizing and non-toxic. Furthermore, it was found that the C543P candidate had a good affinity with both TLR2 and TLR4, and the kinetic simulation showed that C543P had good stability and could promote its interaction with TLRs. In addition, the C543P candidate can induce high levels of innate and adaptive immune responses in humans, characterized by markedly higher levels of the Th1-type cytokines such as IFN-γ and IL-2. This study provides a promising diagnostic biomarker and a vaccine candidate for diagnosing and preventing LTBI.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Author Contributions

WG and LW: conceptualization and writing—review and editing. PC: data curation, formal analysis and writing—original draft. WG: funding acquisition. PC and WG: methodology and software. All authors contributed to the article and approved the submitted version.

Funding

This study was funded by the Beijing Municipal Science & Technology Commission (Grant No. 19L2065, 7212103, and Z181100001718005) and the Chinese PLA General Hospital (Grant No. QNC19047).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.947852/full#supplementary-material

Supplementary Figure 1 | The polypeptide molecular sequence was electronically cloned into the pET30a expression vector by SnapGene software. The diagnostic antigen is the gene coding for the insertion of antigen molecules, and the rest is the expression vector.

Supplementary Table 1 | The detailed information of the whole HTL epitopes predicted by IEDB database. The potential HTL epitopes were identified by adjusted rank, antigenicity score, and IFN score, and marked with yellow color.

Supplementary Table 2 | The detailed information of the whole CTL epitopes predicted by IEDB database.

Supplementary Table 3 | The potential CTL epitopes identified by immunogenicity and antigenicity.

Supplementary Table 4 | The detailed information of linear B-cell epitopes predicted by the ABCpred. The epitopes selected for further study were marked with yellow color.

Supplementary Table 5 | The detailed information of conformational B cell epitopes residues of C543P predicted by the ElliPro.

Abbreviations

ACC, auto-cross-covariance; ATB, active tuberculosis; ANN, Artificial Neural Network; BCG, Bacillus Calmette–Guérin; CTL, cytotoxic T lymphocytes; CFP10, culture filtrate protein 10; CAI, codon adaptation index; DCs, Dendritic cells; E. coli, Escherichia coli; EAST-6, early secreted antigenic target 6; ELISpot, enzyme-linked immunosorbent spot; ELISA, enzyme-linked immunosorbent assay; GRAVY, grand average of hydrophilicity; HIV, human immunodeficiency virus; HLA, Human leukocyte antigen; HTL, helper T lymphocytes; IFN-γ, interferon gamma; IGRA, interferon gamma release assays; IEDB, Immune Epitope Database; JACT, Java Adaptation Tool; kNN, k nearest neighbors; LTBI, latent tuberculosis infection; MTB, Mycobacterium tuberculosis; MDR-TB, multidrug-resistant TB; MHC, major histocompatibility complex; NIAID, National Institute of Allergy and Infectious Diseases; NMA, normal mode analysis; PopAvrSol, population average for the experimental dataset; PBMCs, peripheral blood mononuclear cells; PPD, purified protein derivative; QFT-Plus, QuantiFERON-TB Gold-Plus; QFT-GIT, QuantiFERON-TB Gold In-Tube; RR-TB, rifampicin-resistant TB; RMSD, root mean square deviation; RD1, Region of Difference 1; T-SPOT.TB, T-cell spot of TB assay; TST, tuberculin skin test; TB, Tuberculosis; TLR, toll-like receptor; VdW, van der Waals energy; WHO, World Health Organization.

Footnotes

  1. ^ https://mycobrowser.epfl.ch/
  2. ^ http://tools.iedb.org/mhcii/
  3. ^ http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html
  4. ^ http://crdd.osdd.net/raghava/ifnepitope/index.php
  5. ^ http://tools.iedb.org/mhci/
  6. ^ http://tools.iedb.org/immunogenicity/
  7. ^ https://webs.iiitd.edu.in/raghava/abcpred/ABC_submission.html
  8. ^ http://scratch.proteomics.ics.uci.edu/
  9. ^ http://www.ddg-pharmfac.net/AllerTOP/
  10. ^ http://ddg-pharmfac.net/AllergenFP/
  11. ^ https://web.expasy.org/protparam/
  12. ^ https://protein-sol.manchester.ac.uk/
  13. ^ http://bioinf.cs.ucl.ac.uk/psipred/
  14. ^ https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_gor4.html
  15. ^ https://zhanggroup.org//I-TASSER/
  16. ^ https://galaxy.seoklab.org/cgi-bin/submit.cgi?type=REFINE
  17. ^ https://prosa.services.came.sbg.ac.at/prosa.php
  18. ^ https://saves.mbi.ucla.edu/
  19. ^ https://swissmodel.expasy.org/assess
  20. ^ http://tools.iedb.org/ellipro/
  21. ^ https://www.ncbi.nlm.nih.gov/structure/
  22. ^ https://cluspro.bu.edu/home.php
  23. ^ https://bioinfo3d.cs.tau.ac.il/PatchDock/php.php
  24. ^ http://bioinfo3d.cs.tau.ac.il/FireDock/
  25. ^ http://cadd.zju.edu.cn/hawkdock/
  26. ^ https://imods.iqfr.csic.es/
  27. ^ https://150.146.2.1/C-IMMSIM/index.php
  28. ^ http://www.jcat.de/Result.jsp

References

Andrusier, N., Nussinov, R., and Wolfson, H. J. (2007). FireDock: fast interaction refinement in molecular docking. Proteins 69, 139–159.

Google Scholar

Arroyo, L., Marín, D., Franken, K., Ottenhoff, T. H. M., and Barrera, L. F. (2018). Potential of DosR and Rpf antigens from Mycobacterium tuberculosis to discriminate between latent and active tuberculosis in a tuberculosis endemic population of Medellin Colombia. BMC Infect. Dis. 18:26. doi: 10.1186/s12879-017-2929-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Arroyo, L., Rojas, M., Franken, K. L., Ottenhoff, T. H., and Barrera, L. F. (2016). Multifunctional T Cell Response to DosR and Rpf Antigens Is Associated with Protection in Long-Term Mycobacterium tuberculosis-Infected Individuals in Colombia. Clin. Vaccine Immunol. 23, 813–824. doi: 10.1128/CVI.00217-16

PubMed Abstract | CrossRef Full Text | Google Scholar

Bai, X. (2015). Preparation of Four Tuberculosis Latent Proteins and the Evaluation of Their Immunological Characteristic. Beijing: Medical School of Chinese PLA.

Google Scholar

Bai, X., Liang, Y., Yang, Y., Zhang, J., and Wu, X. (2017). A new method of screening for latent tuberculosis infection: results from army recruits in Beijing in 2014. Immunol. Lett. 186, 28–32. doi: 10.1016/j.imlet.2017.03.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Bai, X. J., Liang, Y., Yang, Y. R., Li, N., Zhang, X. Y., An, H. R., et al. (2015). Immune responses to latent tuberculosis antigen Rv2659c in Chinese populations. J. Microbiol. Immunol. Infect. 48, 381–389. doi: 10.1016/j.jmii.2014.02.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Bertholet, S., Ireton, G. C., Kahn, M., Guderian, J., Mohamath, R., Stride, N., et al. (1950). Identification of human T cell antigens for the development of vaccines against Mycobacterium tuberculosis. J. Immunol. 2008, 7948–7957. doi: 10.4049/jimmunol.181.11.7948

PubMed Abstract | CrossRef Full Text | Google Scholar

Bibi, S., Ullah, I., Zhu, B., Adnan, M., Liaqat, R., Kong, W. B., et al. (2021). In silico analysis of epitope-based vaccine candidate against tuberculosis using reverse vaccinology. Sci. Rep. 11:1249. doi: 10.1038/s41598-020-80899-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Boom, W. H., Canaday, D. H., Fulton, S. A., Gehring, A. J., Rojas, R. E., and Torres, M. (2003). Human immunity to M. tuberculosis: t cell subsets and antigen processing. Tuberculosis 83, 98–106. doi: 10.1016/s1472-9792(02)00054-9

CrossRef Full Text | Google Scholar

Bui, H. H., Sidney, J., Dinh, K., Southwood, S., Newman, M. J., and Sette, A. (2006). Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC Bioinform. 7:153. doi: 10.1186/1471-2105-7-153

PubMed Abstract | CrossRef Full Text | Google Scholar

Carranza, C., Pedraza-Sanchez, S., de Oyarzabal-Mendez, E., and Torres, M. (2020). Diagnosis for Latent Tuberculosis Infection: new Alternatives. Front. Immunol. 11:2006. doi: 10.3389/fimmu.2020.02006

PubMed Abstract | CrossRef Full Text | Google Scholar

Chegou, N. N., Essone, P. N., Loxton, A. G., Stanley, K., Black, G. F., van der Spuy, G. D., et al. (2012). Potential of host markers produced by infection phase-dependent antigen-stimulated cells for the diagnosis of tuberculosis in a highly endemic area. PLoS One 7:e38501. doi: 10.1371/journal.pone.0038501

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J., Su, X., Zhang, Y., Wang, S., Shao, L., Wu, J., et al. (2009). Novel recombinant RD2- and RD11-encoded Mycobacterium tuberculosis antigens are potential candidates for diagnosis of tuberculosis infections in BCG-vaccinated individuals. Microbes infect. 11, 876–885. doi: 10.1016/j.micinf.2009.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Colovos, C., and Yeates, T. O. (1993). Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci. 2, 1511–1519. doi: 10.1002/pro.5560020916

PubMed Abstract | CrossRef Full Text | Google Scholar

Corradin, G., Villard, V., and Kajava, A. V. (2007). Protein structure based strategies for antigen discovery and vaccine development against malaria and other pathogens. Endocr. Metab. Immune. Disord. Drug Targets 7, 259–265. doi: 10.2174/187153007782794371

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhanda, S. K., Vir, P., and Raghava, G. P. (2013). Designing of interferon-gamma inducing MHC class-II binders. Biol. Direct. 8:30. doi: 10.1186/1745-6150-8-30

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitrov, I., Bangov, I., Flower, D. R., and Doytchinova, I. (2014a). AllerTOP v.2–a server for in silico prediction of allergens. J. Mol. Model. 20:2278. doi: 10.1007/s00894-014-2278-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitrov, I., Naneva, L., Doytchinova, I., and Bangov, I. (2014b). AllergenFP: allergenicity prediction by descriptor fingerprints. Bioinformatics 30, 846–851. doi: 10.1093/bioinformatics/btt619

PubMed Abstract | CrossRef Full Text | Google Scholar

Doytchinova, I. A., and Flower, D. R. (2007). VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinform. 8:4. doi: 10.1186/1471-2105-8-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Garnier, J., Gibrat, J. F., and Robson, B. (1996). GOR method for predicting protein secondary structure from amino acid sequence. Meth. Enzymol. 266, 540–553. doi: 10.1016/s0076-6879(96)66034-0

CrossRef Full Text | Google Scholar

Gasteiger, E. (2005). The Proteomics Protocols Handbook. Berlin: Springer, 571–607.

Google Scholar

Gong, W., Liang, Y., Mi, J., Jia, Z., Xue, Y., Wang, J., et al. (2021). Peptides-Based Vaccine MP3RT Induced Protective Immunity Against Mycobacterium Tuberculosis Infection in a Humanized Mouse Model. Front. Immunol. 12:666290. doi: 10.3389/fimmu.2021.666290

PubMed Abstract | CrossRef Full Text | Google Scholar

Gong, W., Liang, Y., and Wu, X. (2018). The current status, challenges, and future developments of new tuberculosis vaccines. Hum. Vaccin Immunother. 14, 1697–1716. doi: 10.1080/21645515.2018.1458806

PubMed Abstract | CrossRef Full Text | Google Scholar

Gong, W., Pan, C., Cheng, P., Wang, J., Zhao, G., and Wu, X. (2022). Peptide-Based Vaccines for Tuberculosis. Front. Immunol. 13:830497. doi: 10.3389/fimmu.2022.830497

PubMed Abstract | CrossRef Full Text | Google Scholar

Gong, W., and Wu, X. (2021). Differential Diagnosis of Latent Tuberculosis Infection and Active Tuberculosis: a Key to a Successful Tuberculosis Control Strategy. Front. Microbiol. 12:745592. doi: 10.3389/fmicb.2021.745592

PubMed Abstract | CrossRef Full Text | Google Scholar

Grote, A., Hiller, K., Scheer, M., Münch, R., Nörtemann, B., Hempel, D. C., et al. (2005). JCat: a novel tool to adapt codon usage of a target gene to its potential expression host. Nucleic Acids Res. 33, W526–W531. doi: 10.1093/nar/gki376

PubMed Abstract | CrossRef Full Text | Google Scholar

He, H., Yang, H., and Deng, Y. (2015). Mycobacterium tuberculosis dormancy-associated antigen of Rv2660c induces stronger immune response in latent Mycobacterium tuberculosis infection than that in active tuberculosis in a Chinese population. Eur. J. Clin. Microbiol. Infect. Dis. 34, 1103–1109. doi: 10.1007/s10096-015-2335-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Hebditch, M., Carballo-Amador, M. A., Charonis, S., Curtis, R., and Warwicker, J. (2017). Protein-Sol: a web tool for predicting protein solubility from sequence. Bioinformatics 33, 3098–3100. doi: 10.1093/bioinformatics/btx345

PubMed Abstract | CrossRef Full Text | Google Scholar

Heo, L., Park, H., and Seok, C. (2013). GalaxyRefine: protein structure refinement driven by side-chain repacking. Nucleic Acids Res. 41, W384–W388. doi: 10.1093/nar/gkt458

PubMed Abstract | CrossRef Full Text | Google Scholar

Hinks, T. S., Dosanjh, D. P., Innes, J. A., Pasvol, G., Hackforth, S., Varia, H., et al. (2009). Frequencies of region of difference 1 antigen-specific but not purified protein derivative-specific gamma interferon-secreting T cells correlate with the presence of tuberculosis disease but do not distinguish recent from remote latent infections. Infect. Immun. 77, 5486–5495. doi: 10.1128/IAI.01436-08

PubMed Abstract | CrossRef Full Text | Google Scholar

Houben, R. M., and Dodd, P. J. (2016). The Global Burden of Latent Tuberculosis Infection: a Re-estimation Using Mathematical Modelling. PLoS Med. 13:e1002152. doi: 10.1371/journal.pmed.1002152

PubMed Abstract | CrossRef Full Text | Google Scholar

Ikai, A. (1980). Thermostability and aliphatic index of globular proteins. J. Biochem. 88, 1895–1898.

Google Scholar

Jackson, D. C., Lau, Y. F., Le, T., Suhrbier, A., Deliyannis, G., Cheers, C., et al. (2004). A totally synthetic vaccine of generic structure that targets Toll-like receptor 2 on dendritic cells and promotes antibody or cytotoxic T cell responses. Proc. Natl. Acad. Sci. U.S.A. 101, 15440–15445. doi: 10.1073/pnas.0406740101

PubMed Abstract | CrossRef Full Text | Google Scholar

Kapopoulou, A., Lew, J. M., and Cole, S. T. (2011). The MycoBrowser portal: a comprehensive and manually annotated resource for mycobacterial genomes. Tuberculosis 91, 8–13. doi: 10.1016/j.tube.2010.09.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Kardani, K., Bolhassani, A., and Namvar, A. (2020). An overview of in silico vaccine design against different pathogens and cancer. Expert Rev. Vaccines. 19, 699–726. doi: 10.1080/14760584.2020.1794832

PubMed Abstract | CrossRef Full Text | Google Scholar

Kawai, T., and Akira, S. (2010). The role of pattern-recognition receptors in innate immunity: update on Toll-like receptors. Nat. Immunol. 11, 373–384. doi: 10.1038/ni.1863

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, Y., Ponomarenko, J., Zhu, Z., Tamang, D., Wang, P., Greenbaum, J., et al. (2012). Immune epitope database analysis resource. Nucleic Acids Res. 40, W525–W530.

Google Scholar

Kozakov, D., Hall, D. R., Xia, B., Porter, K. A., Padhorny, D., Yueh, C., et al. (2017). The ClusPro web server for protein-protein docking. Nat. Protoc. 12, 255–278. doi: 10.1038/nprot.2016.169

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, N. P., and Babu, S. (2017). Influence of diabetes mellitus on immunity to human tuberculosis. Immunology 152, 13–24. doi: 10.1111/imm.12762

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, C. H., Liu, H., and Ge, B. (2017). Innate immunity in tuberculosis: host defense vs pathogen evasion. Cell. Mol. Immunol. 14, 963–975. doi: 10.1038/cmi.2017.88

PubMed Abstract | CrossRef Full Text | Google Scholar

Lopéz-Blanco, J. R., Garzón, J. I., and Chacón, P. (2011). iMod: multipurpose normal mode analysis in internal coordinates. Bioinformatics 27, 2843–2850. doi: 10.1093/bioinformatics/btr497

PubMed Abstract | CrossRef Full Text | Google Scholar

Luabeya, A. K., Kagina, B. M., Tameris, M. D., Geldenhuys, H., Hoff, S. T., Shi, Z., et al. (2015). First-in-human trial of the post-exposure tuberculosis vaccine H56:IC31 in Mycobacterium tuberculosis infected and non-infected healthy adults. Vaccine 33, 4130–4140. doi: 10.1016/j.vaccine.2015.06.051

PubMed Abstract | CrossRef Full Text | Google Scholar

Magnan, C. N., Zeller, M., Kayala, M. A., Vigil, A., Randall, A., Felgner, P. L., et al. (2010). High-throughput prediction of protein antigenicity using protein microarray data. Bioinformatics 26, 2936–2943. doi: 10.1093/bioinformatics/btq551

PubMed Abstract | CrossRef Full Text | Google Scholar

Mashiach, E., Schneidman-Duhovny, D., Andrusier, N., Nussinov, R., and Wolfson, H. J. (2008). FireDock: a web server for fast interaction refinement in molecular docking. Nucleic Acids Res. 36, W229–W232. doi: 10.1093/nar/gkn186

PubMed Abstract | CrossRef Full Text | Google Scholar

McGuffin, L. J., Bryson, K., and Jones, D. T. (2000). The PSIPRED protein structure prediction server. Bioinformatics 16, 404–405.

Google Scholar

Meza, B., Ascencio, F., Sierra-Beltrán, A. P., Torres, J., and Angulo, C. (2017). A novel design of a multi-antigenic, multistage and multi-epitope vaccine against Helicobacter pylori: an in silico approach. Infect. Genet. Evol. 49, 309–317. doi: 10.1016/j.meegid.2017.02.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Mosavari, N., Karimi, A., Tadayon, K., Shahhosseini, G., Zavaran Hosseini, A., and Babaie, M. (2021). Evaluation of Heating and Irradiation Methods for Production of Purified Protein Derivative (PPD) of Mycobacterium Tuberculosis. Arch. Razi Inst. 75, 439–449. doi: 10.22092/ari.2019.123082.1238

PubMed Abstract | CrossRef Full Text | Google Scholar

Ponomarenko, J., Bui, H. H., Li, W., Fusseder, N., Bourne, P. E., Sette, A., et al. (2008). ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinform. 9:514. doi: 10.1186/1471-2105-9-514

PubMed Abstract | CrossRef Full Text | Google Scholar

Raoufi, E., Hemmati, M., Eftekhari, S., Khaksaran, K., Mahmodi, Z., Farajollahi, M. M., et al. (2020). Epitope Prediction by Novel Immunoinformatics Approach: a State-of-the-art Review. Int. J. Pept. Res. Ther. 26, 1155–1163. doi: 10.1007/s10989-019-09918-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Rapin, N., Lund, O., Bernaschi, M., and Castiglione, F. (2010). Computational immunology meets bioinformatics: the use of prediction tools for molecular binding in the simulation of the immune system. PLoS One 5:e9862. doi: 10.1371/journal.pone.0009862

PubMed Abstract | CrossRef Full Text | Google Scholar

Roy, A., Kucukural, A., and Zhang, Y. (2010). I-TASSER: a unified platform for automated protein structure and function prediction. Nat. Protoc. 5, 725–738. doi: 10.1038/nprot.2010.5

PubMed Abstract | CrossRef Full Text | Google Scholar

Saha, S., and Raghava, G. P. (2006). Prediction of continuous B-cell epitopes in an antigen using recurrent neural network. Proteins 65, 40–48. doi: 10.1002/prot.21078

PubMed Abstract | CrossRef Full Text | Google Scholar

Schneidman-Duhovny, D., Inbar, Y., Nussinov, R., and Wolfson, H. J. (2005). PatchDock and SymmDock: servers for rigid and symmetric docking. Nucleic Acids Res. 33, W363–W367. doi: 10.1093/nar/gki481

PubMed Abstract | CrossRef Full Text | Google Scholar

Sharma, R., Rajput, V. S., Jamal, S., Grover, A., and Grover, S. (2021). An immunoinformatics approach to design a multi-epitope vaccine against Mycobacterium tuberculosis exploiting secreted exosome proteins. Sci. Rep. 11:13836.

Google Scholar

Shenoy, A. T. (2021). Lyon De Ana C, Arafa EI, Salwig I, Barker KA, Korkmaz FT, et al. Antigen presentation by lung epithelial cells directs CD4(+) T(RM) cell function and regulates barrier immunity. Nat. Commun. 12:5834. doi: 10.1038/s41467-021-26045-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Shiraz, M., Lata, S., Kumar, P., Shankar, U. N., and Akif, M. (2021). Immunoinformatics analysis of antigenic epitopes and designing of a multi-epitope peptide vaccine from putative nitro-reductases of Mycobacterium tuberculosis DosR. Infect Genet. Evol. 94:105017. doi: 10.1016/j.meegid.2021.105017

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, X. L., Zhou, Y. X., Wu, S. M., Pan, Q., Xia, B., and Zhang, X. L. (2014). CFP10 and ESAT6 aptamers as effective Mycobacterial antigen diagnostic reagents. J. Infect. 69, 569–580. doi: 10.1016/j.jinf.2014.05.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Waterhouse, A., Bertoni, M., Bienert, S., Studer, G., Tauriello, G., Gumienny, R., et al. (2018). SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 46, W296–W303. doi: 10.1093/nar/gky427

PubMed Abstract | CrossRef Full Text | Google Scholar

Weng, G., Wang, E., Wang, Z., Liu, H., Zhu, F., Li, D., et al. (2019). HawkDock: a web server to predict and analyze the protein-protein complex based on computational docking and MM/GBSA. Nucleic Acids Res. 47, W322–W330. doi: 10.1093/nar/gkz397

PubMed Abstract | CrossRef Full Text | Google Scholar

WHO (2021). Global Tuberculosis Report 2021. Genevapp: World Health Organization.

Google Scholar

Wiederstein, M., and Sippl, M. J. (2007). ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 35, W407–W410. doi: 10.1093/nar/gkm290

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, J., Yan, R., Roy, A., Xu, D., Poisson, J., and Zhang, Y. (2015). The I-TASSER Suite: protein structure and function prediction. Nat. Methods 12, 7–8.

Google Scholar

Zellweger, J. P., Sotgiu, G., Corradi, M., and Durando, P. (2020). The diagnosis of latent tuberculosis infection (LTBI): currently available tests, future developments, and perspectives to eliminate tuberculosis (TB). Med. Lav. 111, 170–183. doi: 10.23749/mdl.v111i3.9983

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Q., Wang, P., Kim, Y., Haste-Andersen, P., Beaver, J., Bourne, P. E., et al. (2008). Immune epitope database analysis resource (IEDB-AR). Nucleic Acids Res. 36, W513–W518.

Google Scholar

Keywords: tuberculosis (TB), latent tuberculosis infection (LTBI), biomarker, vaccine, diagnosis

Citation: Cheng P, Wang L and Gong W (2022) In silico Analysis of Peptide-Based Biomarkers for the Diagnosis and Prevention of Latent Tuberculosis Infection. Front. Microbiol. 13:947852. doi: 10.3389/fmicb.2022.947852

Received: 19 May 2022; Accepted: 08 June 2022;
Published: 28 June 2022.

Edited by:

Wei Wang, Jiangsu Institute of Parasitic Diseases (JIPD), China

Reviewed by:

Chuan Wang, Sichuan University, China
Mohammad Tahir Waheed, Quaid-i-Azam University, Pakistan

Copyright © 2022 Cheng, Wang and Gong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wenping Gong, Z3dwODkxMDE1QHdodS5lZHUuY24=; Liang Wang, V2FuZ2wzMDlAc2luYS5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.