Discovering Panel of Autoantibodies for Early Detection of Lung Cancer Based on Focused Protein Array

Jiang, Di; Zhang, Xue; Liu, Man; Wang, Yulin; Wang, Tingting; Pei, Lu; Wang, Peng; Ye, Hua; Shi, Jianxiang; Song, Chunhua; Wang, Kaijuan; Wang, Xiao; Dai, Liping; Zhang, Jianying

doi:10.3389/fimmu.2021.658922

ORIGINAL RESEARCH article

Front. Immunol., 23 April 2021

Sec. Cancer Immunity and Immunotherapy

Volume 12 - 2021 | https://doi.org/10.3389/fimmu.2021.658922

This article is part of the Research TopicTumor-Associated Antigens and Their Autoantibodies: From Discovering to Clinical UtilizationView all 19 articles

Discovering Panel of Autoantibodies for Early Detection of Lung Cancer Based on Focused Protein Array

Di Jiang^1,2,3^†

Xue Zhang^1,2,3^†

Man Liu^1,2,3

Yulin Wang^1,2,3

Tingting Wang⁴

Lu Pei⁵

Peng Wang^3,6

Hua Ye^3,6

Jianxiang Shi^1,3

Chunhua Song^3,6

Kaijuan Wang^3,6

Xiao Wang^1,3

Liping Dai^1,2,3^*

Jianying Zhang^1,3^*

¹Department of Oncology, Henan Institute of Medical and Pharmaceutical Sciences, Zhengzhou University, Zhengzhou, China
²School of Basic Medical Sciences, Academy of Medical Science, Zhengzhou University, Zhengzhou, China
³Henan Key Laboratory of Tumor Epidemiology & State Key Laboratory of Esophageal Cancer Prevention, Zhengzhou University, Zhengzhou, China
⁴Department of Clinical Laboratory, Fuwai Central China Cardiovascular Hospital, Zhengzhou, China
⁵Department of Clinical Laboratory, Zhengzhou Hospital of Traditional Chinese Medicine, Zhengzhou, China
⁶Department of Epidemiology and Biostatistics in School of Public Health, Zhengzhou University, Zhengzhou, China

Substantial studies indicate that autoantibodies to tumor-associated antigens (TAAbs) arise in early stage of lung cancer (LC). However, since single TAAbs as non-invasive biomarkers reveal low diagnostic performances, a panel approach is needed to provide more clues for early detection of LC. In the present research, potential TAAbs were screened in 150 serum samples by focused protein array based on 154 proteins encoded by cancer driver genes. Indirect enzyme-linked immunosorbent assay (ELISA) was used to verify and validate TAAbs in two independent datasets with 1,054 participants (310 in verification cohort, 744 in validation cohort). In both verification and validation cohorts, eight TAAbs were higher in serum of LC patients compared with normal controls. Moreover, diagnostic models were built and evaluated in the training set and the test set of validation cohort by six data mining methods. In contrast to the other five models, the decision tree (DT) model containing seven TAAbs (TP53, NPM1, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1), built in the training set, yielded the highest diagnostic value with the area under the receiver operating characteristic curve (AUC) of 0.897, the sensitivity of 94.4% and the specificity of 84.9%. The model was further assessed in the test set and exhibited an AUC of 0.838 with the sensitivity of 89.4% and the specificity of 78.2%. Interestingly, the accuracies of this model in both early and advanced stage were close to 90%, much more effective than that of single TAAbs. Protein array based on cancer driver genes is effective in screening and discovering potential TAAbs of LC. The TAAbs panel with TP53, NPM1, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1 is excellent in early detection of LC, and they might be new target in LC immunotherapy.

Introduction

Lung cancer (LC) is one of the leading causes of cancer-related deaths worldwide, accounting for 28% of all cancer deaths (1, 2). In China, LC is the first common cause of cancer-related death in men and the second cause in women (3). Due to the lack of effective early diagnosis technology for LC, it remains a challenge to improve the overall survival of patients with LC (4, 5). In the past 50 years, the 5-year survival rate of LC patients at early stage is 60–70%, while it is dreadfully < 5% at late stage (3). Therefore, early diagnosis is a critical factor to reduce the mortality and improve the long-term survival rate of LC patients (6, 7). Low-dose computed tomography (LDCT) emerged as a novel screening method for LC in 1990's, it was reported with 20% reduction of LC-related death in National Lung Cancer Screening Trial (NLST) by LDCT (8). Nevertheless, LDCT has up to 90% false-positive rate, thus it is necessary to confirm the diagnosis by additional invasive surgery or repeated radiation exposure (9), which bring unnecessary burden to the patient's economy and body.

Blood tumor biomarkers are potential for early diagnosis of LC as they have advantages of non-invasion and convenient to access (10, 11). However, multiple tumor biomarkers utilized in clinical practice show low diagnostic accuracy for cancer, such as carcinoembryonic antigen (CEA), neuron-specific enolase (NSE), and cytokeratin-19 fragment (CYFRA 21-1) (12–14). Tumor-associated antigens (TAAs) refer to antigen molecules that exist on tumor cells or normal cells, but they are abnormally expressed in diverse cancers (15). Autoantibodies to TAAs (TAAbs) are produced in early stage of cancers by humoral immune response triggered by abnormal expression of TAAs. In comparison with other types of biomarkers, serum TAAbs appeared earlier and more stable (16). They are a kind of promising biomarkers which could be applied for early diagnosis in cancers (17).

Recently, the protein array technology was commonly applied in identifying new TAAbs, which can simultaneously analyze large number of proteins in parallel and recognize posttranslational modified proteins (18, 19). The mutation of cancer driver genes may be one of the important factors for the occurrence of cancers (20). Based on the 138 cancer driver genes (74 tumor suppressor genes and 64 oncogenes) listed in study of Vogelstein et al. (21), we customized a protein array with 154 human recombinant proteins to explore the autoantibodies against TAAs in LC. The selected TAAbs were further validated by enzyme-linked immunosorbent assay (ELISA). Since single TAAb was limited by low sensitivity and accuracy and combined multiple TAAbs could improve the detection rate of LC effectively (22–24), a series of data mining techniques were performed to establish diagnostic models for LC, such as logistic regression, Fisher discriminate analysis, decision tree (DT), support vector machines (SVM), artificial neural network-multilayer perception (ANN-MLP), and artificial neural network-radial basis function (ANN-RBF). Finally, we evaluated the diagnostic efficacy of these models and chose DT model as the optimal model.

Materials and Methods

Study Populations

In this study, totally 1,204 subjects [555 LCs, 505 normal controls (NCs), and 144 benign lung disease cases (BLDs)] in three independent cohorts (discovery cohort, verification cohort, and validation cohort) were recruited from the First Affiliated Hospital of Zhengzhou University in Henan province, China between November 2016 and April 2019 (Table 1). All specimens were collected with patients' written informed consent, and the study protocol was approved by Medical Ethics Committee of Zhengzhou University (Zhengzhou, China). The process of serum specimen preparation and the inclusion criteria of subjects were presented in Supplementary Texts 1,2, respectively.

TABLE 1

Table 1. Characteristics of populations in this study.

Focused Protein Array

A total of 154 human source recombinant proteins, including 143 proteins encoded by cancer driver genes and 11 proteins (CyclinB1, c-Myc, CIP2A/p90, IMP1, IMP2, IMP3, RalA, RBM39, YWHAZ, and two fragments of Survivin) previously researched in our laboratory, were contained in the focused protein array. The array was customized in CDI Laboratories (Mayaguez, USA). The array screening, data extraction, and analysis were implemented according to the protocol illustrated in Supplementary Text 3. Signal-to-noise ratio (SNR) was used to describe the serum level of autoantibodies in the subjects of discovery cohort. Based on the results of array test, we carried out comprehensive analyses to screen candidate TAAbs for LC (Supplementary Figure 1).

ELISA

Indirect ELISA was used to detect the level of candidate TAAbs in serum samples of verification cohort and validation cohort. Detailed steps of the indirect ELISA experiment are presented in Supplementary Text 4. In this study, the verification cohort was used to test the eligibility of candidate TAAbs, and validation cohort to further validate the diagnostic performance of TAAbs. The positive and negative control sera of the TAAb were set in each plate for quality control. Furthermore, the concentration of autoantibodies in the serum was calculated according to the IgG standard curve of each plate.

The Establishment of Diagnostic Model by Data Mining Methods

All diagnostic models were established by using SPSS Modeler 18.0 software. In order to establish and externally evaluate the diagnostic models, all LCs and NCs in the validation cohort were randomly divided into training (N = 414) and test (N = 186) sets according to the proportion of 7:3 by SPSS 21.0 software. Logistic regression analysis, Fisher discriminant analysis, DT C5.0, SVM, ANN-MLP, and ANN-RBF were applied to build models based on training set and then the models' performance were validated in test set. Additionally, Logistic regression models were established through forward and backward conditional logistic regression, respectively. The stepwise method and internal cross-validation were used in the Fisher discriminant model. In the construction of DT C5.0 model, decision tree was picked as the model output type with 10-fold cross-validation as internal validation. In order to improve the model, expert and global pruning mode were chosen, meanwhile, pruning severity and the minimum number of record for each sub-branch were set to 80 and 2, respectively. We also constructed models by MLP and RBF methods. MLP had more terminative rules than RBF (using a maximum training time of 1 min) and overfitting prevents the set from being 50.0% when choosing parameters of model. Moreover, we established SVM model in which the expert mode was selected. All methods were applied to distinguish LCs from NC.

Statistical Analysis

SPSS 21.0 software package, GraphPad Prism 5.0, and MedCalc 11 were used to analyze and visualize the data from ELISA in this research. Differences of TAAbs levels among the different groups were analyzed by non-parametric tests and Wilcoxon test with Bonferroni adjustment. The sensitivity, specificity, and AUC with 95% confidence internal (CI) were all calculated by receiver operating characteristic (ROC) curve analysis. The OD value produced at the highest Youden's Index (sensitivity + specificity −1) was set as the cutoff value. The difference was considered statistically significant while P < 0.05.

Results

Overall Study Design

The overall study was divided into three phases including the discovery of potential TAAbs, the validation of candidate TAAbs, and the establishment of diagnostic models (Figure 1). Briefly, in phase I, the serum samples of discovery cohort containing 100 LCs and 50 NCs were individually profiled on focused protein array. In phase II, 155 LCs and 155 NCs in the verification cohort were matched by age and gender, which was used to verify the screened candidate TAAbs from protein array. In addition, there were 300 LCs, 300 NCs, and 144 BLDs in the validation cohort, which was used to validate the TAAbs from the verification cohort. In phase III, the ELISA results of eight TAAbs of the LCs and NCs in validation cohort were applied to build and test the diagnostic models.

FIGURE 1

Figure 1. Overall study design.

Screening 12 Potential TAAbs for LC Based on Focused Protein Array

One hundred serum samples from LCs and 50 sera from NCs were tested by customized protein array. The 154 human recombinant protein, positive control (antihuman IgG) and negative control (buffer) arranged according to the protein array layout that shows in Figure 2A. The operation process and principle of the protein array were visualized in Figure 2B. As shown in Figures 2C,D, the fluorescent scanning signal results of two representative samples illustrated that the IgG response of the LC case was stronger than the NC.

FIGURE 2

Figure 2. Protein array customization and preliminary results. (A) Protein array layout. (B) The operation process and principle of the protein array. (C,D) Protein fluorescence quantification results of LC and NC, respectively. The red and blue frames highlight the positive control (anti-human IgG) and negative control (buffer). (E) The result of repeated experiments by the same serum sample. The lower left showed the distribution of the results after linear fitting, and the upper right showed correlation results between samples after linear fitting (***P < 0.001). The middle graph was the cumulative density distribution of a single sample.

Before the formal experiment, we repeated the tests 30 times in total on the same sample at different times, different arrays, and different locations to evaluate the stability of the array and the operation. From the results, the overall average value of repeatability between different batches of arrays was 0.98, indicating the overall stability was great (Figure 2E).

As exhibited in the Supplementary Figure 1, based on the criteria of AUC >0.5 and P < 0.05 by ROC analysis, the 40 TAAbs were preliminarily screened (Supplementary Table 1). Then, totally 15 TAAbs of them were further screened, which included 11 TAAbs selected by regression analysis and four TAAbs studied in our previous research. Whereafter, according to the criteria of the positive rate of LC minus NC was > 10%, we ultimately selected 12 candidate TAAbs which involved in carcinogenesis, such as cell cycle, apoptosis, PI3K pathway, and RAS pathway (Supplementary Table 2) for further verification. Higher level of the 12 TAAbs was observed in LCs than NCs (P < 0.05) (Figure 3A). The AUC of each TAAb was ranged from 0.596 (95% CI: 0.504–0.689) to 0.706 (95% CI: 0.643–0.769) (Figure 3B).

FIGURE 3

Figure 3. (A) SNR of autoantibodies against 12 TAAs in discovery cohort with 100 LCs and 50 NCs. (B) ROC analysis of autoantibodies against 12 TAAs for LC detection in discovery cohort. C, cancer; N, normal; ***P < 0.001; **P < 0.01; *P < 0.05.

Verifying the Candidate TAAbs by ELISA in Verification Cohort

In order to determine the diagnostic validity of 12 TAAbs, we tested these TAAbs in 310 serum samples in the verification cohort (155 LCs and 155 NCs) by ELISA. The results were highly consistent with the discovery phase. According to screening criteria of AUC >0.5 and P < 0.05, four TAAbs (P62, Survivin, PBRM1, and JAK2) were excluded. The concentration level of the other eight TAAbs in the serum of LCs was significantly higher than NCs (P < 0.05) (Supplementary Figure 2A). As displayed in Supplementary Figure 2B, GNA11 owned the highest AUCof 0.802 (95% CI: 0.753–0.850).

The Performance of the Eight TAAbs in Validation Cohort and Establishment of Diagnostic Model

An independent validation cohort, including 300 LCs, 300 NCs, and 144 BLDs, was then used to validate the above eight TAAbs. As indicated in Figure 4A, all eight TAAbs showed significantly higher level in LCs compared with NCs. Interestingly, the serum levels of four TAAbs (TP53, NPM1, SRSF2, and TSC1) in LCs were significantly higher than BLDs. The AUCs of eight TAAbs for distinguishing LCs from NCs were ranged from 0.556 (95% CI: 0.509–0.602) for FGFR2 to 0.751 (95% CI: 0.710–0.793) for TP53 (Figure 4B), and the sensitivities were 13.7–43.0% at the specificities ≥90% (Supplementary Table 3). Besides, we investigated the correlation of the eight TAAbs and histologies; however, the results revealed that there were no differences among the adenocarcinoma patients, squamous cell carcinoma patients, and small cell lung cancer patients in serum TAAbs (P > 0.05) (data not shown).

FIGURE 4

Figure 4. (A) The expression of autoantibodies against eight TAAs in validation cohort with 300 LCs, 144 BLDs, and 300 NCs. (B) ROC analysis of autoantibodies against eight TAAs for LC and NC groups in validation cohort. C, cancer; B, benign; N, normal; ***P < 0.001; **P < 0.01; *P < 0.05.

In order to explore the optimal diagnostic model with higher diagnostic accuracy than single TAAb for LCs, six modeling methods were performed and compared. Clearly, the model established by the DT C5.0 yield the most remarkable diagnostic performance among the six models (Figure 5), which contain seven TAAbs (TP53, NPM1, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1) and possessed an AUC of 0.897 (95% CI: 0.863–0.924), sensitivity of 94.4%, specificity of 84.9%, and accuracy of 89.9% (Table 2). Meanwhile, it also achieved an excellent achievement in the test set, the AUC, sensitivity, specificity, and accuracy were 0.838 (95% CI: 0.777–0.888), 89.4, 78.2, and 83.3% (Table 2).

FIGURE 5

Figure 5. ROC analysis of multiple models for the differential diagnosis of LCs and NCs in training set and test set of validation cohort.

TABLE 2

Table 2. The performance of multiple models in training set and test set for lung cancer detection.

Evaluation of the Performance of the Optimal Model in Different Stages of LC

According to clinical stages I, II, III, and IV (AGCC), stages I and II of LC were defined as early LC (N = 72) and stages III and IV as late LC (N = 141) (Table 3). For the diagnosis of early LC, TP53 owned the highest AUC (95% CI) of 0.840 (0.782–0.898), while the AUC of DT C5.0 model achieved 0.886 (95% CI: 0.845–0.926). The sensitivity of single TAAb in early LC ranged from 13.9 to 48.6%, while it dramatically increased to 94.4% in DT 5.0 model established by seven TAAbs. However, the specificity of the model (82.7%) was slightly reduced compared with the single TAAb (92.0–95.3%). For the late LC, the AUC (95% CI), sensitivity of DT C5.0 model were 0.864 (0.826–0.902) and 90.1%, which were obviously higher than single TAAb. Yet, the specificity of the model was only reduced about 10% in late LC compared with the single TAAb. Moreover, the accuracies of the model in both early and late stages were close to 90%, which highly improved the results of single TAAbs.

TABLE 3

Table 3. The diagnostic performance of DT 5.0 model and the seven TAAbs in early and late stage LC.

Discussion

In recent years, with the rapid development of proteomics methods, the discovery of new serum biomarkers has been greatly promoted by protein array which is a high-throughput method to screen specific antibody targets against protein samples (25). Hence, the protein array technique was selected for high-throughput screening in current research.

Although one study has utilized protein array to identify TAAbs for LC (26), our research design owned several novel features. First, the protein array was customized based on 138 cancer driver genes which were the key carcinogenic factors that could promote the rapid growth of tumors. On this basis, the possibility of screening out meaningful biomarkers was improved to some extent. Second, the candidate TAAbs were verified and validated in the multiple independent cohorts with more than 1,000 samples, so that the diagnostic value of these TAAbs was very reliable on account of the consistency between ELISA and protein array results. Third, we applied multiple data mining methods to establish diagnostic models and then selected the optimal model, which not only yielded further improvements in diagnostic performance but also avoided the insufficiency of using a single modeling approach.

Cancer is a disease that is caused by the DNA sequence in the genomes of cancer cells changing (20). Besides, cancer driver genes were defined as the important genes which related to the occurrence and development of cancer, and the determination of cancer driver genes is key to advancing diagnostics, therapeutics, and treatments (27). Bert Vogelstein et al. (21) summarized 138 cancer driver genes (74 tumor suppressor genes and 64 oncogenes) which can promote or “drive” tumorigenesis when altered by intragenic mutations. We customized a protein array including 154 human recombinant proteins based on the 138 genes to explore the level of autoantibodies to the proteins encoded by these genes, which integrated the merits of cancer driver gene and TAAb.

Applying the protein array technology, we analyzed the level of autoantibodies against 154 proteins in serum from 100 LCs and 50 NCs. According to multiple statistical analyses and screening criteria, 12 TAAb candidates were rapidly identified in the discovery phase. These TAAbs are all involved in some important carcinogenesis functions (Supplementary Table 2), and eight of them were first discovered in this research for diagnosis of LC. The remaining four TAAbs have been studied in various cancers, including TP53 (28–30), P62 (31, 32), NPM1 (33, 34), and Survivin (35).

In the verification phase, these 12 TAAbs were tested using indirect ELISA in 155 LCs and 155 matched NCs to assess their performance in distinguishing LCs from NCs. Furthermore, eight TAAbs (TP53, NPM1, GNA11, SRSF2, HIST1H3B, FGFR2, TSC1, and PIK3CA) were further selected on account of their excellent performance in verification cohort and subjected to validation cohort with 300 LCs, 300 NCs, and 144 BLDs. The basically consistent results of multistage and multicohort validation testified the reliability of our study. Remarkably, the level of anti-TP53 was found to be statistically significantly higher in LC than NC, which yielded the highest diagnostic value with the AUC (95% CI) of 0.751 (0.710–0.793). Park et al. (36) also found the significance of anti-TP53 in the diagnosis of LC. Besides, it was regrettably found that the majority single TAAbs had lower diagnostic performance for LC, which was similar to the results shown in previous studies (37). In order to improve the diagnostic value, we combined different TAAbs by using diverse data mining methods.

In recent years, various data mining techniques have been widely used to establish cancer diagnostic models, such as logistic regression analysis (38), Fisher discriminant analysis (39), decision tree (40), support vector machine (41), ANN-MLP, and ANN-RBF (42). However, each method has its own strengths and weaknesses, so the current study aimed to build LC diagnostic models through different modeling methods and validate the diagnostic value of each model for LC in a test set for choosing an optimal model. In result, we selected the decision tree model with a seven-TAAb panel (TP53, NPM1, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1) which yield the highest AUCs of 0.897 (95% CI: 0.863–0.924) and 0.838 (95% CI: 0.777–0.888) for distinguishing LCs from NCs in training set and test set. Moreover, the results of the seven TAAbs and the panel of TAAbs in this study showed better discriminatory power for the early-stage LC than the advanced stage (Table 2). The above result may imply that autoantibodies to tumor-associated antigens, as a kind of promising biomarkers produced in early stage of tumorigenesis, could own more chances to be applied for early diagnosis in cancers.

However, as to the limitation, the small sample size of early-stage LCs might limit the expansibility of the value of this diagnostic model. Therefore, in our further research, we will confirm the diagnostic utility of this TAAb panel in a large sample size study to verify our findings, and explore its differential diagnostic performance between benign and malignant pulmonary nodules.

In conclusion, focused protein array based on cancer-driver genes is an effective and fast approach to discovering novel TAAbs. Comprehensive analysis of multiple models established by data mining showed that the DT C5.0 model generated by the combination of seven TAAbs had the highest LC diagnostic value. In consequence, the model may be the auxiliary means for clinicians to diagnose early-stage LC, and it may have a great influence in improving the accuracy of LC diagnosis.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Ethics Statement

The studies involving human participants were reviewed and approved by Medical Ethics Committee of Zhengzhou University (Zhengzhou, China). The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

LD and JZ: conception and design. LD: administrative support. TW, LP, CS, KW, and XW: provision of study materials or patients. DJ, XZ, ML, YW, PW, HY, and JS: collection and assembly of data. DJ and XZ: data analysis, interpretation, and manuscript writing. All authors: final approval of manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 8167291), the Leading Talents of Science and Technology Innovation in Henan Province (Grant No. 20420051008), the Major Project of Science and Technology in Henan Province (Grant No. 16110311400), the Key Project of Discipline Construction of Zhengzhou University (Grant No. XKZDQY202009), and the Project of Basic Research Fund of Henan Institute of Medical and Pharmacological Sciences (Grant No. 2020BP0202).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2021.658922/full#supplementary-material

Abbreviations

ANN-MLP, artificial neural network-multilayer perception; ANN-RBF, artificial neural network-radial basis function; AUC, area under the receiver operating characteristic curve; BLD, benign lung disease; CEA, carcinoembryonic antigen; CI, confidence internal; COPD, chronic obstructive pulmonary disease; CYFRA 21-1, cytokeratin-19 fragment; DT, decision tree; ELISA, enzyme-linked immunosorbent assay; LC, lung cancer; LDCT, low-dose computed tomography; NC, normal control; NSE, neuron-specific enolase; ROC, receiver operating characteristic; SEREX, serological analysis of recombination cDNA expression libraries; SERPA, serological proteome analysis; SNR, signal-to-noise ratio; SVM, support vector machines; TAA, tumor-associated antigen; TAAb, autoantibody to TAA.

References

1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. (2020) 70:7–30. doi: 10.3322/caac.21590

PubMed Abstract | CrossRef Full Text

2. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2018) 68:394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Chen W, Zheng R, Baade PD, Zhang S, Zeng H, Bray F, et al. Cancer statistics in China, 2015. CA Cancer J Clin. (2016) 66:115–32. doi: 10.3322/caac.21338

CrossRef Full Text | Google Scholar

4. Schwartz AG, Cote ML. Epidemiology of lung cancer. Adv Exp Med Biol. (2016) 893:21–41. doi: 10.1007/978-3-319-24223-1_2

CrossRef Full Text | Google Scholar

5. Schabath MB, Cote ML. Cancer progress and priorities: lung cancer. Cancer Epidemiol Biomarkers Prev. (2019) 28:1563–79. doi: 10.1158/1055-9965.EPI-19-0221

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Villalobos P, Wistuba II. Lung cancer biomarkers. Hematol Oncol Clin North Am. (2017) 31:13–29. doi: 10.1016/j.hoc.2016.08.006

CrossRef Full Text | Google Scholar

7. Cho JY. Lung cancer biomarkers. Adv Clin Chem. (2015) 72:107–70. doi: 10.1016/bs.acc.2015.07.003

CrossRef Full Text | Google Scholar

8. Becker N, Motsch E, Trotter A, Heussel CP, Dienemann H, Schnabel PA, et al. Lung cancer mortality reduction by LDCT screening-Results from the randomized German LUSI trial. Int J Cancer. (2019) 146:1503–13. doi: 10.1002/ijc.32486

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Kowall B, Jockel KH, Stang A. Lung cancer screening: current trends. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. (2018) 61:1551–8. doi: 10.1007/s00103-018-2834-8

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Mordente A, Meucci E, Martorana GE, Silvestrini A. Cancer biomarkers discovery and validation: state of the art, problems and future perspectives. Adv Exp Med Biol. (2015) 867:9–26. doi: 10.1007/978-94-017-7215-0_2

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Djureinovic D, Dodig-Crnkovic T, Hellstrom C, Holgersson G, Bergqvist M, Mattsson JSM, et al. Detection of autoantibodies against cancer-testis antigens in non-small cell lung cancer. Lung Cancer. (2018) 125:157–63. doi: 10.1016/j.lungcan.2018.09.012

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Wang J, Jiang W, Zhang T, Liu L, Bi N, Wang X, et al. Increased CYFRA 21-1, CEA and NSE are prognostic of poor outcome for locally advanced squamous cell carcinoma in lung: a nomogram and recursive partitioning risk stratification analysis. Transl Oncol. (2018) 11:999–1006. doi: 10.1016/j.tranon.2018.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Muley T, Rolny V, He Y, Wehnl B, Escherich A, Warth A, et al. The combination of the blood based tumor biomarkers cytokeratin 19 fragments (CYFRA 21-1) and carcinoembryonic antigen (CEA) as a potential predictor of benefit from adjuvant chemotherapy in early stage squamous cell carcinoma of the lung (SCC). Lung Cancer. (2018) 120:46–53. doi: 10.1016/j.lungcan.2018.03.015

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Song KS, Nimse SB, Warkad SD, Oh AC, Kim T, Hong YJ. Quantification of CYFRA 21-1 and a CYFRA 21-1-anti-CYFRA 21-1 autoantibody immune complex for detection of early stage lung cancer. Chem Commun. (2019) 55:10060–3. doi: 10.1039/C9CC03620B

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Zhang JY, Megliorino R, Peng XX, Tan EM, Chen Y, Chan EK. Antibody detection using tumor-associated antigen mini-array in immunodiagnosing human hepatocellular carcinoma. J Hepatol. (2007) 46:107–14. doi: 10.1016/j.jhep.2006.08.010

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Yang G, Xiao Z, Tang C, Deng Y, Huang H, He Z. Recent advances in biosensor for detection of lung cancer biomarkers. Biosens Bioelectron. (2019) 141:111416. doi: 10.1016/j.bios.2019.111416

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Wang T, Liu H, Pei L, Wang K, Song C, Wang P, et al. Screening of tumor-associated antigens based on oncomine database and evaluation of diagnostic value of autoantibodies in lung cancer. Clin Immunol. (2020) 210:108262. doi: 10.1016/j.clim.2019.108262

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wang X, Zhang Y, Sun L, Wang S, Nie J, Zhao W, et al. Evaluation of the clinical application of multiple tumor marker protein chip in the diagnostic of lung cancer. J Clin Lab Anal. (2018) 32:e22565. doi: 10.1002/jcla.22565

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Anderson KS, Cramer DW, Sibani S, Wallstrom G, Wong J, Park J, et al. Autoantibody signature for the serologic detection of ovarian cancer. J Proteome Res. (2015) 14:578–86. doi: 10.1021/pr500908n

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Stratton MR, Campbell PJ, Futreal PA. The cancer genome. Nature. (2009) 458:719–24. doi: 10.1038/nature07943

CrossRef Full Text | Google Scholar

21. Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA Jr, Kinzler KW. Cancer genome landscapes. Science. (2013) 339:1546–58. doi: 10.1126/science.1235122

CrossRef Full Text | Google Scholar

22. Fang R, Zhu Y, Khadka VS, Zhang F, Jiang B, Deng Y. The evaluation of serum biomarkers for non-small cell lung cancer (NSCLC) diagnosis. Front Physiol. (2018) 9:1710. doi: 10.3389/fphys.2018.01710

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Du Q, Yan C, Wu SG, Zhang W, Huang C, Yao Y, et al. Development and validation of a novel diagnostic nomogram model based on tumor markers for assessing cancer risk of pulmonary lesions: a multicenter study in Chinese population. Cancer Lett. (2018) 420:236–41. doi: 10.1016/j.canlet.2018.01.079

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Jiang D, Wang Y, Liu M, Si Q, Wang T, Pei L, et al. A panel of autoantibodies against tumor-associated antigens in the early immunodiagnosis of lung cancer. Immunobiology. (2020) 225:151848. doi: 10.1016/j.imbio.2019.09.007

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Srivastava A, Creek DJ. Discovery and validation of clinical biomarkers of cancer: a review combining metabolomics and proteomics. Proteomics. (2019) 19:e1700448. doi: 10.1002/pmic.201700448

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Pan J, Song G, Chen D, Li Y, Liu S, Hu S, et al. Identification of serological biomarkers for early diagnosis of lung cancer using a protein array-based approach. Mol Cell Proteomics. (2017) 16:2069–78. doi: 10.1074/mcp.RA117.000212

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Watson IR, Takahashi K, Futreal PA, Chin L. Emerging patterns of somatic mutations in cancer. Nat Rev Gene. (2013) 14:703–18. doi: 10.1038/nrg3539

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Silwal-Pandit L, Vollan HK, Chin SF, Rueda OM, McKinney S, Osako T, et al. TP53 mutation spectrum in breast cancer is subtype specific and has distinct prognostic relevance. Clin Cancer Res. (2014) 20:3569–80. doi: 10.1158/1078-0432.CCR-13-2943

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Köbel M, Reuss A, du Bois A, Kommoss S, Kommoss F, Gao D, et al. The biological and clinical value of p53 expression in pelvic high-grade serous carcinomas. J Pathol. (2010) 222:191–8. doi: 10.1002/path.2744

CrossRef Full Text | Google Scholar

30. Kunizaki M, Sawai T, Takeshita H, Tominaga T, Hidaka S, To K, et al. Clinical value of serum p53 antibody in the diagnosis and prognosis of colorectal cancer. Anticancer Res. (2016) 36:4171–5.

Google Scholar

31. Liu W, Wang P, Li Z, Xu W, Dai L, Wang K, et al. Evaluation of tumour-associated antigen (TAA) miniarray in immunodiagnosis of colon cancer. Scand J Immunol. (2009) 69:57–63. doi: 10.1111/j.1365-3083.2008.02195.x

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Qiang L, Zhao B, Ming M, Wang N, He TC, Hwang S, et al. Regulation of cell proliferation and migration by p62 through stabilization of Twist1. Proc Natl Acad Sci USA. (2014) 111:9241–6. doi: 10.1073/pnas.1322913111

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Kumar D, Mehta A, Panigrahi MK, Nath S, Saikia KK. DNMT3A (R882) mutation features and prognostic effect in acute myeloid leukemia in coexistent with NPM1 and FLT3 mutations. Hematol Oncol Stem Cell Therapy. (2018) 11:82–9. doi: 10.1016/j.hemonc.2017.09.004

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Qin J, Wang S, Shi J, Ma Y, Wang K, Ye H, et al. Using recursive partitioning approach to select tumor-associated antigens in immunodiagnosis of gastric adenocarcinoma. Cancer Sci. (2019) 110:1829–41. doi: 10.1111/cas.14013

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Wang YQ, Zhang HH, Liu CL, Xia Q, Wu H, Yu XH, et al. Correlation between auto-antibodies to survivin and MUC1 variable number tandem repeats in colorectal cancer. Asian Pacific J Cancer Prevent. (2012) 13:5557–62. doi: 10.7314/APJCP.2012.13.11.5557

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Park Y, Kim Y, Lee JH, Lee EY, Kim HS. Usefulness of serum anti-p53 antibody assay for lung cancer diagnosis. Arch Pathol Lab Med. (2011) 135:1570–5. doi: 10.5858/arpa.2010-0717-OA

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Pei L, Liu H, Ouyang S, Zhao C, Liu M, Wang T, et al. Discovering novel lung cancer associated antigens and the utilization of their autoantibodies in detection of lung cancer. Immunobiology. (2020) 225:151891. doi: 10.1016/j.imbio.2019.11.026

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Tripepi G, Jager KJ, Dekker FW, Zoccali C. Linear and logistic regression analysis. Kidney Int. (2008) 73:806–10. doi: 10.1038/sj.ki.5002787

CrossRef Full Text | Google Scholar

39. Diaz-Vico D, Dorronsoro JR. Deep least squares fisher discriminant analysis. IEEE Trans Neural Netw Learn Syst. (2019) 31:2752–63. doi: 10.1109/TNNLS.2019.2906302

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Parikh SA, Gomez R, Thirugnanasambandam M, Chauhan SS, De Oliveira V, Muluk SC, et al. Decision tree based classification of abdominal aortic aneurysms using geometry quantification measures. Ann Biomed Eng. (2018) 46:2135–47. doi: 10.1007/s10439-018-02116-w

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Zhi J, Sun J, Wang Z, Ding W. Support vector machine classifier for prediction of the metastasis of colorectal cancer. Int J Mol Med. (2018) 41:1419–26. doi: 10.3892/ijmm.2018.3359

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Burke HB, Goodman PH, Rosen DB, Henson DE, Weinstein JN, Harrell FE Jr, et al. Artificial neural networks improve the accuracy of cancer survival prediction. Cancer. (1997) 79:857–62. doi: 10.1002/(SICI)1097-0142(19970215)79:4<857::AID-CNCR24>3.0.CO;2-Y

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: lung cancer, protein array, tumor-associated antigen, autoantibody, diagnostic model

Citation: Jiang D, Zhang X, Liu M, Wang Y, Wang T, Pei L, Wang P, Ye H, Shi J, Song C, Wang K, Wang X, Dai L and Zhang J (2021) Discovering Panel of Autoantibodies for Early Detection of Lung Cancer Based on Focused Protein Array. Front. Immunol. 12:658922. doi: 10.3389/fimmu.2021.658922

Received: 26 January 2021; Accepted: 23 February 2021;
Published: 23 April 2021.

Edited by:

Jian Zhang, Southern Medical University, China

Reviewed by:

Rongxi Yang, Nanjing Medical University, China
Pingping Chen, University of Miami, United States
Ximing Tang, University of Texas MD Anderson Cancer Center, United States

Copyright © 2021 Jiang, Zhang, Liu, Wang, Wang, Pei, Wang, Ye, Shi, Song, Wang, Wang, Dai and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Liping Dai, bHBkYWlAenp1LmVkdS5jbg==; Jianying Zhang, amlhbnlpbmd6aGFuZ0Bob3RtYWlsLmNvbQ==

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.