- 1Cancer Center, West China Hospital, Sichuan University, Chengdu, China
- 2Department of Thoracic Surgery, West China Hospital, Sichuan University, Chengdu, China
- 3West China School of Medicine, Sichuan University, Chengdu, Sichuan, China
- 4Department of Hematology, West China Hospital, Sichuan University, Chengdu, China
Background: Small cell lung cancer (SCLC) is characterized by extreme invasiveness and lethality. There have been very few developments in its diagnosis and treatment over the past decades. It is urgently needed to explore potential novel biomarkers and drug targets for SCLC.
Methods: Two-sample Mendelian Randomization (MR) was performed to investigate causal associations between SCLC and plasma proteins using genome-wide association studies (GWAS) summary statistics of SCLC from Transdisciplinary Research Into Cancer of the Lung Consortium (nCase = 2,791 vs. nControl = 20,580), and was validated in another cohort (nCase = 2,664 vs. nControl = 21,444). 734 plasma proteins and their genetic instruments of cis-acting protein quantitative trait loci (pQTL) were used, whereas external plasma proteome data was retrieved from deCODE database. Bidirectional MR, Steiger filtering and phenotype scanning were applied to further verify the associations.
Results: Seven significant (p < 6.81 × 10−5) plasma protein-SCLC pairs were identified by MR analysis, including ACP5 (OR = 0.76, 95% CI: 0.67–0.86), CPB2 (OR = 0.90, 95% CI: 0.86–0.95), GSTM3 (OR = 0.45, 95% CI: 0.33–0.63), SHMT1 (OR = 0.74, 95% CI: 0.64–0.86), CTSB (OR = 0.79, 95% CI: 0.71–0.88), NTNG1 (OR = 0.81, 95% CI: 0.74–0.90) and FAM171B (OR = 1.40, 95% CI: 1.21–1.62). The external validation confirmed that CPB2, GSTM3 and NTNG1 had protective effects against SCLC, while FAM171B increased SCLC risk. However, the reverse causality analysis revealed that SCLC caused significant changes in plasma levels of most of these proteins, including decreases of ACP5, CPB2, GSTM3 and NTNG1, and the increase of FAM171B.
Conclusion: This integrative analysis firstly suggested the causal associations between SCLC and plasma proteins, and the identified several proteins may be promising novel drug targets or biomarkers for SCLC.
Introduction
Small Cell Lung Carcinoma (SCLC) is a highly aggressive and malignant subtype of lung cancer, accounting for approximately 14% of all lung cancer cases, characterized by its desperate invasiveness and poor prognosis (Wang et al., 2023). Although initially responsive to chemotherapy and radiation therapy, the majority of SCLC patients would develop treatment resistance within a short period, resulting in a very low overall survival rate (Rudin et al., 2021). While the emergence of immunotherapy has provided a breakthrough in the treatment of SCLC, its clinical efficacy remains unsatisfactory, with most patients experiencing relapse in the short term (Kang et al., 2023). The development of novel drugs for SCLC faces multiple challenges, including rapid growth and early metastasis, complex genetic characteristics, great tumor heterogeneity, and the issue of drug resistance, which make the search for effective drug targets and treatment strategies more difficult (Megyesfalvi et al., 2023; Zhang et al., 2023).
Mendelian randomization (MR) analysis, using single nucleotide polymorphism (SNP) from genome-wide association studies (GWAS) as genetic instruments, is a promising approach for drug target identification, which can effectively avoid confounding factors that exist in observational studies. Advances in genomics and proteomics have bolstered MR-based strategies, leading to potential therapeutic targets for diseases (Reay and Cairns, 2021). However, MR analysis integrating GWAS and protein quantitative trait loci (pQTL) data is lacking for SCLC, a method that can provide crucial insights into its early diagnosis and drug target discovery. Therefore, the objective of this study is to fill this gap, and first investigate plasma proteins as potential drug targets and biomarkers for SCLC by integrating large GWAS and pQTL data through MR analysis.
Methods
Plasma protein quantitative trait loci and GWAS data
We included plasm pQTL data from the integrated version (Zheng et al., 2020) of five previously reported GWAS datasets (Folkersen et al., 2017; Suhre et al., 2017; Emilsson et al., 2018; Sun et al., 2018; Yao et al., 2018) for primary analysis with 738 cis-pQTLs of 734 plasma proteins identified through the following criteria (Figure 1A; Supplementary Table S1): 1) being a cis-acting pQTL; 2) having significant genome-wide association (p-value < 5*10−8); 3) being situated beyond the boundaries of major histocompatibility complex region (chr6, 26–34 Mb); 4) demonstrating independent association with linkage disequilibrium clumping r2 less than 0.001. The GWAS summary dataset for SCLC in the primary analysis was retrieved from Transdisciplinary Research Into Cancer of the Lung (TRICL) Consortium that enrolled 2,791 patients and 20,580 controls. Furthermore, external validation was performed using the plasma pQTL data of 4,907 proteins measured in 35,559 participants from Ferkingstad et al.’s study (Ferkingstad et al., 2021) and the SCLC GWAS summary dataset of 2,664 cases and 21,444 controls from James et al.’s study (McKay et al., 2017).
FIGURE 1. (A). Flowchart of identifying causal plasma proteins for small cell lung cancer (SCLC) by Mendelian Randomization (MR). (B). Volcano plot of the MR analysis for 734 plasma proteins on SCLC risk. OR: odds ratio, per standard deviation increase in plasma protein levels. Dashed horizontal line represented p-value = 6.81 × 10−5. PVE: proportion of variance explained.
Two-sample Mendelian Randomization analysis
Using the “TwoSampleMR” R package (https://github.com/MRCIEU/TwoSampleMR), plasma proteins were utilized as the exposure and SCLC as the outcome to conduct the primary Mendelian Randomization (MR) analysis. If there was only one pQTL available for a specific protein, we employed the Wald ratio method. For cases where two or more genetic instruments were accessible, we applied inverse variance weighted MR (MR-IVW) and performed subsequent heterogeneity analysis (Deng et al., 2022). In the primary analysis, we applied Bonferroni correction to account for multiple testing and set a threshold p-value of 0.05/734 (p-value < 6.81 × 10−5) to select potentially-valid causal proteins, which were then used to perform external validation at a threshold p-value of 0.05. During the external validation process, we also analyzed the results using the other two sets of genetic instruments of significant-variant and genome-wide significant SNPs based on the validation data.
Sensitivity analysis
Significantly-valid genetic instruments for SCLC were identified from the TRICL Consortium dataset for the detection of possible reverse causality by bidirectional MR analysis. Five methods were used to estimate the causal effects, including MR-IVW, MR-Egger, weighted median, simple mode, and weighted mode. Meanwhile, we also performed Steiger filtering to verify the direction of associations between plasma proteins and SCLC. Phenoscanncer analysis (http://www.phenoscanner.medschl.cam.ac.uk) was used to identify the previously reported traits or diseases that shared the same SNPs with our analysis. p-value < 0.05 was considered statistically significant for these results.
Results
The primary MR analysis identified seven significant plasma protein-SCLC pairs (p-value < 6.81 × 10−5; Figure 1B; and Table 1), including acid phosphatase 5 (ACP5), carboxypeptidase B2 (CPB2), glutathione S-transferase Mu 3 (GSTM3), serine hydroxy-methyltransferase 1 (SHMT1), Cathepsin B (CTSB), Netrin G1 (NTNG1), and Family With Sequence Similarity 171 Member B (FAM171B). In detail, higher levels of ACP5 (odds ratio, OR = 0.76, 95%CI: 0.67–0.86, p = 1.97 × 10−5), CPB2 (OR = 0.90, 95%CI: 0.86–0.95, p = 5.29 × 10−5), GSTM3 (OR = 0.45, 95%CI: 0.33–0.63, p = 1.86 × 10−6), SHMT1 (OR = 0.74, 95%CI: 0.64–0.86, p = 5.95 × 10−5), CTSB (OR = 0.79, 95%CI: 0.71–0.88, p = 3.07 × 10−5) and NTNG1 (OR = 0.81, 95%CI: 0.74–0.90, p = 4.60 × 10−5) decreased the risk of SCLC, while the elevated level of FAM171B (OR = 1.40, 95%CI: 1.21–1.62, p = 1.97 × 10−5) increased the risk of SCLC. Furthermore, there was no heterogeneity observed for the analyzed plasma proteins (Supplementary Table S2).
TABLE 1. Mendelian Randomization analysis for plasma proteins significantly with small cell lung cancer after Bonferroni correction.
In the external validation, the same-variant and significant-variant strategies were used to further verify seven plasma proteins identified in the primary analysis, demonstrating that CPB2, GSTM3, NTNG1, and FAM171B also had significant associations with the risk of SCLC (Figure 2). For each protein, all used SNPs (primary, same-variant and significant-variant) showed consistent findings. For example, the elevated level of CPB2 always decreased the risk of SCLC when each of three types of SNP was used (primary: OR = 0.92, 95%CI: 0.85–0.98, p = 0.018; same variant: OR = 0.86, 95%CI: 0.76–0.97, p = 0.018; significant variant: OR = 0.86, 95%CI: 0.76–0.97, p = 0.018).
FIGURE 2. External validation of the causal relationship between small cell lung cancer and seven potential causal plasma proteins by Mendelian Randomization analysis. Same-variant strategy: using same SNP in the primary analysis but with its association in the external-validation data; significant-variant strategy: using the significant SNP in the external-validation data. ACP5 and NTNG1 had no available significant SNP and FAM171B had no same variant as the primary analysis in the external-validation data. OR: odds ratio, per standard deviation increase in plasma protein levels.
To further investigate the reverse causality, we next performed bidirectional MR analysis for seven potential plasma proteins in the primary analysis (genetic instruments shown in Supplementary Table S3), which suggests that SCLC caused significant changes in plasma levels of nearly all of them based on MR-IVW, except SHMT1 and CTSB, which had opposite changes between MR-IVW and other methods (Supplementary Figure S1). Overall, the occurrence of SCLC also caused changes in certain plasma proteins, including decreases of ACP5, CPB2, GSTM3, and NTNG1, and the increase of FAM171B. Finally, the phenotype scanning revealed that none of the identified proteins had reported associations with traits related to SCLC (Supplementary Table S4). Meanwhile, by Phenoscnner analysis, there were three SNPs in the primary analysis shared with other traits or diseases, including rs1692819 for CTSB, rs8067462 for SHMT1, and rs115668827 for NTNG1, which, however, had no explained relationship with SCLC (Supplementary Table S5).
Discussion
As a highly invasive and rapidly progressing type of lung cancer, SCLC often presents a series of challenges, with the majority of patients being diagnosed at an advanced stage (Wang et al., 2022). Currently, there is a lack of effective drugs in clinical practice for SCLC treatment. Early diagnosis is crucial for treatment and prognosis, but SCLC patients typically lack obvious symptoms and biomarkers in their early stages, leading to difficulties in diagnosis. Often, by the time symptoms appear and further investigations are conducted, SCLC tumors have already been the advanced stage or spread, limiting opportunities for effective treatment (Megyesfalvi et al., 2023). Due to its rapid growth and lack of effective biomarkers, early diagnosis of SCLC is particularly challenging. In addition to the diagnostic challenges, treatment options for SCLC remain very limited, and there is a lack of effective drugs in clinical practice (Zhang et al., 2023). While some progress has been made over the past few decades, the progress in SCLC treatment has been relatively slow compared to non-small cell lung cancer. Immunotherapy has made significant progress in cancer treatment, but its applications and survival benefits in SCLC are still limited (Kang et al., 2023). Therefore, the difficulties in early diagnosis of SCLC and the lack of effective drugs are urgent issues to be addressed in clinical practice. It is critical to explore new diagnostic biomarkers and treatment drugs, aiming to provide more effective treatment options and improve the prognosis for SCLC patients.
In the present study, we firstly investigated the potential of plasma proteins as causal factors and biomarkers for SCLC incorporating large proteome datasets. By primary analysis and external validation, CPB2, GSTM3, NTNG1 and FAM171B were found to be associated with SCLC occurrence. However, according to the bidirectional MR analysis, ACP5, CPB2, GSTM3, NTNG1 and FMA171B obtained reliable changes when SCLC occurred, indicating their capabilities as biomarkers for SCLC. Noteworthily, only the increased FMA171B would increase the risk of SCLC, while others may play certain protective roles in SCLC occurrence, suggesting the individual importance of FAM171B. The protein-protein interaction analysis also demonstrated that FAM171B uniquely had no associations with other several proteins (Supplementary Figure S2).
FAM171B is an unidentified protein with its function yet to be clearly elucidated, and several studies have suggested its association with a diaphragmatic hernia and congenital heart disease (Li et al., 2015; Hauptman et al., 2018). FAM171B belongs to the Fam171b protein family, a group of secreted proteins highly selectively expressed in the brain (Hauptman et al., 2018). Therefore, our study, for the first time, identified FAM171B as a potential risk factor and biomarker for SCLC occurrence through large-scale plasma proteomics. However, its specific function and relationship with SCLC still require further investigation and exploration. Besides, other significantly protective protein molecules from SLC, whose potential functions and mechanisms remain unknown, have not been reported to be associated with SCLC., which may have potentials to become emerging drug targets and biomarkers for SCLC.
Several natural limitations are present in our study. First, the primary analysis was conducted based on plasma proteome datasets from multiple studies and measurement inconsistency might have caused bias, though most of the GWAS data sources were aptamer-based. Second, there is only cis-acting SNPs for all proteins, which limited further verifications. However, the SNPs for all proteins we finally identified had strong instruments with large values of F statistics (>10) and proportion of variance explained (PVE; >10%) except for GSTM3 with PVE = 3.8%. Finally, though we performed a series of validations and sensitivity analyses, the results we observed should be carefully explained, which are more suggestive rather than conclusive with regards to the causal identities and biomarkers of these proteins for SCLC. Thus, the future studies are required to validate these associations by developing a clinical cohort of SCLC patients and healthy controls: 1) choosing a group of SCLC patients and a group of healthy control subjects and ensuring that the two groups of samples are matched in terms of age, gender, and other potential confounding factors; 2) collecting plasma samples from each participant and ensuring that the collection method is standardized to minimize potential variability; 3) collecting tumor samples from SCLC patients to validate the expression levels of these potential molecules and their associations with plasma protein levels; 4) comparing the differences in plasma protein levels between SCLC group and the control group; 5) for molecules that do not express within SCLC tumors, potential origins should be further explored by investigating their expressions within normal human tissues of SCLC patients.
Overall, our study first investigates the causal associations between plasma proteins and SCLC by integrative analysis, revealing that circulating CPB2, GSTM3 and NTNG1 decreased SCLC risk, while FAM171B increased SCLC risk. In addition to them, ACP5 also exhibits an inverse causal relationship with SCLC, suggesting their potentials to be novel biomarkers for SCLC. Future studies are required to verify their specific roles in SCLC diagnosis and drug target development.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Ethics Committee of West China Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
YW: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Project administration, Resources, Supervision, Visualization, Writing–original draft, Writing–review and editing. ZW: Formal Analysis, Investigation, Methodology, Resources, Validation, Writing–original draft, Writing–review and editing. YY: Data curation, Investigation, Methodology, Project administration. CH: Data curation, Investigation, Methodology, Project administration, Writing–original draft. LW: Formal Analysis, Methodology, Project administration, Software, Validation, Writing–original draft. KK: Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Project administration, Resources, Visualization, Writing–review and editing. AZ: Data curation, Formal Analysis, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing–original draft.
Funding
This work was supported by National Natural Science Foundation of China (No. 82303773, No. 82303772, No. 82204490, No. 82303694), Natural Science Foundation of Sichuan Province (No. 2023NSFSC1885), Key Research and Development Program of Sichuan Province (No. 23ZDYF2836), 1·3·5 project for disciplines of excellence, West China Hospital, Sichuan University (No. ZYJC21003) and Sichuan Provincial Research Foundation (No. 2023NSFSC0699).
Acknowledgments
We acknowledge the participants and investigators of the TRICL Consortium, GWAS Summary Database and deCODE database.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2023.1266782/full#supplementary-material
SUPPLEMENTARY FIGURE S1 | Bidirectional Mendelian Randomization analysis for small cell lung cancer on plasma levels of seven potential causal proteins. OR: odds ratio, per standard deviation increase in plasma protein levels.
SUPPLEMENTARY FIGURE S2 | Protein-protein interaction network among the suggestive causal proteins.
References
Deng, Y. T., Ou, Y. N., Wu, B. S., Yang, Y. X., Jiang, Y., Huang, Y. Y., et al. (2022). Identifying causal genes for depression via integration of the proteome and transcriptome from brain and blood. Mol. Psychiatry 27 (6), 2849–2857. doi:10.1038/s41380-022-01507-9
Emilsson, V., Ilkov, M., Lamb, J. R., Finkel, N., Gudmundsson, E. F., Pitts, R., et al. (2018). Co-regulatory networks of human serum proteins link genetics to disease. Science 361 (6404), 769–773. doi:10.1126/science.aaq1327
Ferkingstad, E., Sulem, P., Atlason, B. A., Sveinbjornsson, G., Magnusson, M. I., Styrmisdottir, E. L., et al. (2021). Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 53 (12), 1712–1721. doi:10.1038/s41588-021-00978-w
Folkersen, L., Fauman, E., Sabater-Lleal, M., Strawbridge, R. J., Frånberg, M., Sennblad, B., et al. (2017). Mapping of 79 loci for 83 plasma protein biomarkers in cardiovascular disease. PLoS Genet. 13 (4), e1006706. doi:10.1371/journal.pgen.1006706
Hauptman, J. S., Bollo, R., Damerla, R., Gibbs, B., Lo, C., Katz, A., et al. (2018). Coincident myelomeningocele and gastroschisis: Report of 2 cases. J. Neurosurg. Pediatr. 21 (6), 574–577. doi:10.3171/2017.11.Peds17540
Kang, K., Wu, Y., Yao, Z., and Lu, Y. (2023). Tackling the current dilemma of immunotherapy in extensive-stage small cell lung cancer: A promising strategy of combining with radiotherapy. Cancer Lett. 565, 216239. doi:10.1016/j.canlet.2023.216239
Li, Y., Klena, N. T., Gabriel, G. C., Liu, X., Kim, A. J., Lemke, K., et al. (2015). Global genetic analysis in mice unveils central role for cilia in congenital heart disease. Nature 521 (7553), 520–524. doi:10.1038/nature14269
McKay, J. D., Hung, R. J., Han, Y., Zong, X., Carreras-Torres, R., Christiani, D. C., et al. (2017). Large-scale association analysis identifies new lung cancer susceptibility loci and heterogeneity in genetic susceptibility across histological subtypes. Nat. Genet. 49 (7), 1126–1132. doi:10.1038/ng.3892
Megyesfalvi, Z., Gay, C. M., Popper, H., Pirker, R., Ostoros, G., Heeke, S., et al. (2023). Clinical insights into small cell lung cancer: Tumor heterogeneity, diagnosis, therapy, and future directions. CA Cancer J. Clin. doi:10.3322/caac.21785
Reay, W. R., and Cairns, M. J. (2021). Advancing the use of genome-wide association studies for drug repurposing. Nat. Rev. Genet. 22 (10), 658–671. doi:10.1038/s41576-021-00387-z
Rudin, C. M., Brambilla, E., Faivre-Finn, C., and Sage, J. (2021). Small-cell lung cancer. Nat. Rev. Dis. Prim. 7 (1), 3. doi:10.1038/s41572-020-00235-0
Suhre, K., Arnold, M., Bhagwat, A. M., Cotton, R. J., Engelke, R., Raffler, J., et al. (2017). Connecting genetic risk to disease end points through the human blood plasma proteome. Nat. Commun. 8, 14357. doi:10.1038/ncomms14357
Sun, B. B., Maranville, J. C., Peters, J. E., Stacey, D., Staley, J. R., Blackshaw, J., et al. (2018). Genomic atlas of the human plasma proteome. Nature 558 (7708), 73–79. doi:10.1038/s41586-018-0175-2
Wang, Q., Gümüş, Z. H., Colarossi, C., Memeo, L., Wang, X., Kong, C. Y., et al. (2023). Sclc: Epidemiology, risk factors, genetic susceptibility, molecular pathology, screening, and early detection. J. Thorac. Oncol. 18 (1), 31–46. doi:10.1016/j.jtho.2022.10.002
Wang, W. Z., Shulman, A., Amann, J. M., Carbone, D. P., and Tsichlis, P. N. (2022). Small cell lung cancer: Subtypes and therapeutic implications. Semin. Cancer Biol. 86 (2), 543–554. doi:10.1016/j.semcancer.2022.04.001
Yao, C., Chen, G., Song, C., Keefe, J., Mendelson, M., Huan, T., et al. (2018). Genome-wide mapping of plasma protein QTLs identifies putatively causal genes and pathways for cardiovascular disease. Nat. Commun. 9 (1), 3268. doi:10.1038/s41467-018-05512-x
Zhang, C., Zhang, C., Wang, K., and Wang, H. (2023). Orchestrating smart therapeutics to achieve optimal treatment in small cell lung cancer: Recent progress and future directions. J. Transl. Med. 21 (1), 468. doi:10.1186/s12967-023-04338-6
Keywords: drug target, biomarker, small cell lung cancer, plasma proteome, Mendelian Randomization
Citation: Wu Y, Wang Z, Yang Y, Han C, Wang L, Kang K and Zhao A (2023) Exploration of potential novel drug targets and biomarkers for small cell lung cancer by plasma proteome screening. Front. Pharmacol. 14:1266782. doi: 10.3389/fphar.2023.1266782
Received: 25 July 2023; Accepted: 28 August 2023;
Published: 06 September 2023.
Edited by:
Mohamed Abdo Rizk, Mansoura University, EgyptReviewed by:
Mohamed Z. Sayed-Ahmed, Mansoura University, EgyptShimaa El-Sayed, Mansoura University, Egypt
Copyright © 2023 Wu, Wang, Yang, Han, Wang, Kang and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kai Kang, kaikang@wchscu.cn; Ailin Zhao, irenez20@outlook.com
†These authors have contributed equally to this work and share first authorship