Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 21 October 2022
Sec. Gynecological Oncology
This article is part of the Research Topic New Molecular Approaches to Improve Gynecological Cancer Management View all 11 articles

Combining serum peptide signatures with International Federation of Gynecology and Obstetrics (FIGO) risk score to predict the outcomes of patients with gestational trophoblastic neoplasia (GTN) after first-line chemotherapy

Fei Wang&#x;Fei Wang1†Zi-ran Wang&#x;Zi-ran Wang1†Xue-song Ding&#x;Xue-song Ding2†Hua YangHua Yang2Ye GuoYe Guo1Hao SuHao Su2Xi-run WanXi-run Wan2Li-juan WangLi-juan Wang3Xiang-yang JiangXiang-yang Jiang4Yan-hua XuYan-hua Xu5Feng ChenFeng Chen6Wei Cui*Wei Cui6*Feng-zhi Feng*Feng-zhi Feng2*
  • 1Department of Laboratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
  • 2Department of Obstetrics and Gynecology, National Clinical Research Center for Obstetric & Gynecologic Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
  • 3Department of Gynecological Oncology, Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University, Guangzhou, China
  • 4Department of Obstetrics and Gynecology, Shanxi Provincial People’s Hospital, Xian, China
  • 5Department of Obstetrics and Gynecology, Jinan Maternity and Child Health Care Hospital, Jinan, China
  • 6Department of Clinical Laboratory, State Key Laboratory of Molecular Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

Background: Gestational trophoblastic neoplasia (GTN) is a group of clinically rare tumors that develop in the uterus from placental tissue. Currently, its satisfactory curability derives from the timely and accurately classification and refined management for patients. This study aimed to discover biomarkers that could predict the outcomes of GTN patients after first-line chemotherapy.

Methods: A total of 65 GTN patients were included in the study. Patients were divided into the good or poor outcome group and the clinical characteristics of the patients in the two groups were compared. Furthermore, the serum peptide profiles of all patients were uncovered by using weak cation exchange magnetic beads and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Feature peaks were identified by three machine learning algorithms and then models were constructed and compared using five machine learning methods. Additionally, liquid chromatography mass spectrometry was used to identify the feature peptides.

Results: Multivariate logistic regression analysis showed that the International Federation of Gynecology and Obstetrics (FIGO) risk score was associated with poor outcomes. Eight feature peaks (m/z =1287, 2042, 2862, 2932, 2950, 3240, 3277 and 6626) were selected for model construction and validation by the three algorithms. Based on the panel combining FIGO risk score and peptide serum signatures, the neural network (nnet) model showed promising performance in both the training (AUC=0.9635) and validation (AUC=0.8788) cohorts. Peaks at m/z 2042, 2862, 2932, 3240 were identified as the partial sequences of transthyretin, fibrinogen alpha chain (FGA), beta-globin and FGA, respectively.

Conclusion: We combined FIGO risk score and serum peptide signatures using the nnet method to construct the model which can accurately predict outcome of GTN patients after first-line chemotherapy. With this model, patients can be further classified and managed, and those with poor predicted outcomes can be given more attention for developing treatment failure.

Introduction

Gestational trophoblastic neoplasia (GTN) is a group of clinically rare tumors that develop in the uterus from placental tissue, including invasive mole (IM), choriocarcinoma (CC), placental site trophoblastic tumor (PSTT), and epithelioid trophoblastic tumor (ETT) (1). GTN, which had a poor outcome in the last century, can now have a cure rate of over 90% owing to the combination of different treatment regimens and the refined management for high-risk patients (2). Constant monitoring of serum human chorionic gonadotropin (hCG) levels is essential after GTN treatment and persistently elevated hCG is considered a tumor-marker of poor outcome. The International Federation of Gynecology and Obstetrics (FIGO) scoring system has been developed to predict the prognosis of GTN patients. Based on the FIGO scores, patients are divided into a low-risk group (FIGO score < 7), high-risk group (7 ≤FIGO score ≤ 12), and an ultra-high-risk group (FIGO score > 12) (3). However, more personalized biomarkers are desired to help further evaluate the outcomes of GTN patients after first-line chemotherapy.

The low molecular weight (LMW) peptides in the serum contain abundant histological information, which may be useful in the early diagnosis of disease (4). Notably, specific serum peptide patterns are strongly associated with outcomes of cancer patients (5). However, due to the diverse and complex protein/peptide composition of serum, the fractionation and identification of LMW serum peptides was incredibly challenging. With the development of technology, weak cation exchange magnetic beads (MB-WCX) method has been shown to be efficient in capturing low abundance LMW peptides in serum and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) can be applied to the analysis of captured serum peptides (6). Recently, MALDI-TOF MS-based serum peptide signatures have exhibited enormous potential in identifying patients with early lung cancer (7), cervical intraepithelial neoplasia (8), colorectal cancer (9), esophageal squamous cell carcinoma (10), and coronavirus disease 2019 (COVID-19) (11).

In the present study, the serum peptide profiles of 65 patients with GTN were revealed by MALDI-TOF MS combined with MB-WCX. Simultaneously, this study was dedicated to develop a panel based on combining serum peptide signatures and clinical feature to predict the outcome of GTN patients after first-line chemotherapy using a variety of machine learning models.

Materials and methods

Study design and patients population

A prospective collection of peripheral blood of patients with treatment-naïve GTN were conducted within 7 days before the initial therapy at 5 centers from June 2017 to June 2019. Based on FIGO cancer Report and clinical practice in China about management of gestational trophoblastic disease (2, 1214), single-agent dactinomycin chemotherapy was prescribed as first-line treatment for low-risk GTN (FIGO score of less than 5) and multiagent chemotherapy using 5-FU based regimen or EMACO as first-treatment for low-risk GTN with FIGO score 5-6 or high-risk GTN. Chemotherapy response during first-line therapy was monitored by hCG assay at the start of each treatment cycle. Complete remission (CR) was defined as the normalization of hCG for at least 4 consecutive weeks. Chemotherapy resistance is defined by a plateau in hCG (<10% change) or a rise in hCG over two consecutive cycles, and then second-line chemotherapy is considered. The date cut-off of first-line chemotherapy response ended in completion of first-line chemotherapy for patients who have achieved CR (defined as good outcome group), or for patients who experienced chemotherapy resistance (defined as poor outcome group). And then, the data cut-off of patient survival ended in April 2022. Patients who switched to second-line therapy due to toxicity of first-line chemotherapy were excluded. Additionally, given that first-line treatment regimens for PSTT and ETT differ significantly from that for IM and CC, patients with PSTT or ETT were also excluded.

The patient’s medical records including age, histology, antecedent pregnancy, hCG, FIGO score, FIGO stage, and treatment were collected. This study was approved by Chinese Academy of Medical Science, and Peking Union Medical College (approval no. 18-218/1796). The patients provided their written informed consent in accordance with the declaration of Helsinki.

Sample collection and preparation

Blood was collected from all patients in the morning and placed at 37°C for 30min to clot. Then, the blood was centrifuged at 3000 rpm for 15 minutes to obtain the serum. The serum of all patients was stored at -80°C for the next step of serum peptide analysis.

MB-WCX

Serum peptides were extracted using the serum peptide extraction kit (Bioyong Tech, Beijing, China) following the instructions. Firstly, 10μl magnetic beads, 95μl binding buffer and 10μl serum sample were mixed and incubated for 5min at room temperature. Subsequently, the supernatant derived from the mixture by using a magnetic bead separator was added to the wash buffer to remove the high molecular weight peptides/proteins. Finally, the LMW peptides were eluted by adding elution buffer and then subjected to MALDI-TOF MS analysis.

MALDI-TOF MS

The eluted peptide samples were mixed with 5mg/ml α-cyano-4-hydroxycinnamic acid (CHCA) and 1μl mixture was applied to the MALDI-TOF MS target plate then dried naturally for detection. The MALDI-TOF MS instrument (Bioyong Tech, Beijing, China) was calibrated using commercial peptide and protein calibration standards (Sigma-Aldrich, Louis, MO, USA) before assaying the samples. The spectra were automatically captured in linear mode within the range 1000 -10000 mass-to-change ratio (m/z). Each spectrum was normalized, baseline-corrected and smooth-applied using BioExplorer software (Bioyong Tech, Beijing, China). The assay for each sample was repeated three times and the final peak intensity was averaged from the three times.

Liquid chromatography mass spectrometry

Serum peptides were sequenced and identified by using a nano-liquid chromatography electrospray ionization-tandem mass spectrometry (nano LC/ESI-MS/MS), which consists of an Aquity UPLC system (Waters, USA) and a LTQ Orbitrap XL mass spectrometer (Thermo Fisher Scientific, USA) equipped with a nano-ESI source. Prior to LC-MS analysis of the target samples, the instrument was calibrated using the Pierce Retention Time Calibration Mixture (Thermo Fisher Scientific, USA). The peptides were desalted and then separated by UPLC system and analyzed using MS/MS instrument. The obtained raw mass spectra were analyzed by using Proteome Discoverer software (version 2.1, Thermo Fisher Scientific, USA). The UniProt database (UniProt-homo+sapiens.fasta, downloaded 10/08/21) was used to perform the identifications. The detailed search parameters were as follows: enzyme: trypsin, max missed cleavages: 2, fixed modifications: Carbamidomethyl, variable modifications: oxidation, protein N-terminal acetylation and deamidation, peptide mass tolerance value: 10ppm, fragment mass tolerance value: 0.02 Da.

Statistical analysis

All statistical tests were performed using R project (version 4.1.2) and SPSS 20.0 software (SPSS Inc., Chicago, IL, USA). The t-test or Wilcoxon test was used for analysis of continuous data while the chi-square test or Fisher’s exact probability for analysis of categorical data. Progression free survival (PFS) were analyzed using the Kaplan-Meier method and the log-rank test. Multivariate logistic regression analysis was used to identify risk factors for patients with poor outcome. The R packages “caret”, and “nnet” were used for the construction and validation of the models. Receiver operating characteristic (ROC) curves were used to evaluate the diagnostic performance of the model using MedCalc software (version 19.6.1) or Prism GraphPad (version 9.0). P<0.05 was considered statistically significant

Results

Clinical characteristics and outcomes

Between June 2017 and June 2019, 99 patients’ samples were collected at 5 centers. Thirty-four patients were excluded, including 8 patients with non-GTN, 15 patients with the change of chemotherapy regimen due to toxicity of first-line chemotherapy, 6 patients with PSTT, 1 patient with non-gestational CC, and 4 patients with failure of serum peptide extraction. Ultimately, 65 patients met the eligibility criteria.

The clinical characteristics of the 65 GTN patients were summarized in Table 1. The median age of all patients was 32 years old, of which 27 were diagnosed with CC and 38 with IM. Patients had a median FIGO score of 3 and 91% (59 of 65) had obvious lesions located in uterus, or other organs including lung, vagina, liver, spleen, great omentum, and pelvic. Based on the status of response during first-line chemotherapy, 35 patients were classified as good outcomes group and 30 patients were classified as poor outcomes group. As shown in Figure 1A, patients in the good outcomes group had a longer PFS than those in the poor outcomes group (Hazard Ratio=0.10 (0.02-0.61), P=0.0125). Furthermore, patients in the poor outcome group had higher FIGO scores (6 vs 2, P<0.001) and higher proportion of CC (57% vs 29%, P=0.041) than those in the good outcomes group (Table 1). In addition, multivariate logistic regression analysis showed that the FIGO score (Odds Ratio=1.32(1.03-1.75), P=0.04) was associated with poor outcomes. (Figure 1B). As illustrated in Figure 1C, the FIGO risk score exhibited moderate performance in discriminating between patients with different outcomes, with the area under the curve (AUC) of the ROC being 0.749 (P<0.001). Overall, the FIGO risk score was effective in predicting outcome of GTN patients after first-line chemotherapy, but its performance could be further strengthened.

TABLE 1
www.frontiersin.org

Table 1 Clinical characteristics and treatments of GTN patients.

FIGURE 1
www.frontiersin.org

Figure 1 Analysis of the clinical characteristics and outcomes of patients with GTN after first-line chemotherapy (A), disease-free survival (DFS) between two groups with good or poor outcomes; (B), multivariate regression analysis of factors associated with outcomes; (C), ROC analysis using the FIGO risk score to discriminate between different outcome groups.

Serum peptide profiles analysis

The workflow of serum peptide profiles analysis was shown in Figure 2. Serum obtained by centrifugation of blood samples from patients contained a wide range of low and high molecular peptides/proteins. LMW serum peptides were extracted by MB-WCX method and analyzed using MALDI-TOF MS. Representative mass spectra of patients with GTN who had good or poor outcome were shown in Figure 3A, respectively. A total of 37 peaks in the range of 1000-10000 m/z were detected in all 65 patients. The intensities of these peaks are shown in Figure 3B, with different patients exhibiting diverse intensity patterns.

FIGURE 2
www.frontiersin.org

Figure 2 The workflow of serum peptide profiles analysis. Serum from patients containing various peptides/proteins was collected and subsequently processed for analysis using MALDI-TOF MS. The obtained peak patterns were normalized and then the machine learning algorithms were used to perform feature selection. The different machine learning methods were implemented and compared to eventually yield a robust model, which was further evaluated and validated.

FIGURE 3
www.frontiersin.org

Figure 3 Serum peptide patterns in patients with GTN (A), MALDI-TOF spectra of the serum samples from patients with good or poor outcome; (B), heatmap of the intensity distribution of the 37 peptide peaks for all patients.

Selection of feature peaks

To construct a robust model for predicting the outcomes of GTN patient, after normalizing the intensity of the peaks, all patients were split into training cohort (n=48) and validation cohort (n=17) with an allocation of 7:3. In training cohort, three machine learning algorithms were used to screen the feature peaks: partial least-squares-discriminant analysis (PLS-DA), recursive feature elimination (RFE) and random forest (RF). All peaks were ranked according to the relative importance score using above three algorithms and the top 20 peaks were shown in Figures 4A-C. Furthermore, 8 feature peaks (m/z =1287, 2042, 2862, 2932, 2950, 3240, 3277 and 6626) were selected for model construction and validation by taking the intersection of the top 20 peaks filtered by the three algorithms (Figure 4D).

FIGURE 4
www.frontiersin.org

Figure 4 Selecting signature peptide peaks using machine learning algorithms (A), top 20 features prioritized by PLS-DA ranked by the decrease in feature importance scores; (B), top 20 features prioritized by RFE ranked by the decrease in feature importance scores; (C), top 20 features prioritized by RF ranked by the decrease in feature importance scores; (D), final 8 serum peptide features were achieved by taking the intersection of the top 20 features filtered by the three machine learning algorithms.

Model construction and validation

Based on the fact that the FIGO score is a vital factor for poor outcome in GTN patients and the eight characteristic peaks that were screened by three machine learning algorithms, we further explored whether these features could be combined to build a powerful model to predict clinical outcomes. Thus, the above features were applied to construct five machine learning model methods, including neural network (nnet), recursive partitioning (rpart), naive bayes (nb), support vector machine (svm), logistic regression model (lm). We compared the performance of five methods by cross-validation to calculate ROC, sensitivity and specificity. As illustrated in Figure 5A, the nnet method had higher average ROC, sensitivity and specificity values. Overall, our results indicated the nnet method was superior to the other models; therefore, we chosen the nnet method for further analysis. In the training cohort, the confusion matrix of the nnet method showed that only three patients with poor outcomes were misclassified as good outcomes while no patients with good outcomes were misclassified as poor outcomes (Figure 5B). Remarkably, the nnet method reached an AUC of 0.9635 in the training cohort (Figure 5C). In the independent validation cohort, the nnet method also achieved a favorable performance: only one patient with good or poor outcome was misclassified respectively, with the AUC value reaching 0.8788 (Figures 5D, E). The accuracy, error rate, precision, recall, and F1-score of the nnet method in both the training and validation cohorts were shown in Figure 5F. Intriguingly, the AUC for the FIGO score alone to discriminate between good and poor outcome subgroups was 0.786 and 0.568 in the training and validation cohort, respectively (Figure 6). Collectively, the nnet machine learning model constructed by combining the FIGO score and serum peptide signatures had satisfactory classification performance in predicting outcomes of GTN patients after first-line chemotherapy.

FIGURE 5
www.frontiersin.org

Figure 5 Performance of machine learning models based on FIGO risk scores and serum peptide signatures. (A), comparison of the performance among 5 machine learning methods by cross-validation; (B), confusion matrix of the classification results by the nnet model in training cohort; (C), ROC analysis using the nnet model to discriminate between different outcome groups in training cohort; (D), confusion matrix of the classification results by the nnet model in validation cohort; (E), ROC analysis using the nnet model to discriminate between different outcome groups in validation cohort; (F), summary of accuracy, error rate, precision, recall and F1 for the nnet model in the training and validation cohort.

FIGURE 6
www.frontiersin.org

Figure 6 ROC analysis using the FIGO risk score to discriminate between different outcome groups. (A), ROC analysis in the training cohort; (B), ROC analysis in the validation cohort.

Identification of peptide peaks

To further elucidate the role of serum peptide profiles in disease, we used LC-MS/MS for the identification of the peptide peaks. As depicted in Table 2, the amino acid sequences of the four peptide peaks were successfully identified. Peaks at m/z 2042, 2862, 2932, 3240 were identified as the partial sequences of transthyretin, fibrinogen alpha chain (FGA), beta-globin and FGA, respectively.

TABLE 2
www.frontiersin.org

Table 2 Identified candidate peptide biomarkers.

Discussion

Currently, most GTN patients could be cured with the preservation of their reproductive function, which benefits from timely and appropriate initial management and prognostic follow-up. It is essential to implement less-toxic monotherapy for patients at low-risk and aggressive multiagent therapy for patients at high-risk, based on the individual risk factors (15). Constant surveillance of hCG levels is imperative and elevated hCG levels after treatment herald poor outcomes, such as relapse or resistance. In this scenario, it would be meaningful to explore individual difference-based biomarkers for predicting treatment outcomes so as to achieve individualized and refined governance for GTN patients after first-line chemotherapy. It is worth noting that the FIGO risk scoring system includes the evaluation of 8-index scale for age, antecedent pregnancy, interval months from index pregnancy, pretreatment serum hCG, largest tumor size, site of metastases, number of metastases and previous failed chemotherapy (15). Previous publications have highlighted the role of the FIGO risk score in assessing prognosis of patients: a higher FIGO score (≥12) forecasted increased probability of treatment failure and poor outcome (1618). In line with the above studies, we identified the FIGO score as a vital factor for poor outcomes by multivariate logistic regression. Some studies have pointed out that the FIGO risk score is a better predictor of clinical outcomes than single indicators in it (19, 20). On this basis, we attempted to use the ROC curves to determine the power of the FIGO score to predict outcomes and obtained the AUC of 0.749 in our study cohort. It is implied that the FIGO risk score can be used as a valid predictor of outcomes for GTN patients after first-line chemotherapy but there is still room for development.

Proteases are involved in many physiological processes in the body which are crucial for maintaining homeostasis, whilst aberrant protease activity changes are closely associated with tumorigenesis and progression (21). Specifically, proteases can, on the one hand, degrade the extracellular matrix barrier to promote tumor metastasis and progression; on the other hand, cleave proteins in the serum to produce peptide fragments of variable size. In other words, the serum peptide pattern may vary in patients with disparate outcomes. Bedin et al. revealed peculiar changes in the serum peptide profile of colorectal cancer from pre-cancer lesion to metastatic disease, implying the potential usefulness of serum peptide signatures as biomarkers in early diagnosis and prognosis of patients with tumors (22). Therefore, in present study we intended to discover serum peptide signatures that are closely related to the outcome of GTN patients. For all patients, we identified 37 peptide peaks, however not all of them were meaningful. For instance, the peptide peak with m/z=2020 had minor variation in most patients. To avoid over-fitting of the constructed model, it was necessary to include only sensible features. Feature selection based on machine learning algorithms has now been reported to filter out risk factors associated with patient diagnosis and prognosis (11, 23). Hereby, we used three machine learning algorithms (PLS-DA, RFE and RF) to help us recognize valuable peptide peaks. PLS-DA is a supervised multivariate classification method that could be a variable selection method by ranking the most important loadings in decreasing order (24). RFE is a greedy algorithm that builds gene sets by iterating continuously, removing the less important genes and selecting the optimal subset from gene sets (25). RF is an algorithm for estimating variable importance based on multiple decision trees, which measures mainly the mean decrease in accuracy or mean decrease in Gini (24). After integrating the three excellent feature selection algorithms, we have determined eight peptide signatures for further analysis.

Recently, it is a hot topic in the field of tumor diagnosis and prognosis evaluation to use selected features in the training set for model building and training by machine learning methods, and to perform the verification in the independent validation set (2628). In this study, we combined a clinical feature (FIGO score) closely related to outcomes with serum peptide features to generate a prediction panel. Furthermore, models based on this panel were constructed and compared by five machine learning methods (nnet, rpart, nb, svm, lm). Through cross-validation, we found that the neural network (nnet) method outperformed the other methods. The nnet method is considered as a powerful tool for deep learning and artificial intelligence by simulating the functions of the brain’s neural networks to help make decisions (29). The nnet method has been reported to possess enormous strengths in attaining classification of images, and it has been observed to yield a high degree of accuracy in the differentiation of bone marrow cell morphologies (30). She et al. has developed a deep learning survival neural network model which can predict the survival of lung cancer patients and test the reliability of recommended treatments (31). We found that the model constructed using the panel-based nnet method (AUC: 0.9635 in the training cohort; 0.8788 in the validation cohort) showed a substantial improvement in the efficacy of predicting outcomes compared to the FIGO score alone (AUC: 0.786 in the training cohort; 0.568 in the validation cohort).

Machine learning poses a number of challenges when it comes to building accurate models from complicated data, for instance, its complexity and uncertainty make models opaque and difficult to interpret, also known as “black box” (32). On this basis, the selected eight peptides were further identified by using MS/LS-MS to enhance the interpretability of the model. We have determined four of the eight peptides, m/z 2042, 2862, 2932, 3240, which were fragments of transthyretin, FGA, beta-globin and FGA, respectively. Transthyretin, also known as prealbumin, is a homotetramer plasma protein of approximately 55 KD that can transport thyroxine by binding to retinol-binding protein (33). It has been reported that transthyretin, a secreted protein downstream of STAT3, could promote oncogenic gene activation, enhance cytokine function in the tumor microenvironment as well as increase the production of reactive oxygen species to facilitate the progression of lung cancer (34). Swiatly et al. reported that transthyretin was a serum peptide biomarker that contributed to the clinical diagnosis of ovarian cancer (35). Moreover, gestational trophoblastic disease is one of the rare causes of hyperthyroidism (36). Thus, it is implied that transthyretin may be closely related to the pathological course and treatment outcome of patients with GTN. Fibrinogen is the precursor of fibrin, which is the key component of the blood coagulation system (37). Fibrinogen alpha chain (FGA) has been reported as a serum peptide signature being a hallmark of a variety of tumors, including colorectal cancer (9), esophageal squamous cell carcinoma (10), non-small cell lung cancer (38), pancreatic ductal adenocarcinoma (39). Duan et al. have revealed FGA as an attractive target for evaluating prognosis in gastric cancer by using DNA microarray analysis (40). Notably, Goldstein et al. found that patients with trophoblastic disease had increased concentration of fibrinogen along with altered fibrinolytic activity (41). In our study, two of the four successfully identified peptides were fragments of FGA, suggesting that FGA may be an important predictor of outcome for GTN patients. Certainly, the underlying mechanism of which deserves further exploration. The combination of the β globin-encoded polypeptides and the α globin-encoded polypeptides could produce distinct hemoglobin tetramers in red blood cells for oxygen transport, while aberrations in this process may lead to the development of severe hemoglobinopathies β-thalassemia (42). For patients with GTN, chemotherapy is a vital treatment, but it can also bring toxicity, such as anemia (43). Given that β globin is a regulator of the maintenance of erythrocyte function, we hypothesized that the toxicity associated with the treatment would have an imprint on it. Therefore, the β globin peptide fragment in the serum may reflect the degree of toxicity of chemotherapy which in turn provides the evidence of outcome.

Overall, the strength of this study was that combining clinical feature and serum peptide signatures constructed a robust model to predict outcomes for GTN patients after first chemotherapy through machine learning approaches. With this model, patients can be further classified and managed, and those with poor predicted outcomes can be given more attention for developing treatment failure or relapse. We were convinced that our study could provide a novel perspective on the treatment and follow-up of GTN after first chemotherapy.

There were also some limitations in this study. The major point was the limited sample size. Nevertheless, GTN as a rare disease has a low incidence. A larger cohort is desired to appraise the reliability of the model in the future. Apart from this, the amino acid sequences of the remaining four serum polypeptide signatures have not been disclosed. Moreover, since there are only 8 differentially expressed peptides identified, these peptides mass should also be verify using multiple reaction monitoring (MRM) based targeted workflow. Peptides along with two MS/MS fragments need to be targeted for absolute quantitation. However, due to the limited volume of serum retained from patients, we may not be able to perform MRM analysis at this time. In the future, we would enroll larger cohorts of patients to establish the level of these peptide during the disease progression by using MRM technology. Notably, other confirmatory experiments could contribute to establishing the utility of these peptides in the prognostic assessment of GTN patients after first-line chemotherapy.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of National Cancer Center/Cancer Hospital, Chinese Academy of Medical Science, and Peking Union Medical College. The patients/participants provided their written informed consent to participate in this study.

Author contributions

WC and F-ZF contributed to the conception and design of this study. HS, X-RW and L-JW contributed to data acquisition. X-SD and X-YJ contributed to data interpretation and analysis. YX and FC contributed to study supervision. Z-RW and FW contributed to manuscript editing. HY and YG contributed to manuscript revising. All authors contributed to manuscript review.

Funding

This study was supported by the CAMS Innovation Fund for Medical Sciences (CIFMS) (No. 2017-I2M-3-005), Beijing Natural Science Foundation (No. 7202165), National Natural Science Foundation of China (No. 82072361) and the Non-profit Central Research Institute Fund of Chinese Academy of Medical Sciences(2021-PT320-001).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Abu-Rustum NR, Yashar CM, Bean S, Bradley K, Campos SM, Chon HS, et al. Gestational trophoblastic neoplasia, version 2.2019, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw (2019) 17(11):1374–91. doi: 10.6004/jnccn.2019.0053

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Hoekstra AV, Lurain JR, Rademaker AW, Schink JC. Gestational trophoblastic neoplasia: Treatment outcomes. Obstet. Gynecol. (2008) 112(2 Pt 1):251–8. doi: 10.1097/AOG.0b013e31817f58ae

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ngan HYS, Seckl MJ, Berkowitz RS, Xiang Y, Golfier F, Sekharan PK, et al. Update on the diagnosis and management of gestational trophoblastic disease. Int J Gynaecol. Obstet. (2018) 143 Suppl 2:79–85. doi: 10.1002/ijgo.12615

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Tirumalai RS, Chan KC, Prieto DA, Issaq HJ, Conrads TP, Veenstra TD. Characterization of the low molecular weight human serum proteome. Mol Cell Proteomics (2003) 2(10):1096–103. doi: 10.1074/mcp.M300031-MCP200

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Villanueva J, Shaffer DR, Philip J, Chaparro CA, Erdjument-Bromage H, Olshen AB, et al. Differential exoprotease activities confer tumor-specific serum peptidome patterns. J Clin Invest (2006) 116(1):271–84. doi: 10.1172/jci26022

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Qiu F, Liu HY, Zhang XJ, Tian YP. Optimization of magnetic beads for maldi-TOF MS analysis. Front Biosci (Landmark Ed) (2009) 14:3712–23. doi: 10.2741/3482

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Widlak P, Pietrowska M, Polanska J, Marczyk M, Ros-Mazurczyk M, Dziadziuszko R, et al. Serum mass profile signature as a biomarker of early lung cancer. Lung Cancer (2016) 99:46–52. doi: 10.1016/j.lungcan.2016.06.011

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Liu Y, Wei F, Wang F, Li C, Meng G, Duan H, et al. Serum peptidome profiling analysis for the identification of potential biomarkers in cervical intraepithelial neoplasia patients. Biochem Biophys Res Commun (2015) 465(3):476–80. doi: 10.1016/j.bbrc.2015.08.042

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Wang H, Luo C, Zhu S, Fang H, Gao Q, Ge S, et al. Serum peptidome profiling for the diagnosis of colorectal cancer: Discovery and validation in two independent cohorts. Oncotarget (2017) 8(35):59376–86. doi: 10.18632/oncotarget.19587

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Jia K, Li W, Wang F, Qu H, Qiao Y, Zhou L, et al. Novel circulating peptide biomarkers for esophageal squamous cell carcinoma revealed by a magnetic bead-based MALDI-TOFMS assay. Oncotarget (2016) 7(17):23569–80. doi: 10.18632/oncotarget.8123

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Yan L, Yi J, Huang C, Zhang J, Fu S, Li Z, et al. Rapid detection of COVID-19 using MALDI-TOF-Based serum peptidome profiling. Anal Chem (2021) 93(11):4782–7. doi: 10.1021/acs.analchem.0c04590

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Li L, Wan X, Feng F, Ren T, Yang J, Zhao J, et al. Pulse actinomycin d as first-line treatment of low-risk post-molar non-choriocarcinoma gestational trophoblastic neoplasia. BMC Cancer (2018) 18(1):585. doi: 10.1186/s12885-018-4512-5

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Jiang F, Yang K, Wan XR, Xiang Y, Feng FZ, Ren T, et al. Reproductive outcomes after floxuridine-based regimens for gestational trophoblastic neoplasia: A retrospective cohort study in a national referral center in China. Gynecol. Oncol (2020) 159(2):464–9. doi: 10.1016/j.ygyno.2020.08.018

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Li Y, Kong Y, Wan X, Feng F, Ren T, Zhao J, et al. Results with floxuridine, actinomycin d, etoposide, and vincristine in gestational trophoblastic neoplasias with international federation of gynecology and obstetrics scores ≥5. Oncologist (2021) 26(12):e2209–16. doi: 10.1002/onco.13943

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Soper JT. Gestational trophoblastic disease: Current evaluation and management. Obstet. Gynecol. (2021) 137(2):355–70. doi: 10.1097/aog.0000000000004240

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Yang J, Xiang Y, Wan X, Feng F, Ren T. Analysis of the prognosis and related factors for patients with stage IV gestational trophoblastic neoplasia. Int J Gynecol. Cancer (2014) 24(3):594–9. doi: 10.1097/igc.0000000000000070

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Xiao C, Yang J, Zhao J, Ren T, Feng F, Wan X, et al. Management and prognosis of patients with brain metastasis from gestational trophoblastic neoplasia: a 24-year experience in Peking union medical college hospital. BMC Cancer (2015) 15:318. doi: 10.1186/s12885-015-1325-7

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Li J, Yang J, Liu P, Ren T, Zhao J, Feng F, et al. Clinical characteristics and prognosis of 272 postterm choriocarcinoma patients at Peking union medical college hospital: a retrospective cohort study. BMC Cancer (2016) 16:347. doi: 10.1186/s12885-016-2383-1

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Mortakis AE, Braga CA. "Poor prognosis" metastatic gestational trophoblastic disease: the prognostic significance of the scoring system in predicting chemotherapy failures. Obstet. Gynecol. (1990) 76(2):272–7.

PubMed Abstract | Google Scholar

20. Dubuc-Lissoir J, Zweizig S, Schlaerth JB, Morrow CP. Metastatic gestational trophoblastic disease: a comparison of prognostic classification systems. Gynecol. Oncol (1992) 45(1):40–5. doi: 10.1016/0090-8258(92)90488-5

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Yang P, Li ZY, Li HQ. Potential roles of protease inhibitors in cancer progression. Asian Pac. J Cancer Prev (2015) 16(18):8047–52. doi: 10.7314/apjcp.2015.16.18.8047

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Bedin C, Crotti S, Ragazzi E, Pucciarelli S, Agatea L, Tasciotti E, et al. Alterations of the plasma peptidome profiling in colorectal cancer progression. J Cell Physiol (2016) 231(4):915–25. doi: 10.1002/jcp.25196

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Guan X, Zhang B, Fu M, Li M, Yuan X, Zhu Y, et al. Clinical and inflammatory features based machine learning model for fatal risk prediction of hospitalized COVID-19 patients: results from a retrospective cohort study. Ann Med (2021) 53(1):257–66. doi: 10.1080/07853890.2020.1868564

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Gromski PS, Xu Y, Correa E, Ellis DI, Turner ML, Goodacre R. A comparative investigation of modern feature selection and classification approaches for the analysis of mass spectrometry data. Anal Chim Acta (2014) 829:1–8. doi: 10.1016/j.aca.2014.03.039

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Chen Q, Meng Z, Su R. WERFE: A gene selection algorithm based on recursive feature elimination and ensemble strategy. Front Bioeng. Biotechnol (2020) 8:496. doi: 10.3389/fbioe.2020.00496

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Fan NJ, Gao CF, Zhao G, Wang XL, Liu QY. Serum peptidome patterns of breast cancer based on magnetic bead separation and mass spectrometry analysis. Diagn Pathol (2012) 7:45. doi: 10.1186/1746-1596-7-45

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Wang L, Tang C, Xu B, Yang L, Qu L, Li L, et al. Mass spectrometry-based serum peptidome profiling accurately and reliably predicts outcomes of pemetrexed plus platinum chemotherapy in patients with advanced lung adenocarcinoma. PloS One (2017) 12(6):e0179000. doi: 10.1371/journal.pone.0179000

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Zheng D, Ding Y, Ma Q, Zhao L, Guo X, Shen Y, et al. Identification of serum MicroRNAs as novel biomarkers in esophageal squamous cell carcinoma using feature selection algorithms. Front Oncol (2018) 8:674. doi: 10.3389/fonc.2018.00674

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Kriegeskorte N, Golan T. Neural network models and deep learning. Curr Biol (2019) 29(7):R231–r236. doi: 10.1016/j.cub.2019.02.034

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Matek C, Krappe S, Münzenmayer C, Haferlach T, Marr C. Highly accurate differentiation of bone marrow cell morphologies using deep neural networks on a large image data set. Blood (2021) 138(20):1917–27. doi: 10.1182/blood.2020010568

PubMed Abstract | CrossRef Full Text | Google Scholar

31. She Y, Jin Z, Wu J, Deng J, Zhang L, Su H, et al. Development and validation of a deep learning model for non-small cell lung cancer survival. JAMA Netw Open (2020) 3(6):e205842. doi: 10.1001/jamanetworkopen.2020.5842

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Azodi CB, Tang J, Shiu SH. Opening the black box: Interpretable machine learning for geneticists. Trends Genet (2020) 36(6):442–55. doi: 10.1016/j.tig.2020.03.005

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Lim A, Sengupta S, McComb ME, Théberge R, Wilson WG, Costello CE, et al. In vitro and in vivo interactions of homocysteine with human plasma transthyretin. J Biol Chem (2003) 278(50):49707–13. doi: 10.1074/jbc.M306748200

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Lee CC, Ding X, Zhao T, Wu L, Perkins S, Du H, et al. Transthyretin stimulates tumor growth through regulation of tumor, immune, and endothelial cells. J Immunol (2019) 202(3):991–1002. doi: 10.4049/jimmunol.1800736

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Swiatly A, Horala A, Hajduk J, Matysiak J, Nowak-Markwitz E, Kokot ZJ. MALDI-TOF-MS analysis in discovery and identification of serum proteomic patterns of ovarian cancer. BMC Cancer (2017) 17(1):472. doi: 10.1186/s12885-017-3467-2

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Lacka K, Maciejewski A. Rare thyroid non-neoplastic diseases. Thyroid Res (2015) 8:5. doi: 10.1186/s13044-015-0017-3

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Kamath S, Lip GY. Fibrinogen: biochemistry, epidemiology and determinants. Qjm (2003) 96(10):711–29. doi: 10.1093/qjmed/hcg129

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Song Y, Xu X, Wang N, Zhang T, Hu C. MALDI-TOF-MS analysis in low molecular weight serum peptidome biomarkers for NSCLC. J Clin Lab Anal (2022) 36(4):e24254. doi: 10.1002/jcla.24254

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Huang Y, Chen F, Zhang L, Lv Q, Yan J, Cui W. MALDI-TOF-MS analysis in the discovery and identification of the serum peptide pattern of pancreatic ductal adenocarcinoma. Lab Med (2021) 52(6):558–66. doi: 10.1093/labmed/lmab024

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Duan S, Gong B, Wang P, Huang H, Luo L, Liu F. Novel prognostic biomarkers of gastric cancer based on gene expression microarray: COL12A1, GSTA3, FGA and FGG. Mol Med Rep (2018) 18(4):3727–36. doi: 10.3892/mmr.2018.9368

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Goldstein DP, Brakman P, Marshall JR. The plasma fibrinolytic system in patients with trophoblastic disease and undelivered hydatidiform mole. Am J Obstet. Gynecol. (1966) 94(1):21–8. doi: 10.1016/0002-9378(66)90375-9

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Kiefer CM, Hou C, Little JA, Dean A. Epigenetics of beta-globin gene regulation. Mutat Res (2008) 647(1-2):68–76. doi: 10.1016/j.mrfmmm.2008.07.014

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Wang M, Shen L, Xu X, Duan W, Miao J, Kong W, et al. Real-world study of cisplatin, etoposide, and bleomycin chemotherapy regimen in gestational trophoblastic neoplasia. BioMed Res Int (2021) 2021:6661698. doi: 10.1155/2021/6661698

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: gestational trophoblastic neoplasia, serum peptide profiles, machine learning, biomarker, FIGO

Citation: Wang F, Wang Z-r, Ding X-s, Yang H, Guo Y, Su H, Wan X-r, Wang L-j, Jiang X-y, Xu Y-h, Chen F, Cui W and Feng F-z (2022) Combining serum peptide signatures with International Federation of Gynecology and Obstetrics (FIGO) risk score to predict the outcomes of patients with gestational trophoblastic neoplasia (GTN) after first-line chemotherapy. Front. Oncol. 12:982806. doi: 10.3389/fonc.2022.982806

Received: 30 June 2022; Accepted: 06 October 2022;
Published: 21 October 2022.

Edited by:

Sara Ricardo, Universidade do Porto, Portugal

Reviewed by:

Faraz Rashid, Henry Ford Health System, United States
Komsun Suwannarurk, Thammasat University, Thailand

Copyright © 2022 Wang, Wang, Ding, Yang, Guo, Su, Wan, Wang, Jiang, Xu, Chen, Cui and Feng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wei Cui, d2VuZHljdWl3ZWlAc2luYS5jbg==; Feng-zhi Feng, ZmVuZ2Z6MTk2OUBzaW5hLmNvbQ==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.