Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 19 November 2021
Sec. Cancer Imaging and Image-directed Interventions
This article is part of the Research Topic Computational tools in inferring cancer tissue-of-origin and molecular classification towards personalized cancer therapy, Volume III View all 29 articles

Evaluating the Risk of Breast Cancer Recurrence and Metastasis After Adjuvant Tamoxifen Therapy by Integrating Polymorphisms in Cytochrome P450 Genes and Clinicopathological Characteristics

Hui Pang&#x;Hui Pang1†Guoqiang Zhang&#x;Guoqiang Zhang1†Na Yan,&#x;Na Yan2,3†Jidong Lang,Jidong Lang2,3Yuebin Liang,Yuebin Liang2,3Xinyuan XuXinyuan Xu1Yaowen CuiYaowen Cui1Xueya WuXueya Wu1Xianjun LiXianjun Li1Ming ShanMing Shan1Xiaoqin WangXiaoqin Wang2Xiangzhi MengXiangzhi Meng4Jiaxiang LiuJiaxiang Liu4Geng Tian,Geng Tian2,3Li Cai*Li Cai1*Dawei Yuan*Dawei Yuan2*Xin Wang*Xin Wang4*
  • 1Department of Medical Oncology, Harbin Medical University Cancer Hospital, Harbin, China
  • 2Department of Science, Geneis (Beijing) Co., Ltd., Beijing, China
  • 3Department of Science, Qingdao Geneis Institute of Big Data Mining and Precision Medicine, Qingdao, China
  • 4Department of Breast Surgical Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

Tamoxifen (TAM) is the most commonly used adjuvant endocrine drug for hormone receptor-positive (HR+) breast cancer patients. However, how to accurately evaluate the risk of breast cancer recurrence and metastasis after adjuvant TAM therapy is still a major concern. In recent years, many studies have shown that the clinical outcomes of TAM-treated breast cancer patients are influenced by the activity of some cytochrome P450 (CYP) enzymes that catalyze the formation of active TAM metabolites like endoxifen and 4-hydroxytamoxifen. In this study, we aimed to first develop and validate an algorithm combining polymorphisms in CYP genes and clinicopathological signatures to identify a subpopulation of breast cancer patients who might benefit most from TAM adjuvant therapy and meanwhile evaluate major risk factors related to TAM resistance. Specifically, a total of 256 patients with invasive breast cancer who received adjuvant endocrine therapy were selected. The genotypes at 10 loci from three TAM metabolism-related CYP genes were detected by time-of-flight mass spectrometry and multiplex long PCR. Combining the 10 loci with nine clinicopathological characteristics, we obtained 19 important features whose association with cancer recurrence was assessed by importance score via random forests. After that, a logistic regression model was trained to calculate TAM risk-of-recurrence score (TAM RORs), which is adopted to assess a patient’s risk of recurrence after TAM treatment. The sensitivity and specificity of the model in an independent test cohort were 86.67% and 64.56%, respectively. This study showed that breast cancer patients with high TAM RORs were less sensitive to TAM treatment and manifested more invasive characteristics, whereas those with low TAM RORs were highly sensitive to TAM treatment, and their conditions were stable during the follow-up period. There were some risk factors that had a significant effect on the efficacy of TAM. They were tissue classification (tumor Grade < 2 vs. Grade ≥ 2, p = 2.2e−16), the number of lymph node metastases (Node-Negative vs. Node < 4, p = 5.3e−07; Node < 4 vs. Node ≥ 4, p = 0.003; Node-Negative vs. Node ≥ 4, p = 7.2e−15), and the expression levels of estrogen receptor (ER) and progesterone receptor (PR) (ER < 50% vs. ER ≥ 50%, p = 1.3e−12; PR < 50% vs. PR ≥ 50%, p = 2.6e−08). The really remarkable thing is that different genotypes of CYP2D6*10(C188T) show significant differences in prediction function (CYP2D6*10 CC vs. TT, p < 0.019; CYP2D6*10 CT vs. TT, p < 0.037). There are more than 50% Chinese who have CYP2D6*10 mutation. So the genotype of CYP2D6*10(C188T) should be tested before TAM therapy.

Introduction

Hormone receptor-positive (HR+) breast cancer accounts for 75% of all breast cancer patients and is the most common molecular subtype of this disease (1, 2). According to the National Comprehensive Cancer Network 2017 (NCCN2017) (3), HR+ includes estrogen receptor-positive (ER+) and/or progesterone receptor-positive (PR+). Currently, the standard adjuvant endocrine therapy for HR+ breast cancer is 5-year treatment with tamoxifen (TAM) or aromatase inhibitor (AI) (3).

TAM is the earliest and most classical drug in endocrine therapy for breast cancer (47). In 1998, The Early Breast Cancer Trialists’ Collaborative Group (EBCTCG) published a meta-analysis of 37,000 randomized clinical trials in 55 groups in The Lancet. The study suggested that oral TAM for 5 years in HR+ breast cancer patients can reduce the risk of recurrence of early breast cancer by 47% and the risk of death by 26%, with a survival rate improvement of at least 10 years. The efficacy was independent of age, menstrual status, lymph node metastasis, or prior chemotherapy. In 2011, EBCTCG updated the results, further confirming the efficacy of 5-year TAM treatment after surgery, and a continuing effect until 15 years after surgery. These results established the foundation for oral 5-year TAM as a standard protocol for adjuvant endocrine therapy for breast cancer patients (8). In 2017, EBCTCG studied 88 clinical trials with follow-up over 5 to 20 years, which assessed the risk of breast cancer recurrence in 62,923 patients with at least 5-year TAM treatment. It was found that even in patients with low histological grade of T1N0, 10% of patients had developed distant metastasis 20 years later. Therefore, it is necessary to prolong the time of endocrine therapy or strengthen endocrine therapy for patients with high risk of recurrence (9).

The same dose of TAM (10 mg, b.i.d.) was administrated to patients, however, with significantly different effectiveness in individual patients (10), which presents the need for precision medicine (11, 12). This individualized difference in effectiveness could not be fully explained by liver and kidney function, age, lifestyle or a combination of medication, and patient compliance. Genetic factors might play an important decisive role (13, 14). A number of studies have shown that TAM metabolized through the cytochrome P450 enzymes of the liver to the active products 4-hydroxytamoxifen and endoxifen to play pharmacological effects. However, the cytochrome P450 enzyme activity is influenced by its genetic polymorphism.

CYP2D6 is a key enzyme in the metabolic process of TAM, and the relationship between its genetic polymorphism with TAM metabolism and efficacy has attracted much attention (13, 15). Several studies have shown that the CYP2D6 enzymatic activity in breast cancer patients with CYP2D6*3(775delA), CYP2D6*4(G506-1A), CYP2D6*5 (fragment deletion), CYP2D6*10(C188T), and CYP2D6*41 (c.985+39G>A) alleles is reduced; the levels of activated intermediate metabolites 4-hydroxytamoxifen and endoxifen are decreased after TAM treatment; and the recurrence rate of breast cancer is higher, while the survival rate after recurrence is lower (1619). Among Chinese population, the distribution frequency of CYP2D6*10 was shown to be as high as 50%, which was thought to be a major factor affecting in vivo activation efficiency of TAM (20). The serum concentrations of endoxifen in breast cancer patients with CYP2D6*1/*10(CT) and CYP2D6*10/*10(TT) were shown to be decreased more significantly than those of individuals with wild-type CYP2D6 (10).

CYP2C19 is a typical CYP450 enzyme that affects the metabolism of TAM transforming into 4-hydroxytamoxifen, and it also participates in the metabolism of estradiol and estrone (1618). The enzymatic activity of CYP2C19 in patients with CYP2C19*2(G681A) and *3(G636A) alleles is decreased, and the 5-year disease-free survival (DFS) rate is lower than that in patients with CYP2C19 wild type (2123). However, some studies suggest that the activity of CYP2C19 in breast cancer patients with CYP2C19*17(C-806T) allele was enhanced, and the application of TAM treatment was beneficial to those patients (16, 24).

N-Demethylation of TAM is mainly mediated by CYP3A5, and CYP3A5*3(A6986G) mutation reduces the enzymatic activity (9, 25). Goetz et al. showed that the DFS time, DFS rate, and overall survival (OS) rate of breast cancer patients with different CYP3A5 genotypes were similar (26). However, another study (27) found that the recurrence risk of individuals with CYP3A5*3/*3(GG) is significantly decreased after 5 years of TAM therapy, suggesting that CYP3A5 polymorphism might also be an important factor affecting the efficacy of TAM.

In summary, there lacks a unified quantitative indicator to predict the superiority of TAM in the treatment of patients with early HR+ breast cancer. There lacks predictive model to specifically differentiate the patients with recurrence risk after early TAM treatment. We aimed to develop such a model to more efficiently guide such patients for improved DFS from individualized TAM therapy.

Materials and Methods

Patient Selection

In this retrospective study, patients’ information was retrieved from the sample database of the Galactophore Department, Cancer Hospital Affiliated to Harbin Medical University. The keywords used to screen patients from the sample storage management system included invasive breast cancer, HR+, endocrine therapy, and TAM. Briefly, 5,731 patients were retrieved. Among these, patients were excluded based on the following criteria (Figure S1): 1) patients without clinicopathological information or incomplete clinical pathology information; 2) patients with HER2 (3+) or HER2 fluorescence in situ hybridization (FISH) (+) who received trastuzumab treatment; 3) patients without disease progression treated with toremifene or AIs; 4) patients without blood samples; or 5) patients with failed repeated extraction of blood sample. Finally, 256 patients were included in the study.

This was a retrospective study. Informed consent from patients was not required in this study. All samples were retrieved from the sample library of Cancer Hospital Affiliated to Harbin Medical University. This study was approved by the Ethics Committee of Cancer Hospital Affiliated to Harbin Medical University (Ethical No. KY2017-03).

This study involved the clinical information and pathological data that might be related to the incidence of recurrence and metastasis, including tissue classification, the maximum diameter of tumor, the number of lymph node metastases, whether the patients were in menopause or not, patient’s age, and the expression levels of ER, PR, HER2, and Ki67. Recurrence and metastasis were defined as the recurrence of primary lesions, metastasis of axillary lymph nodes, mammary glands, and distant organs. All the patients received TAM 10 mg each time, twice daily. In addition, all patients’ information and blood samples used in this study were obtained following the approval from the hospital.

Detection of Genetic Polymorphism

In recent years, time-of-flight mass spectrometry (TOFMS) has become a very effective method for genotyping single-nucleotide polymorphisms (SNPs). TOFMS can detect genotypes rapidly and efficiently (28). Several SNP genotyping methods have been implemented with a high degree of automation and are being applied for large-scale association studies. It is the working principle of TOFMS. Firstly, a segment of DNA containing SNP site was amplified by PCR (about 50 bp before and after SNP site). And then SAP enzyme was used to remove the dNTP and the primers in the PCR system. A single-base extension primer was added in which three “terminal base” was next to SNP site and used four kinds of ddNTP instead of dNTP (ddNTP corresponds to the allele of SNP locus). So only one base is extended at the SNP locus. TOFMS was used to detect the difference of the molecular weight between the extended product and the non-extended primer, which can determine the base at this point (29, 30).

Genetic polymorphisms were assessed using DNA extraction from retained blood samples and were examined based on TOFMS platform and multiplex long PCR. TOFMS platform obtained the genotypes at nine loci of three genes at a time. We designed three primers for each site, and the primer sequence information was provided in the Supplementary Materials (Table S1, S2). The primers were designed by an online software named Agena Bioscience (www.agenacx.com). The deletion of CYP2D6*5 fragments was obtained by multiplex long PCR.

Derivation of the Prediction Model

A total of 256 patients who met the inclusion criteria were divided into the training cohort and test cohort according to the surgery time before or after June 1, 2013 (Figure 1 and Figure S2). A total of 117 patients were assigned to the training cohort, while the remaining 139 patients were assigned to the test cohort (Table S6). The training cohort was used to analyze the correlation between clinicopathological factors, gene locus polymorphism, the recurrence and metastasis of disease. The test cohort was used to test the performance of the algorithm model. The training cohort and test cohort both included patients who developed recurrence and metastasis, as well as those without disease progression during 5 years of clinical follow-up. All patients’ clinical information and detection results of genetic polymorphism were assigned a value based on the degree of disease malignancy and TAM metabolic enzyme activity, which was used for subsequent mathematical statistics (Table 1 and Tables S1, S3–S5).

FIGURE 1
www.frontiersin.org

Figure 1 Schematic for development of TAM RORs. In total, 256 patients were eligible for analysis. Samples were split into training and independent test sets by the surgery time. The training set was used to tune the parameters and select the best model using five-fold cross-validation. After training, the test set was used to independently assess the performance of the final model. TAM RORs, tamoxifen risk-of-recurrence score.

TABLE 1
www.frontiersin.org

Table 1 Patients’ information and tumor characteristics.

We constructed a model to predict the risk of breast cancer recurrence and metastasis by clinicopathological factors and gene locus polymorphism. The Random Forest algorithm was used for assessing the importance of all known features. We selected top features according to MeanDecreaseGini score of the Random Forest algorithm. The top features were used as the input for further logistic regression analysis to predict the risk of breast cancer recurrence and metastasis.

There are hyper-parameters in our model including number of features, tree number, and link function. A grid search algorithm was used to select the hyper-parameters as below.

Feature Number = [3, 6, 9, 12, 15, 18]

Tree Number = [1000, 10000, 20000]

Feature Importance Index = [“MeanDecreaseGini”, “MeanDecreaseAccuracy

(absolute value)”, “MeanDecreaseAccuracy”].

We used the five-fold cross-validation to select these hyper-parameters. The results are shown in Table 2. As a result, we set the feature number to be 9, Feature Importance Index to be “MeanDecreaseGini,” and Tree num to be 20,000 (Table 2). After that, the model with these hyper-parameter set was trained by the whole training dataset. The whole process can be found in Table 3.

TABLE 2
www.frontiersin.org

Table 2 Hyper-parameter selection by cross-validation.

TABLE 3
www.frontiersin.org

Table 3 The process of building TAM RORs model.

In the training cohort (N = 117), the Random Forest algorithm was used for the importance assessment of 19 variables including patient’s age, whether the patient was menopausal or not, the number of lymph node metastases, the maximum diameter of tumor, tissue classification, and the expression levels of ER, PR, HER2, Ki67, CYP2D6 *2, *3, *4, *5, *10, *41, CYP2C19 *2, *3, *17, and CYP3A5*3 (Figure S3). Ten variables with MeanDecreaseGini score less than 2.1 were excluded; the remaining nine variables were used for further logistic regression analysis. As a result, TAM RORs (1) = −2.74 + 3.54Grade + 0.75LN + 0.28CYP2C19*2 + 0.49PR + 0.31CYP2D6*10 + 1.11ER − 0.1CYP3A5*3 − 0.28Ki67 − 0.37Size was obtained. Then, TAM RORs (1) were converted into binary results. Specifically, a patient is considered to have high risk of breast cancer recurrence and metastasis if TAM RORs (2) = 1/(1 + e-TAM RORs(1)) is greater or equal to 0.175 and have low risk if the value is less than 0.175. The cutoff 0.175 is trained by the training dataset, by which we obtained a training area under the curve (AUC) of 0.87. Lastly, the TAM RORs of each patient in the test cohort were calculated to verify the performance of the model in an independent testing data (AUC = 0.86) (Table S7 and Figure S4). We also tested the other two methods: neural network and support vector machine (SVM).

SVM is one of the popular supervised learning algorithms. It is used for Classification as well as Regression problems. Primarily, it is used for Classification situation in machine learning. The aim of the SVM algorithm is to find the best line or decision boundary, which can divide n-dimensional space into classes in order to put the new data point in the correct space easily in the future. This best decision boundary is termed a hyperplane. SVM selects the extreme vectors that help in finding the hyperplane. These extreme cases are named as support vectors, and then the algorithm is termed as SVM.

Neural networks are used almost in every machine learning application because of their reliability and mathematical power. Each neuron in the neural networks is divided into different groups according to the order of receiving information. Each group can be regarded as a neural layer. The neurons in each layer receive the output of the neurons in the previous layer and output to the neurons in the next layer. The information in the whole network propagates in one direction, and there is no reverse information propagation. The feedforward network can be represented by a directed acyclic graph. The feedforward network can be regarded as a function, and the complex mapping from input space to output space is realized through the multiple compositions of simple non-linear functions. The network structure is simple and easy to implement.

In this article, we applied four-level neural networks on our classification problem by using R programming.

R version 3.4.3 (2017-11-30) was used for the classification and training of the prediction model (Figure 1).

Statistical Analysis

The statistical package stats of R version 3.4.3 software was used for statistical analysis. We studied the difference of clinicopathological variables or genotypes between two groups with different TAM RORs value. Specifically, we conducted Wilcoxon’s test, adjusted the p-values by the Benjamini–Hochberg method, and added the p-values to ggplot for box blots and dot plots (Figures S6, S7). Friedman’s test was adopted to discover the significant difference between logistic regression, the SVM algorithms and Feedforward Neural Network and the compared algorithms on the test dataset.

Results

A Logistic Regression Model to Predict the Performance of Tamoxifen Adjuvant Therapy

In the training cohort (N = 117), 75 (64.10%) patients obtained high TAM RORs (2) scores, while 42 (35.90%) patients obtained low TAM RORs (2) scores. Among 75 patients with high TAM RORs (2) scores, 42 (56.00%) patients developed recurrence and metastasis after TAM treatment. Among 42 patients with low TAM RORs (2) scores, 35 (83.33%) patients had no disease progression during the follow-up period. Therefore, we speculated that patients with high TAM RORs (2) scores had poor prognosis and were more likely to exhibit invasive tumor characteristics, while those with low TAM RORs (2) scores had good prognosis and stable disease control after TAM treatment (Table 4).

TABLE 4
www.frontiersin.org

Table 4 TAM RORs predicts recurrence and metastasis in breast cancer.

In the test cohort, the recurrence and metastasis probability of breast cancer patients after TAM treatment was evaluated, and our hypothesis was verified. Among 139 patients in the test dataset, 80 (57.55%) patients obtained high TAM RORs, while 59 (42.45%) patients obtained low TAM RORs. Among 80 patients with high TAM RORs, 52 (65.00%) developed recurrence and metastasis after TAM treatment. In addition, among 59 patients with low TAM RORs, 51 (86.44%) patients did not have disease progression during the follow-up period. Therefore, we verified that the sensitivity and specificity of TAM RORs (2) were 86.67% and 64.56%, respectively (Table 2). Moreover, our hypothesis was verified: breast cancer patients with high TAM RORs (2) were always less sensitive to TAM treatment and had tumor invasion occurrence. Conversely, breast cancer patients with low TAM RORs have always high sensitivity to TAM treatment and had stable disease control (Figures 2 and 3, Table S4, Figure S5).

FIGURE 2
www.frontiersin.org

Figure 2 TAM RORs predicts recurrence and metastasis in breast cancer. Box-whisker plots of TAM RORs (1) values. TAM RORs, tamoxifen risk-of-recurrence score.

FIGURE 3
www.frontiersin.org

Figure 3 Disease-free survival (DFS) in TAM RORs. TAM RORs, tamoxifen risk-of-recurrence score.

Major Risk Factors for Tamoxifen Resistance

According to the TAM RORs value, the effects of breast cancer patients’ information and tumor characteristics on the efficacy of TAM were observed. Among patients’ information and their tumor characteristics, four factors including the number of lymph node involvement, the expression levels of ER and PR, tumor diameter, and tumor tissue classification had a significant effect on the efficacy of TAM (Node-Negative vs. Node < 4, p = 5.3e−07; Node < 4 vs. Node ≥ 4, p = 0.003; Node-Negative vs. Node ≥ 4, p = 7.2e−15; ER < 50% vs. ER ≥ 50%, p = 1.3e−12; PR < 50% vs. PR ≥ 50%, p = 2.6e−08; tumor Size < 2cm vs. tumor Size ≥ 2cm, p = 0.013; tumor Grade < 2 vs. Grade ≥ 2, p = 2.2e−16; Figure S6). Similarly, the effect of TAM metabolism-related gene locus polymorphisms on the efficacy of TAM in breast cancer patients was observed. We detected 10 loci in CYP2D6, CYP3A5, and CYP2C19. Among these loci, three loci, i.e., CYP2D6*10(C188T), CYP2C19*2(G681A), and CYP3A5*3(A6986G), had a significant effect on the efficacy of TAM (CYP2D6*10(C188T): CC vs. TT, p < 0.019; CT vs. TT, p < 0.037; CYP2C19*2(G681A): GG vs. GA, p < 0.02; GG vs. AA, p < 0.0043; CYP3A5*3(A6986G): AG vs. GG, p < 0.0029; AA vs. GG, p < 0.038; Figure S7). There were significant differences between wild type and CYP2D6*10/*10 (p < 0.019), as well as between CYP2D6*1/*10 and CYP2D6*10/*10 (p < 0.037). Patients with CYP2D6*10 were more likely to obtain higher TAM RORs (1) scores, and their disease progression occurred during the follow-up period. There are more than 50% Chinese whose have CYP2D6*10 mutation. So it is very important to know which genotypes CYP2D6*10(C188T) patients have. The genetic polymorphism of this locus significantly affected the activation efficiency of TAM in vivo and is an important predictor of prognosis in TAM treatment. There was also a significant difference between wild type and CYP3A5*1/*3 (p < 0.038). However, a significant difference existed between CYP3A5*1/*3 and CYP3A5*3/*3 (p < 0.003): 53.08% CYP3A5*3/*3 patients obtained low TAM RORs, and no disease progression was observed during the follow-up period. The results were similar to those reported by Wegman et al., who suggested that the risk of recurrence in individuals with CYP3A5*3/*3 significantly decreases after 5 years of TAM treatment (27).

Comparison of the Predictive Effectiveness

Clinically, STEPP analysis is used to predict the risk of recurrence in premenopausal patients with HER2-negative /HR+ early breast cancer. The prediction results of Subpopulation Treatment Effect Pattern Plot (STEPP) analysis were used to determine whether Ovarian Function Suppression (OFS) should be performed in conjunction with endocrine therapy. However, this method only involves clinicopathological indicators, and it did not take into account the impact of TAM metabolism.

We screened 132 patients with HER2-negative premenopausal breast cancer from 256 patients. Among them, 44 patients had breast cancer recurrence, whereas 88 patients had no recurrence or metastasis in 5 years of follow-up. The prognosis was predicted by STEPP score. Among the 44 patients with breast cancer recurrence, 39 patients were diagnosed as “medium-high risk,” and 34 patients as “low risk” among the 88 patients without recurrence and metastasis. The predictive sensitivity was 88.64% and specificity was only 38.64% in the screened 265 patients. The prognosis was predicted by TAM RORs. Among the 44 patients with breast cancer recurrence, 38 patients with high risk of recurrence were identified, and 40 patients with “low risk” of recurrence were identified among 88 patients without recurrence metastasis. The predictive sensitivity of TAM RORs in 265 patients reached 86.36% and specificity reached 54.54%. The comparison of the two formulas can be found to be equally excellent in predicting patients with high recurrence risk. However, in the prediction of low-risk patients, TAM RORs showed better specificity.

We have tested other two methods among which deep learning has the best performance (AUC: 0.88). The sensitivity and specificity of SVM are lower than those of the logistic regression. That is, the predictive ability of SVM is inferior to that of the logistic regression. On the other hand, the performance of Fully Connected Feedforward Neural Network is slightly better than that of the current model.

To judge whether or not our approaches were statistically significant, Friedman’s test was conducted at α = 0.05 in terms of TAM RORs from 139 patients in the test dataset. TAM RORs of each patient from the three algorithms were the data used to conduct Friedman’s test. The results of algorithms on 139 patients of the test dataset are shown in Table 5. Here, χ2 is the chi-square, df is the degree of freedom, and p is the p-value. Friedman’s test results told us that there was no strong significant difference between the three compared algorithms because p-value was not less than the specific value of alpha, which is set to be 0.05.

TABLE 5
www.frontiersin.org

Table 5 Results of Friedman’s test between our approaches and the three compared algorithms.

Friedman’s test results told us that there was no strong significant difference between the three compared algorithms. However, the Fully Connected Feedforward Neural Network is implicit and has explicit formula for predicting, which is difficult in mining the biological mechanisms behind the prediction and it. Thus, we finally chose the logistic regression in this study.

Discussion

TAM, as a standard drug for the endocrine therapy of breast cancer (3), effectively reduces the recurrence and mortality of HR+ breast cancer patients (4, 8, 31, 32). Nonetheless, 50% of the patients did not benefit from it, which presents a great concern in clinical practice (3335). The prognosis of ER-positive breast cancer patients has always been a hot topic in clinical research. At present, there are five recognized clinical prediction methods (3, 3641): 1) Oncotype Dx recurrence score (RS), 2) PAM50-based Prosigna risk of recurrence (ROR), 3) Breast Cancer Index (BCI), 4) EndoPredict (EPclin), and 5) MammaPrint Netherland Kanker Institute 70-gene signature. These prognosis prediction methods are all based on multigene expression profiles. Compared with independent clinical factors, multigene expression profiles combined with clinical factors have shown significantly predictive efficiency (42). Nevertheless, following the progression of TAM metabolic mechanism research, it has been shown that being the activation product of TAM, endoxifen has high activity to inhibit the growth of tumor by competitive binding of ER with estradiol and blocking estrogenic effect. This process is regulated by multiple enzymes including CYP2D6, CYP2C19, and CYP3A5 (4345). The activities of these enzymes are influenced by the genetic polymorphism at different loci, thus showing individual differences (2325). These enzymes further reveal the individual differences in drug sensitivity of TAM and the efficacy of treatment. At present, the genetic polymorphisms affecting the metabolic efficiency of TAM had not been covered by any algorithm model of clinical prognostic assessment, which can be interpreted as a knowledge gap to predict the prognosis of ER-positive breast cancer patients with TAM treatment (3641, 4649). The present study was based on the factors affecting TAM metabolic efficiency, combined with clinicopathological information necessary to establish an algorithm model to evaluate the benefit of patients after TAM treatment.

The efficacy of TAM was predicted by three methods, which have the same training set and test set. They are Logistic Regression, SVM, and Fully Connected Feedforward Neural Network. In SVM, when kernel = “linear,” the sensitivity and specificity are 58% and 92%; when kernel = “sigmoid,” the sensitivity and specificity are 58% and 92%; and when kernel = “radial,” the sensitivity and specificity are 65% and 89.9%. The predictive ability of SVM is inferior to that of the logistic regression. In the Fully Connected Feedforward Neural Network, set the number of network layers to 4 and the middle-hidden layers to 2, the sensitivity and specificity are 86.67% and 73.41%, respectively. On the other hand, Friedman’s test results told us that there was no strong significant difference between our approaches and the three compared algorithms. Since the Fully Connected Feedforward Neural Network has bad interpretability, in considering the practical application, we finally chose the logistic regression.

There were limitations in this study. This was a retrospective study based on the collection of clinical samples. Although we have included as much as possible of the available clinical information, the clinical treatments for invasive breast cancer still had a certain impact on the study; e.g., after surgical treatment, patients received routine chemotherapy before they underwent TAM endocrine therapy. This might explain why a small number of patients had high TAM RORs, suggesting a higher risk of recurrence, but there was no disease progression found in actual clinical follow-up. Nonetheless, although the datasets had some limitations, they performed well in the current verification. We also tried to enroll as many patients as possible in the study to further verify our TAM RORs. In addition, we adopted a simple logistic regression model to perform the prediction, which might not be optimal. Deep learning and network-based methods have been proven to be effective in many similar prediction problems (5055), which will be tested in the future to improve the prediction accuracy.

In the analysis of genetic polymorphisms in 256 patients, we found that the genotype frequency of CYP2C19*2(G681A) was different from that in the National Center for Biotechnology Information (NCBI) SNP database. It was CYP2C19*2(G681A) AA genotype (in our study: 7.03% vs. China population frequency from NCBI: 0%) (Table S8). We collected 243 blood samples from healthy individuals in Northeastern China and verified this phenotype: CYP2C19*2(G681A) AA genotype 8.2%. It shows the distribution characteristics of CYP2C19*2(G681A) in Northeast China.

Over the recent years, with the rapid development of precision medicine, gene detection has become more and more important for cancer diagnosis, prognosis, and drug selection (5662). Several studies have shown that the metabolic efficiency of TAM was related to the genetic polymorphisms of certain P450 enzymes, thereby affecting the efficacy of drug therapy. Based on this theory, our TAM RORs can be used to predict the efficacy of TAM treatment and improve personalized endocrine therapy in patients with invasive breast cancer.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics Statement

Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

Conception and design: LC, DY, and XWa. Study development and methods: NY, GZ, and HP. Medical and technical support: DY, NY, and HP. Collection and assembly of data: GZ, HP, XX, YC, GT, XWu, XL, MS, and XQW. Data analysis and interpretation: NY, JDL, YBL, XM, and JXL. Manuscript writing: NY, XWa, GZ, and HP. All authors contributed to the article and approved the submitted version.

Funding

This study was funded by the National Key Research and Development Program of China (Grant No. 2019YFE0110000), National Natural Science Foundation of China (Grant No. 82072097), Science and Technology Fund of Heilongjiang Province of China (Grant Number LH2020H126), and Hai Yan Fund from Harbin Medical University Cancer Hospital [Grant Number JJZD2020-06], and National Cancer Center climbing Foundation (NCC201808B014).

Conflict of Interest

NY, JDL, YBL, XQW, GT, and DY were employed by company Geneis (Beijing) Co. Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We wish to thank all members from Harbin Medical University Cancer Hospital for assisting with the specimen handling and management. Also, we would like to thank Guoliang Zang, Xinjie Wang, Yang Liu, and Zundong Liu for performing the molecular biology experiments.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2021.738222/full#supplementary-material

References

1. Osborne CK, Schiff R. Mechanisms of Endocrine Resistance in Breast Cancer. Annu Rev Med (2011) 62:233–47. doi: 10.1146/annurev-med-070909-182917

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Schettini F, Buono G, Cardalesi C, Desideri I, De Placido S, Del Mastro L. Hormone Receptor/Human Epidermal Growth Factor Receptor 2-Positive Breast Cancer: Where We are Now and Where We Are Going. Cancer Treat Rev (2016) 46:20–6. doi: 10.1016/j.ctrv.2016.03.012

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Gradishar WJ, Anderson BO, Balassanian R, Blair SL, Burstein HJ, Cyr A, et al. Breast Cancer, Version 4.2017, NCCN Clinical Practice Guidelines in Oncology. J Natl Compr Cancer Netw JNCCN (2018) 16:310–20. doi: 10.6004/jnccn.2018.0012

CrossRef Full Text | Google Scholar

4. Davies C, Pan H, Godwin J, Gray R, Arriagada R, Raina V, et al. Long-Term Effects of Continuing Adjuvant Tamoxifen to 10 Years Versus Stopping at 5 Years After Diagnosis of Oestrogen Receptor-Positive Breast Cancer: ATLAS, a Randomised Trial. Lancet (London England) (2013) 381:805–16. doi: 10.1016/S0140-6736(12)61963-1

CrossRef Full Text | Google Scholar

5. Fang Y, Xu P, Yang J, Qin Y. A Quantile Regression Forest Based Method to Predict Drug Response and Assess Prediction Reliability. PloS One (2018) 13:e0205155. doi: 10.1371/journal.pone.0205155

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Liu C, Wei D, Xiang J, Ren F, Huang L, Lang J, et al. An Improved Anticancer Drug-Response Prediction Based on an Ensemble Method Integrating Matrix Completion and Ridge Regression. Mol Ther Nucleic Acids (2020) 21:676–86. doi: 10.1016/j.omtn.2020.07.003

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Liu X, Yang J, Zhang Y, Fang Y, Wang F, Wang J, et al. A Systematic Study on Drug-Response Associated Genes Using Baseline Gene Expressions of the Cancer Cell Line Encyclopedia. Sci Rep (2016) 6:22811. doi: 10.1038/srep22811

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Davies C, Godwin J, Gray R, Clarke M, Cutter D, Darby S, et al. Relevance of Breast Cancer Hormone Receptors and Other Factors to the Efficacy of Adjuvant Tamoxifen: Patient-Level Meta-Analysis of Randomised Trials. Lancet (London England) (2011) 378:771–84. doi: 10.1016/S0140-6736(11)60993-8

CrossRef Full Text | Google Scholar

9. Lindsay CR, Faugeroux V, Michiels S, Pailler E, Facchinetti F, Ou D, et al. A Prospective Examination of Circulating Tumor Cell Profiles in Non-Small-Cell Lung Cancer Molecular Subgroups. Ann Oncol Off J Eur Soc Med Oncol (2017) 28:1523–31. doi: 10.1093/annonc/mdx156

CrossRef Full Text | Google Scholar

10. Kiyotani K, Mushiroda T, Imamura CK, Tanigawara Y, Hosono N, Kubo M, et al. Dose-Adjustment Study of Tamoxifen Based on CYP2D6 Genotypes in Japanese Breast Cancer Patients. Breast Cancer Res Treat (2012) 131:137–45. doi: 10.1007/s10549-011-1777-7

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Cai Y, Huang T, Yang J. Applications of Bioinformatics and Systems Biology in Precision Medicine and Immunooncology. BioMed Res Int (2018) 2018:1427978. doi: 10.1155/2018/1427978

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Yang J, Liao B, Zhang T, Xu Y. Bioinformatics Analysis of Single Cell Sequencing Data and Applications in Precision Medicine. Front Genet (2020) 10:1358. doi: 10.3389/fgene.2019.01358

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Walko CM, McLeod H. Use of CYP2D6 Genotyping in Practice: Tamoxifen Dose Adjustment. Pharmacogenomics (2012) 13:691–7. doi: 10.2217/pgs.12.27

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Barginear MF, Jaremko M, Peter I, Yu C, Kasai Y, Kemeny M, et al. Increasing Tamoxifen Dose in Breast Cancer Patients Based on CYP2D6 Genotypes and Endoxifen Levels: Effect on Active Metabolite Isomers and the Antiestrogenic Activity Score. Clin Pharmacol Ther (2011) 90:605–11. doi: 10.1038/clpt.2011.153

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Okishiro M, Taguchi T, Jin Kim S, Shimazu K, Tamaki Y, Noguchi S. Genetic Polymorphisms of CYP2D6 10 and CYP2C19 2, 3 are Not Associated With Prognosis, Endometrial Thickness, or Bone Mineral Density in Japanese Breast Cancer Patients Treated With Adjuvant Tamoxifen. Cancer (2009) 115:952–61. doi: 10.1002/cncr.24111

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Schroth W, Antoniadou L, Fritz P, Schwab M, Muerdter T, Zanger UM, et al. Breast Cancer Treatment Outcome With Adjuvant Tamoxifen Relative to Patient CYP2D6 and CYP2C19 Genotypes. J Clin Oncol Off J Am Soc Clin Oncol (2007) 25:5187–93. doi: 10.1200/JCO.2007.12.2705

CrossRef Full Text | Google Scholar

17. Zafra-Ceres M, de Haro T, Farez-Vidal E, Blancas I, Bandres F, de Dueñas EM, et al. Influence of CYP2D6 Polymorphisms on Serum Levels of Tamoxifen Metabolites in Spanish Women With Breast Cancer. Int J Med Sci (2013) 10:932–7. doi: 10.7150/ijms.5708

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Lim JS, Chen XA, Singh O, Yap YS, Ng RC, Wong NS, et al. Impact of CYP2D6, CYP3A5, CYP2C9 and CYP2C19 Polymorphisms on Tamoxifen Pharmacokinetics in Asian Breast Cancer Patients. Br J Clin Pharmacol (2011) 71:737–50. doi: 10.1111/j.1365-2125.2011.03905.x

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Schroth W, Goetz MP, Hamann U, Fasching PA, Schmidt M, Winter S, et al. Association Between CYP2D6 Polymorphisms and Outcomes Among Women With Early Stage Breast Cancer Treated With Tamoxifen. Jama (2009) 302:1429–36. doi: 10.1001/jama.2009.1420

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Qin S, Shen L, Zhang A, Xie J, Shen W, Chen L, et al. Systematic Polymorphism Analysis of the CYP2D6 Gene in Four Different Geographical Han Populations in Mainland China. Genomics (2008) 92:152–8. doi: 10.1016/j.ygeno.2008.05.004

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Ruiter R, Bijl MJ, van Schaik RH, Berns EM, Hofman A, Coebergh JW, et al. CYP2C19*2 Polymorphism Is Associated With Increased Survival in Breast Cancer Patients Using Tamoxifen. Pharmacogenomics (2010) 11:1367–75. doi: 10.2217/pgs.10.112

PubMed Abstract | CrossRef Full Text | Google Scholar

22. van Schaik RH, Kok M, Sweep FC, van Vliet M, van Fessem M, Meijer-van Gelder ME, et al. The CYP2C19*2 Genotype Predicts Tamoxifen Treatment Outcome in Advanced Breast Cancer Patients. Pharmacogenomics (2011) 12:1137–46. doi: 10.2217/pgs.11.54

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Yin SJ, Ni YB, Wang SM, Wang X, Lou YQ, Zhang GL. Differences in Genotype and Allele Frequency Distributions of Polymorphic Drug Metabolizing Enzymes CYP2C19 and CYP2D6 in Mainland Chinese Mongolian, Hui and Han Populations. J Clin Pharm Ther (2012) 37:364–9. doi: 10.1111/j.1365-2710.2011.01298.x

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Gjerde J, Geisler J, Lundgren S, Ekse D, Varhaug JE, Mellgren G, et al. Associations Between Tamoxifen, Estrogens, and FSH Serum Levels During Steady State Tamoxifen Treatment of Postmenopausal Women With Breast Cancer. BMC Cancer (2010) 10:313. doi: 10.1186/1471-2407-10-313

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Tucker AN, Tkaczuk KA, Lewis LM, Tomic D, Lim CK, Flaws JA. Polymorphisms in Cytochrome P4503A5 (CYP3A5) May Be Associated With Race and Tumor Characteristics, But Not Metabolism and Side Effects of Tamoxifen in Breast Cancer Patients. Cancer Lett (2005) 217:61–72. doi: 10.1016/j.canlet.2004.08.027

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Goetz MP, Rae JM, Suman VJ, Safgren SL, Ames MM, Visscher DW, et al. Pharmacogenetics of Tamoxifen Biotransformation Is Associated With Clinical Outcomes of Efficacy and Hot Flashes. J Clin Oncol Off J Am Soc Clin Oncol (2005) 23:9312–8. doi: 10.1200/JCO.2005.03.3266

CrossRef Full Text | Google Scholar

27. Wegman P, Elingarami S, Carstensen J, Stål O, Nordenskjöld B, Wingren S. Genetic Variants of CYP3A5, CYP2D6, SULT1A1, UGT2B15 and Tamoxifen Response in Postmenopausal Patients With Breast Cancer. Breast Cancer Res BCR (2007) 9:R7. doi: 10.1186/bcr1640

CrossRef Full Text | Google Scholar

28. Ikryannikova LN, Shitikov EA, Zhivankova DG, Il’ina EN, Edelstein MV, Govorun VM. A MALDI TOF MS-Based Minisequencing Method for Rapid Detection of TEM-Type Extended-Spectrum Beta-Lactamases in Clinical Strains of Enterobacteriaceae. J Microbiol Methods (2008) 75:385–91. doi: 10.1016/j.mimet.2008.07.005

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Xiang D, Chen C, He S, Li M, Wang C. Comparing PyroMark Q24 Pyrosequencing and MALDI-TOF MS for Identification of CYP2D6*10. Clin Lab (2019) 65(5). doi: 10.7754/Clin.Lab.2018.180909

CrossRef Full Text | Google Scholar

30. Meyer S, Vollmert C, Trost N, Sigurdardottir S, Portmann C, Gottschalk J, et al. MNSs Genotyping by MALDI-TOF MS Shows High Concordance With Serology, Allows Gene Copy Number Testing and Reveals New St(a) Alleles. Br J Haematol (2016) 174:624–36. doi: 10.1111/bjh.14095

PubMed Abstract | CrossRef Full Text | Google Scholar

31. (EBCTCG) EBCTCG. Effects of Chemotherapy and Hormonal Therapy for Early Breast Cancer on Recurrence and 15-Year Survival: An Overview of the Randomised Trials. Lancet (London England) (2005) 365:1687–717. doi: 10.1016/S0140-6736(05)66544-0

CrossRef Full Text | Google Scholar

32. Group* EBCTC. Tamoxifen for Early Breast Cancer: An Overview of the Randomised Trials. Early Breast Cancer Trialists’ Collaborative Group. Lancet (London England) (1998) 351:1451–67. doi: 10.1016/S0140-6736(97)11423-4

CrossRef Full Text | Google Scholar

33. Saladores P, Mürdter T, Eccles D, Chowbay B, Zgheib NK, Winter S, et al. Tamoxifen Metabolism Predicts Drug Concentrations and Outcome in Premenopausal Patients With Early Breast Cancer. Pharmacogenomics J (2015) 15:84–94. doi: 10.1038/tpj.2014.34

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Lorizio W, Wu AH, Beattie MS, Rugo H, Tchu S, Kerlikowske K, et al. Clinical and Biomarker Predictors of Side Effects From Tamoxifen. Breast Cancer Res Treat (2012) 132:1107–18. doi: 10.1007/s10549-011-1893-4

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Bowles EJ, Buist DS, Chubak J, Yu O, Johnson J, Chestnut J, et al. Endocrine Therapy Initiation From 2001 to 2008 Varies by Age at Breast Cancer Diagnosis and Tumor Size. J Oncol Pract (2012) 8:113–20. doi: 10.1200/JOP.2011.000417

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Harris LN, Ismaila N, McShane LM, Andre F, Collyar DE, Gonzalez-Angulo AM, et al. Use of Biomarkers to Guide Decisions on Adjuvant Systemic Therapy for Women With Early-Stage Invasive Breast Cancer: American Society of Clinical Oncology Clinical Practice Guideline. J Clin Oncol Off J Am Soc Clin Oncol (2016) 34:1134–50. doi: 10.1200/JCO.2015.65.2289

CrossRef Full Text | Google Scholar

37. Duffy MJ, Harbeck N, Nap M, Molina R, Nicolini A, Senkus E, et al. Clinical Use of Biomarkers in Breast Cancer: Updated Guidelines From the European Group on Tumor Markers (EGTM). Eur J Cancer (Oxford Engl 1990) (2017) 75:284–98. doi: 10.1016/j.ejca.2017.01.017

CrossRef Full Text | Google Scholar

38. Coates AS, Winer EP, Goldhirsch A, Gelber RD, Gnant M, Piccart-Gebhart M, et al. Tailoring Therapies–Improving the Management of Early Breast Cancer: St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2015. Ann Oncol Off J Eur Soc Med Oncol (2015) 26:1533–46. doi: 10.1093/annonc/mdv221

CrossRef Full Text | Google Scholar

39. Senkus E, Kyriakides S, Ohno S, Penault-Llorca F, Poortmans P, Rutgers E, et al. Primary Breast Cancer: ESMO Clinical Practice Guidelines for Diagnosis, Treatment and Follow-Up. Ann Oncol Off J Eur Soc Med Oncol (2015) 26 Suppl 5:v8–30. doi: 10.1093/annonc/mdv298

CrossRef Full Text | Google Scholar

40. Sestak I, Buus R, Cuzick J, Dubsky P, Kronenwett R, Denkert C, et al. Comparison of the Performance of 6 Prognostic Signatures for Estrogen Receptor-Positive Breast Cancer: A Secondary Analysis of a Randomized Clinical Trial. JAMA Oncol (2018) 4:545–53. doi: 10.1001/jamaoncol.2017.5524

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Cuzick J, Dowsett M, Pineda S, Wale C, Salter J, Quinn E, et al. Prognostic Value of a Combined Estrogen Receptor, Progesterone Receptor, Ki-67, and Human Epidermal Growth Factor Receptor 2 Immunohistochemical Score and Comparison With the Genomic Health Recurrence Score in Early Breast Cancer. J Clin Oncol Off J Am Soc Clin Oncol (2011) 29:4273–8. doi: 10.1200/JCO.2010.31.2835

CrossRef Full Text | Google Scholar

42. Massagué J. Sorting Out Breast-Cancer Gene Signatures. New Engl J Med (2007) 356:294–7. doi: 10.1056/NEJMe068292

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Kiyotani K, Mushiroda T, Zembutsu H, Nakamura Y. Important and Critical Scientific Aspects in Pharmacogenomics Analysis: Lessons From Controversial Results of Tamoxifen and CYP2D6 Studies. J Hum Genet (2013) 58:327–33. doi: 10.1038/jhg.2013.39

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Johnson MD, Zuo H, Lee KH, Trebley JP, Rae JM, Weatherman RV, et al. Pharmacological Characterization of 4-Hydroxy-N-Desmethyl Tamoxifen, a Novel Active Metabolite of Tamoxifen. Breast Cancer Res Treat (2004) 85:151–9. doi: 10.1023/B:BREA.0000025406.31193.e8

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Mürdter TE, Schroth W, Bacchus-Gerybadze L, Winter S, Heinkele G, Simon W, et al. Activity Levels of Tamoxifen Metabolites at the Estrogen Receptor and the Impact of Genetic Polymorphisms of Phase I and II Enzymes on Their Concentration Levels in Plasma. Clin Pharmacol Ther (2011) 89:708–17. doi: 10.1038/clpt.2011.27

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Filipits M, Rudas M, Jakesz R, Dubsky P, Fitzal F, Singer CF, et al. A New Molecular Predictor of Distant Recurrence in ER-Positive, HER2-Negative Breast Cancer Adds Independent Information to Conventional Clinical Risk Factors. Clin Cancer Res an Off J Am Assoc Cancer Res (2011) 17:6012–20. doi: 10.1158/1078-0432.CCR-11-0926

CrossRef Full Text | Google Scholar

47. Tang G, Cuzick J, Costantino JP, Dowsett M, Forbes JF, Crager M, et al. Risk of Recurrence and Chemotherapy Benefit for Patients With Node-Negative, Estrogen Receptor-Positive Breast Cancer: Recurrence Score Alone and Integrated With Pathologic and Clinical Factors. J Clin Oncol Off J Am Soc Clin Oncol (2011) 29:4365–72. doi: 10.1200/JCO.2011.35.3714

CrossRef Full Text | Google Scholar

48. Cardoso F, van’t Veer LJ, Bogaerts J, Slaets L, Viale G, Delaloge S, et al. 70-Gene Signature as an Aid to Treatment Decisions in Early-Stage Breast Cancer. New Engl J Med (2016) 375:717–29. doi: 10.1056/NEJMoa1602253

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Dowsett M, Sestak I, Regan MM, Dodson A, Viale G, Thürlimann B, et al. Integration of Clinical Variables for the Prediction of Late Distant Recurrence in Patients With Estrogen Receptor-Positive Breast Cancer Treated With 5 Years of Endocrine Therapy: CTS5. J Clin Oncol (2018) 36(19):1941–8. doi: 10.1200/JCO.2017.76.4258

CrossRef Full Text | Google Scholar

50. Zhang Y, Feng T, Wang S, Dong R, Yang J, Su J, et al. A Novel XGBoost Method to Identify Cancer Tissue-Of-Origin Based on Copy Number Variations. Front Genet (2020) 11:585029. doi: 10.3389/fgene.2020.585029

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Yang J, Peng S, Zhang B, Houten S, Schadt E, Zhu J, et al. Human Geroprotector Discovery by Targeting the Converging Subnetworks of Aging and Age-Related Diseases. Geroscience (2020) 42:353–72. doi: 10.1007/s11357-019-00106-x

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Tang X, Cai L, Meng Y, Xu J, Lu C, Yang J. Indicator Regularized Non-Negative Matrix Factorization Method-Based Drug Repurposing for COVID-19. Front Immunol (2020) 11:603615. doi: 10.3389/fimmu.2020.603615

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Xu J, Zhu W, Cai L, Liao B, Meng Y, Xiang J, et al. LRMCMDA: Predicting miRNA-Disease Association by Integrating Low-Rank Matrix Completion With miRNA and Disease Similarity Information. IEEE Access (2020) 8:80728–38. doi: 10.1109/ACCESS.2020.2990533

CrossRef Full Text | Google Scholar

54. Wang L, Xiao Y, Li J, Feng X, Li Q, Yang J. IIRWR: Internal Inclined Random Walk With Restart for LncRNA-Disease Association Prediction. IEEE Access (2019) 7:54034–41. doi: 10.1109/ACCESS.2019.2912945

CrossRef Full Text | Google Scholar

55. Yang J, Huang T, Song WM, Petralia F, Mobbs CV, Zhang B, et al. Discover the Network Underlying the Connections Between Aging and Age-Related Diseases. Sci Rep (2016) 6:32566. doi: 10.1038/srep32566

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Liu H, Qiu C, Wang B, Bing P, Tian G, Zhang X, et al. Evaluating DNA Methylation, Gene Expression, Somatic Mutation, and Their Combinations in Inferring Tumor Tissue-Of-Origin. Front Cell Dev Biol (2021) 9:619330. doi: 10.3389/fcell.2021.619330

PubMed Abstract | CrossRef Full Text | Google Scholar

57. He B, Dai C, Lang J, Bing P, Tian G, Wang B, et al. A Machine Learning Framework to Trace Tumor Tissue-of-Origin of 13 Types of Cancer Based on DNA Somatic Mutation. Biochim Biophys Acta Mol Basis Dis (2020) 1866:165916. doi: 10.1016/j.bbadis.2020.165916

PubMed Abstract | CrossRef Full Text | Google Scholar

58. He B, Lang J, Wang B, Liu X, Lu Q, He J, et al. TOOme: A Novel Computational Framework to Infer Cancer Tissue-Of-Origin by Integrating Both Gene Mutation and Expression. Front Bioeng Biotechnol (2020) 8:394. doi: 10.3389/fbioe.2020.00394

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Liang Y, Wang H, Yang J, Li X, Dai C, Shao P, et al. A Deep Learning Framework to Predict Tumor Tissue-Of-Origin Based on Copy Number Alteration. Front Bioeng Biotechnol (2020) 8:701. doi: 10.3389/fbioe.2020.00701

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Song Z, Chen X, Shi Y, Huang R, Wang W, Zhu K, et al. Evaluating the Potential of T Cell Receptor Repertoires in Predicting the Prognosis of Resectable Non-Small Cell Lung Cancers. Mol Ther Methods Clin Dev (2020) 18:73–83. doi: 10.1016/j.omtm.2020.05.020

PubMed Abstract | CrossRef Full Text | Google Scholar

61. He B, Zhang Y, Zhou Z, Wang B, Liang Y, Lang J, et al. A Neural Network Framework for Predicting the Tissue-Of-Origin of 15 Common Cancer Types Based on RNA-Seq Data. Front Bioeng Biotechnol (2020) 8:737. doi: 10.3389/fbioe.2020.00737

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Yan N, Zhang Y, Guo X, Yuan D, Tian G, Yang J. A Review on Cancer of Unknown Primary Origin: The Role of Molecular Biomarkers in the Identification of Unknown Primary Origin. Methods Mol Biol (2020) 2204:109–19. doi: 10.1007/978-1-0716-0904-0_10

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: breast cancer, hormone receptor-positive, tamoxifen, risk factors, risk-of-recurrence score

Citation: Pang H, Zhang G, Yan N, Lang J, Liang Y, Xu X, Cui Y, Wu X, Li X, Shan M, Wang X, Meng X, Liu J, Tian G, Cai L, Yuan D and Wang X (2021) Evaluating the Risk of Breast Cancer Recurrence and Metastasis After Adjuvant Tamoxifen Therapy by Integrating Polymorphisms in Cytochrome P450 Genes and Clinicopathological Characteristics. Front. Oncol. 11:738222. doi: 10.3389/fonc.2021.738222

Received: 08 July 2021; Accepted: 25 October 2021;
Published: 19 November 2021.

Edited by:

Cheng Guo, Columbia University, United States

Reviewed by:

Bing Wang, Anhui University of Technology, China
Manolis Tsiknakis, Hellenic Mediterranean University, Greece
Haiyan Liu, Changsha Medical University, China

Copyright © 2021 Pang, Zhang, Yan, Lang, Liang, Xu, Cui, Wu, Li, Shan, Wang, Meng, Liu, Tian, Cai, Yuan and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Li Cai, Y2FpbGlAZW1zLmhyYm11LmVkdS5jbg==; Dawei Yuan, eXVhbmR3QGdlbmVpcy5jbg==; Xin Wang, eGlud2FuZ0B2aXAuMTI2LmNvbQ==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.