- 1Department of Breast and Thyroid Surgery, Southwest Hospital, Army Medical University, Chongqing, China
- 2Institute of Pathology and Southwest Cancer Center, Key Laboratory of the Ministry of Education, Southwest Hospital, Army Medical University, Chongqing, China
- 3Institute of Toxicology, College of Preventive Medicine, Army Medical University, Chongqing, China
The improvement in the quality of life is accompanied by an accelerated pace of living and increased work-related pressures. Recent decades has seen an increase in the proportion of obese patients, as well as an increase in the prevalence of breast cancer. More and more evidences prove that obesity may be one of a prognostic impact factor in patients with breast cancer. Obesity presents unique diagnostic and therapeutic challenges in the population of breast cancer patients. Therefore, it is essential to have a better understanding of the relationship between obesity and breast cancer. This study aims to construct a prognostic risk prediction model combining obesity and breast cancer. In this study, we obtained a breast cancer sample dataset from the GEO database containing obesity data [determined by the body mass index (BMI)]. A total of 1174 genes that were differentially expressed between breast cancer samples of patients with and without obesity were screened by the rank-sum test. After weighted gene co-expression network analysis (WGCNA), 791 related genes were further screened. Relying on single-factor COX regression analysis to screen the candidate genes to 30, these 30 genes and another set of TCGA data were intersected to obtain 24 common genes. Finally, lasso regression analysis was performed on 24 genes, and a breast cancer prognostic risk prediction model containing 6 related genes was obtained. The model was also found to be related to the infiltration of immune cells. This study provides a new and accurate prognostic model for predicting the survival of breast cancer patients with obesity.
Introduction
According to the World Health Organization (WHO), a person with a body mass index (BMI) equal to or greater than 30 kg/m2 is considered obese (1). According to this definition, some epidemiological studies have shown a significant increase in the number of obese individuals in the last decades (2, 3). Obesity is considered as an indicator of metabolic syndrome (MetS). The presence of MetS can increase the risk and influence the prognosis of various tumors, such as breast cancer, colorectal cancer, and prostate cancer (4–6). Recent studies have demonstrated that overweight and obesity are associated with higher risks of adenocarcinoma of the esophagus, thyroid, pancreas, colon, rectum, endometrium, prostate, gallbladder, ovary, and breast, in addition to multiple myeloma (7). Breast cancer is the second leading cause of cancer related deaths in women; therefore, it is of great public health significance to understand how obesity affects this disease. Several studies have identified obesity as a risk factor for breast cancer and are associated with different types of breast cancer, also depending on different stages or ages (8, 9). In addition, obesity has been identified as a poor prognostic factor for breast cancer. For example, several studies have established obesity as a risk factor for postmenopausal breast cancer, specifically estrogen receptor-positive and triple-negative phenotypes.
With the emergence and rapid development of chip and sequencing technologies in various tumors, bioinformatics analysis has been widely used to identify more effective potential biomarkers for the diagnosis, treatment, and prognosis of a variety of diseases. Over the past decade, the Gene Expression Omnibus (GEO) and Cancer Genome Atlas (TCGA) databases have accumulated abundant genomes and gene expression profiles that can be used in various diseases. Weighted gene co-expression network analysis (WGCNA) is a systems biology method used to describe the correlation patterns between genes in microarray samples. WGCNA can be used to find highly correlated gene modules, and the identified correlation networks facilitate network-based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. WGCNA has been successfully used to study cancer-related targeting modules and central genes (10, 11). Polygenic combinations have been reported to possess better predictive ability than single genes for cancer prognosis (12). Therefore, novel biological algorithms need to be explored to construct more accurate diagnostic or prognostic models.
In this study, we used public microarray expression to comprehensively analyze breast cancer patient data from the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) databases, and modules associated with obesity were identified by WGCNA. Cox and LASSO regression models were used to construct a risk score prediction model, which could help better predict the prognosis of breast cancer patients with obesity.
Materials and Methods
Data Resource
We downloaded the expression profile data and sample information of GSE24185 using the GEOquery package (13) in R software version 4.0 (http://www.r-project.org), and downloaded the corresponding GPL96 chip information of expression profile data. The chip information in the expression profile data was converted into gene symbol, and part of the data without the gene symbol information was removed during the conversion, and the duplicate data were averaged. We used the cgdsr package in R software to download the required breast cancer gene expression data and sample clinical information. In the GSE24185 dataset, we selected data from 74 breast cancer patients, including 38 obese samples (BMI >30 Kg/m2) and 36 normal samples (BMI 18.5-24.9 Kg/m2). Another TCGA dataset was downloaded from Xena using the UCSCXenaTools package for validation.
Enrichment Analysis and Weighted Gene Co-Expression Network Analysis
Principal component analyses (PCA) were performed using the R software. GO and KEGG enrichment analyses were performed using clusterProfiler package (14) in R software version 4.0, and enrichment pathways with statistical significance were screened under the conditions of p < 0.05 and q < 0.2. The metabolic syndrome is a common metabolic disorder. Obesity is considered as an indicator of MetS. Co-expression analysis of the resulting differentially expressed genes was performed using WGCNA package (15) in R. Finally, modules with higher correlation with obesity were identified based on correlation coefficient and the modules with statistical significance were screened under the conditions of p < 0.05.
COX Proportional Hazard Model
Compared with the traditional stepwise regression model, the lasso regression model has the advantage of processing all the independent variables at the same time, which greatly strengthens the stability of the model, yielding a model with fewer variables at a faster speed. Lasso regression analysis was performed by glmnet package (16) in R.
Immune Score and Matrix Score
CIBERSORT is a deconvolution algorithm that uses a feature matrix of 547 genes to represent 22 infiltrating immune cells (17, 18). CIBERSORT uses Monte Carlo sampling to derive a deconvoluted p value for each sample, and deletes samples with P > 0.05. The ESTIMATE algorithm (estimate package in R 4.0 software) was used to calculate the immune and matrix scores of each tumor sample, and the correlation between tumor and obesity was analyzed according to the matrix score.
Results
Stromal and Immune Scores
The expression profile data and sample information of GSE24185 were downloaded using the GEOquery package, and the GPL96 chip information corresponding to the expression profile data was downloaded. The samples were subjected to stromal and immune scores, and grouped according to BMI into two groups: an obese group (n = 38) and a normal group (n = 36). Kruskal-Wallis (KW) analysis was performed according to the scoring results of the grouped samples at P > 0.05 (Supplementary Figure 1). There were no significant differences in the results between the two groups, which is consistent with the fact that our two groups were from patients with cancer. KW analysis of other factors and obesity in the sample information from the GSE24185 data showed that menstrual status and HER2 status were associated with obesity (Supplementary Table 1). We selected 74 samples from the GSE24185 data set, all of which were female patients. The detailed information about age, classification, and menopausal status is shown in Table 1 and Supplementary Table 2.
Screening of Differentially Expressed Genes (DEGs)
First, we conducted a PCA analysis and observed the degree of deviation between normal and obese samples (Supplementary Figure 2). We can see that there is no significant deviation between normal and obese samples, which is consistent with the fact that they are all cancer samples.The 74 samples from GSE24185 were divided into an obese and a normal group for the rank-sum test (Wilcox Test), and genes exhibiting significantly differential expression (P < 0.05) were screened. According to the differential gene expression analysis, 1173 genes were differentially expressed, of which 842 were upregulated and 331 were downregulated. To illustrate the distribution of each category, volcano maps and expression heat maps of differential genes were plotted (Figures 1A, B).
Identification of Obesity-Associated Modules
Coexpression analysis of the obtained DEGs were analyzed using the WGCNA software package. Data were first transformed into log2+1, and outliers were then processed. The GSM594925 sample was found to be in an outlier position, and it was removed from the analysis (Figure 2A). As shown in Figure 2B, the appropriate soft threshold values were screened out, and the resulting topological matrix was clustered using dissimilarity between genes. The tree was divided into modules (with a minimum of 30 genes per module) using the dynamic clipping method, and a total of five models were obtained (Figure 2C).
Figure 2 Identification of the functional modules associated with obesity. (A) Cluster dendrogram; (B) Analysis of network topologies for various soft threshold powers; (C) Clustering dendrogram of DEGs with dissimilarity based on topological overlap, together with assigned module colors; (D) Module-trait heatmap. Each row corresponds to a module eigengene, each column corresponds to a trait. Each cell contains the corresponding correlation and P value.
We plotted a heat map of module feature relationships to assess the association between each module and two clinical features (obese and nonobese). Figure 2D shows the correlation between the characteristic genes in a module and obesity traits. We performed co-expression analysis for DEGs, so grey module correlations were particularly high. This is because differentially screened genes are prone to form single or several genes with high correlation with traits, and cannot form effective gene modules. We selected the genes (blue module: r = 0.45, P = 8e-04 and turquoise module: r = 0.42, P = 2e-04) as the two gene modules exhibiting the best correlation, except for the grey module. The modules were then sorted to obtain 791 genes for further analysis.
Functional Enrichment Analysis of Genes in Key Modules
In order to understand the functional differences of different modules, we conducted a difference analysis of the 5 module genes and performed GO analysis. The Blue module is mainly involved in “Golgi vesicle transport”, “protein secretion”, “response to oxidative stress” and other ways. The brown module is mainly related to “mesenchyme development”, “muscle tissue development”, “axonogenesis” and other pathways. The turquoise module is mainly related to “mRNA splicing”, “regulation of metabolic process”, “organic acid transport”, “regulation of neurotransmitter levels” and other pathways. And the turquoise module is also the group that has the most enriched GO pathways (Supplementary Tables 3–5); while the grey and yellow modules have not enriched any GO pathways.
To further understand the biological function and pathway correlation of the blue and turquoise group modules, GO and KEGG pathway enrichment analyses were conducted. The results of the GO enrichment analysis showed that the modules were significantly enriched in the following terms: “regulation of cellular biosynthetic process,” “macromolecule biosynthetic process,” “DNA and nucleic acid binding.” Only two groups were enriched under MF entries, and KEGG results had only one pathway, so only the top 10 results of BP and CC were shown (Figure 3). Figure 3 shows the top 10 enriched GO entries of the blue and turquoise group modules. Supplementary Table 1 presents the results of all enriched genes according to the GO and KEGG enrichment analyses in detail.
PPI Network Construction and Identification of Hub Genes
The main objective of our study was to analyze the degree of correlation between genes in breast cancer and obesity and to determine the importance of genes in the different modules. The interaction network between proteins helps to mine the core regulatory genes. Using String online database (https://string-db.org/) and Cytoscape software were used to analyze the co-expressed genes. Among them were some interacting genes with high confidence, such as the potential interaction between mitochondrial ribosomal protein family genes (MRPS10, MRPS14, MRPS27, MRPL44), heterogeneous ribonucleoprotein family genes (HNRNPA0, HNRNPA3, HNRNPL),PCBP2). There was a potential interaction between PBM25, SREK1, PRPF40A, DDX46. A total of 55 genes, filtered into the PPI network (Figure 4).
Figure 4 Visualization of the protein-protein interaction (PPI) network and the candidate hub genes.
Prognostic Factors Were Screened by Univariate Cox Regression Analysis
We downloaded the expression profile data of these genes in breast cancer (including 1100 samples) from the TCGA database as well as the sample information table. Genes that had null expression values in all samples and 0 expression values in most samples were deleted. Finally, 740 genes were finally retained for univariate COX analysis. Univariate Cox analysis was performed for these 740 genes using the coxph function in the survival software package, and 30 potential prognosis-related genes were screened out (p value < 0.05; Table 2). Four of these genes were selected for survival analysis, and obvious survival differences can be seen between the two curves of each characteristic gene (Supplementary Figure 3).
Lasso Regression Analysis
To facilitate subsequent validation, we obtained 24 genes from the intersection of the above 30 genes with another set of genes from the TCGA database. Expression profile data and survival analysis of these 24 genes were analyzed by lasso regression using glmnet package. We selected six pseudogenes with independent prognostic values: SELENBP1, CACNA1D, CDC42EP3, HRH3, FCER1A, and PNOC. We extracted the expression values of six characteristic genes and divided the samples into low- and high-risk groups (Figure 5).
Figure 5 Prognostic risk assessment model. (A) Tenfold cross‐validation for tuning parameter selection in the LASSO model; (B) Heat map analysis of the gene expression of six pseudogenes in the high- and low-risk groups.
Difference of Immune Infiltration Between High- and Low-Risk Score Groups
The Cibersort algorithm was used to calculate immune infiltration of 1100 samples for 22 types of immune cells. Figure 6 presents a comparison of immune infiltration in the high- and low-risk groups (Figure 6). According to the results of rank-sum test (Table 3), revealed the presence of resting mast cells, eosinophils, CD8+ T cells, regulatory T cells (Tregs), naïve CD4 T cells, resting dendritic cells, neutrophils, naïve B cells, M0 macrophages, activated memory CD4+ T cells, activated dendritic cells, activated mast cells, resting memory CD4+ T cells, and memory B cells. There were differences in the proportion of infiltration of the 14 types of immune cells between the two groups.
Figure 6 Differences in infiltration of different immune cell types between the high- and low-risk groups.
Relationship Between the Prognostic Model and Clinical Parameters
The indicators related to breast cancer in the sample were evaluated according to the calculated risk value of each sample. As shown in the Figure 7, the prediction model constructed in the present study has good stability provided there is sufficient data. The survival time and survival rate were both higher in the low-risk group than in the high-risk group under different conditions such as age, T-stage, n-stage, gender, menstrual status, ER status, PR status, and lymph condition. To further understand the relationship between the prognostic model and other clinical data, we conducted univariate and multivariate COX regression analyses of TCGA data under different factors. The results revealed that the P values of the PR status and N-type in our prognostic risk prediction model were <0.05, and the p value and beta-value of the model we constructed were still within a reasonable range (Tables 4, 5). These results confirm the independent prognostic value of the risk score.
Figure 7 Evaluation of indicators related to breast cancer in the sample, blue is the low-risk group, light red is the high-risk group, each picture is a sample of different indicators.
Discussion
Obesity has seen an unprecedented growth in recent decades, and its impact on health has become increasingly evident. Esposito et al. (19) showed that the presence of metabolic syndrome was associated with a 52% increased risk of breast cancer (P < 0.01). Some studies have shown that metabolic syndrome is associated with higher recurrence and mortality in breast cancer patients (20–22). Metabolic syndrome is considered to be a prognostic factor in patients with breast cancer. Here, to better predict the prognosis of breast cancer we constructed a risk prediction model of obesity in combination with breast cancer. We selected a set of breast cancer sample data sets in the GEO database that included obesity data (determined by the BMI). The data were scored by the ESTIMATE method for immunity and matrix, and the dataset was divided into two groups: obese group and nonobese group, for K-W analysis (P < 0.05), to determine suitability of the samples for use in this study. DEGs in the validated samples were screened using the Wilcoxon rank-sum test.
WGCNA is used to construct a gene coexpression network, where co-expression modules can be identified using the WGCNA package in R language (15). WGCNA has many outstanding advantages over other methods as it explores the association between co-expression modules and clinical features, and the results have higher reliability and biological significance compared to other methods. In this study, we used the WGCNA method to construct 5 co-expression modules from 1173 DEGs in 74 samples, and we calculated the correlation between co-expression modules and obesity. The blue group and turquoise group modules, which are highly correlated with clinical characteristics, are considered key modules in exploring the association with obesity. GO and KEGG enrichment analysis and PPI network analysis were carried out, and a total of 55 genes were filtered into the PPI network, including 55 and nodes. Then, univariate COX analysis and lasso regression analysis were performed to identify six pseudogenes with independent prognostic value.
SELENBP1 has been shown to be expressed at low levels in cancers such as renal cell carcinoma, lung adenocarcinoma (23), and breast cancer (24) and is generally predictive of poor clinical outcomes. In our results, SELENBP1 expression was lower in breast cancer samples in the nonobese group than in the obese group. CACNA1D is believed to regulate cell firing, and is highly associated with prostate cancer (25). Relevant bioinformatics analysis also confirmed that CACNA1D was highly expressed in prostate cancer, breast cancer, colorectal cancer, gastric cancer, lung cancer, uterine cancer, and other cancers (26). In our study, CACNA1D expression was lower in breast cancer samples of nonobese patients than obese patients. Results of the GO and KEGG analyses revealed that CACNA1D was enriched in the GO entries “neurotransmitter transport” and “cellular localization” and in the KEGG entry “Herpes simplex virus 1 infection”, and may be a novel oncogene in cancer development. In vivo “admix” experiments with breast cancer cells demonstrated that Cdc42EP3 is required for efficient tumor growth. Cdc42EP3/BORG2 has been reported to be needed for matrix remodeling, invasion, angiogenesis, and the tumor growth-promoting abilities of cancer-associated fibroblasts (27). Studies have shown that HRH3 plays an important role in promoting tumor invasion and metastasis. Its expression is upregulated in lung cancer tissues, and it is associated with poor prognosis of lung cancer patients (28, 29). In our study, HRH3 expression was higher in breast cancer samples of nonobese patients than obese patients. The HRH3 gene was enriched in the entries of “neurotransmitter transport” and “cellular localization” in GO analysis. The relationship between FCER1A gene variants and allergic diseases has been demonstrated in human studies (30, 31). Some studies have hypothesized about the possible mechanisms underlying the association between the FCER1A gene and breast cancer, and suggested that immune-stimulating conditions such as infectious diseases and allergies may actually confer susceptibility to breast cancer (32). In our study, FCER1A expression was higher in breast cancer samples of nonobese patients than of obese patients. Ablation of PNOCARC neurons protects from obesity. PNOC expression was found to be significantly upregulated in gliomas (33). In our study, PNOC expression was higher in breast cancer samples of nonobese patients than of obese patients.
Early clinical studies have shown that immune infiltration has a great impact on the clinical course of concentrated types of cancer (34, 35). The results of immune infiltration in the high- and low-risk groups showed that the infiltration proportion of 14 of 22 types of immune cells significantly differed between the two groups. Resting mast cells accounted for the highest proportion of immune cells in both the groups, and the proportion of activated mast cells were found to significantly differ between the two risk groups. Mast cells can secrete several factors for the regulation of cancer cell growth (36). Eosinophils and T cells also account for a large proportion. Relevant studies have shown that an increase in the number of eosinophils coexists with obesity, and there is a positive correlation between blood eosinophil count and body mass index (BMI) or metabolic syndrome (37, 38). Therapeutic interventions targeting eosinophils in adipose tissue may have the potential to reduce inflammation and body fat while improving metabolic dysfunction in obese patients (39). Obesity-promoted breast tumor development is associated with loss of functional CD8+ T cells (40).
Patients in the high-risk group had a poor prognosis, and their survival time decreased with increase in the risk value. Under a range of different conditions, including age, T stage, N stage, sex, menstrual status, ER status, PR status, and lymph status, the survival time and survival rate were higher in the low-risk group than in the high-risk group. We also observed that PR status and N-typing remained important and independent risk factors for long-term survival. In both the univariate and multivariate COX regression analysis, the model constructed in the present study was found to be a good predictor of the prognosis, further indicating the prognostic value of this model.
In this study, the combination of WGCNA and the Cox proportional hazard model achieved reliable results in the identification of a co-expression network associated with survival and the construction of a risk score model. Our study provides potential models and biomarkers for further immune-related work and personalized drug treatment of breast cancer in breast cancer patients with obesity. However, this study also has some limitations. This is a retrospective study, and the predictive value of this model for prognosis has not been confirmed experimentally in clinical samples and the number of patients in the study type is limited. Finally, in vivo and in vitro experiments are needed to validate the findings of our study and to elucidate the molecular mechanisms underlying the roles of these genes in breast cancer patients with obesity.
Data Availability Statement
Publicly available datasets were analyzed in this study. This data can be found here: GSE24185 dataset at https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE24185: TCGA dataset at https://xena.ucsc.edu/public/.
Ethics Statement
Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.Written informed consent was not obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author Contributions
NS, DM, YZ, and XQ conceived and designed the analysis and implemented the experimental studies. NS performed statistical analysis, interpreted results, graphed data, and wrote the paper. PG, YL, ZY, FH, and ZP modified the draft. YZ and XQ approved the draft of the paper. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the Project of National Key Clinical Specialty Construction (413F1Z113), the Natural Science Foundation of Chongqing (cstc2018jcyjA0317), Military Medical Staff Innovation Plan of Army Medical University (No. XZ-2019-505-042) and Military Medical Staff Innovation Plan of Southwest Hospital (No. SWH2018BJLC-04).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2021.712513/full#supplementary-material
Supplementary Figure 1 | Stromal and immune scores.
Supplementary Figure 2 | The principal component analysis(PCA) results for GSE24185 dataset.
Supplementary Figure 3 | Prognostic value of differentially expressed genes in breast cancer patients.
Supplementary Table 2 | Information of each sample in the GSE24185 dataset
Supplementary Table 3 | The results of the GO term enrichment of the differential genes of the blue module.
Supplementary Table 4 | The results of the GO term enrichment of the differential genes of the brown module.
Supplementary Table 5 | The results of the GO term enrichment of the differential genes of the turquoise module.
References
1. Onidentification CG. Onidentification CG. Clinical Guidelines on the Identification, Evaluationand Treatment of Overweight and Obesity in Adults the Evidence Report. Obes Res (1998) 2:51S–179S. doi: 10.1002/j.1550-8528.1998.tb00690.x
2. GBD 2015 Obesity Collaborators, Afshin A, Forouzanfar MH, Reitsma MB, Sur P, Estep K, et al. Health Effects of Overweight and Obesity in 195 Countries Over 25 Years. N Engl J Med (2017) 377:13–27. doi: 10.1056/NEJMoa1614362
3. Arhire IL. Personal and Social Responsibility in Obesity. Romanian J Diabetes Nutr Metab Dis (2015) 22:321–31. doi: 10.1515/rjdnmd-2015-0039
4. Shen Z, Ye Y, Bin L, Yin M, Yang X, Jiang K, et al. Metabolic Syndrome is an Important Factor for the Evolution of Prognosis of Colorectal Cancer: Survival, Recurrence, and Liver Metastasis. Am J Surg (2010) 200:59–63. doi: 10.1016/j.amjsurg.2009.05.005
5. Xiang YZ, Xiong H, Cui ZL, Jiang SB, Xia QH, Zhao Y, et al. The Association Between Metabolic Syndrome and the Risk of Prostate Cancer, High-Grade Prostate Cancer, Advanced Prostate Cancer, Prostate Cancer-Specific Mortality and Biochemical Recurrence. J Exp Clin Cancer Res (2013) 2:9. doi: 10.1186/1756-9966-32-9
6. Thomas GA, Alvarez-Reeves M, Lu L, Yu H, Irwin ML. Effect of Exercise on Metabolic Syndrome Variables in Breast Cancer Survivors. Int J Endocrinol (2013) 13:168797. doi: 10.1155/2013/168797
7. Gentry M. World Cancer Research Fund International (WCRF). Impact. (2017) 4:32–3. doi: 10.21820/23987073.2017.4.32
8. Turkoz FP, Solak M, Petekkaya I, Keskin O, Kertmen N, Sarici F, et al. Association Between Common Risk Factors and Molecular Subtypes in Breast Cancer Patients. Breast (2013) 22:344–50. doi: 10.1016/j.breast.2012.08.005
9. Pierobon M, Frankenfeld CL. Obesity as a Risk Factor for Triple-Negative Breast Cancers: A Systematic Review and Meta-Analysis. Breast Cancer Res Treat (2013) 137:307–14. doi: 10.1007/s10549-012-2339-3
10. Wang Y, Chen L, Wang G, Cheng S, Qian K, Liu X, et al. Fifteen Hub Genes Associated With Progression and Prognosis of Clear Cell Renal Cell Carcinoma Identified by Coexpression Analysis. J Cell Physiol (2018) 234:10225–37. doi: 10.1002/jcp.27692
11. Zhou S, Liu S, Zhang L, Guo S, Shen J, Li Q, et al. Recurrence Risk Based on Pathologic Stage After Neoadjuvant Chemoradiotherapy in Esophageal Squamous Cell Carcinoma: Implications for Risk-Based Postoperative Surveillance Strategies. Ann Surg Oncol (2018) 25:3639–46. doi: 10.1245/s10434-018-6736-7
12. Zuo S, Dai G, Ren X. Identification of a 6-Gene Signature Predicting Prognosis for Colorectal Cancer. Cancer Cell Int (2019) 19:6. doi: 10.1186/s12935-018-0724-7
13. Davis S, Meltzer PS. GEOquery: A Bridge Between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics (2007) 23:1846–7. doi: 10.1093/bioinformatics/btm254
14. Yu G, Wang LG, Han Y, He QY. Clusterprofiler: An R Package for Comparing Biological Themes Among Gene Clusters. Omics-a J Integr Biol (2012) 16:284–7. doi: 10.1089/omi.2011.0118
15. Langfelder P, Horvath S. WGCNA: An R Package for Weighted Correlation Network Analysis. BMC Bioinf (2008) 9:559. doi: 10.1186/1471-2105-9-559
16. Blanco JL, Porto-Pazos AB, Pazos A, Fernandez-Lozano C. Prediction of High Anti-Angiogenic Activity Peptides In Silico Using a Generalized Linear Model and Feature Selection. Sci Rep (2018) 8:15688. doi: 10.1038/s41598-018-33911-z
17. Goswami S. Redrawing a Prognostic Landscape of Tumor Infiltrating Immune Cells Across Different Human Cancer. (2018)
18. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust Enumeration of Cell Subsets From Tissue Expression Profiles. Nat Methods (2015) 12:453–7. doi: 10.1038/nmeth.3337
19. Esposito K, Chiodini P, Colao A, Lenzi A, Giugliano D. Metabolic Syndrome and Risk of Cancer: A Systematic Review and Meta-Analysis. Diabetes Care (2012) 35:2402–11. doi: 10.2337/dc12-0336
20. Berrino F, Villarini A, Traina A, Bonanni B, Panico S, Mano MP, et al. Metabolic Syndrome and Breast Cancer Prognosis. Breast Cancer Res Treat (2014) 147:159–65. doi: 10.1007/s10549-014-3076-6
21. Calip GS, Malone KE, Gralow JR, Stergachis A, Hubbard RA, Boudreau DM. Metabolic Syndrome and Outcomes Following Early-Stage Breast Cancer. Breast Cancer Res Treat (2014) 148:363–77. doi: 10.1007/s10549-014-3157-6
22. Simon MS, Beebe-Dimmer JL, Hastert TA, Manson JE, Cespedes Feliciano EM, Neuhouser ML, et al. Cardiometabolic Risk Factors and Survival After Breast Cancer in the Women’s Health Initiative. Cancer (2018) 124:1798–807. doi: 10.1002/cncr.31230
23. Caswell DR, Chuang CH, Ma RK, Winters IP, Snyder EL, Winslow MM. Tumor Suppressor Activity of Selenbp1, A Direct Nkx2-1 Target, in Lung Adenocarcinoma. Mol Cancer Res Mcr (2018) 16:1737–49. doi: 10.1158/1541-7786.MCR-18-0392
24. Sheng Z, Feng L, Mamoun Y, Hao L, Changyi C, Qizhi Y, et al. Reduced Selenium-Binding Protein 1 in Breast Cancer Correlates With Poor Survival and Resistance to the Anti-Proliferative Effects of Selenium. PloS One (2013) 8:e63702e. doi: 10.1371/journal.pone.0063702
25. Mariot P, Prevarskaya N, Roudbaraki MM, Bourhis XL, Skryma R. Evidence of Functional Ryanodine Receptor Involved in Apoptosis of Prostate Cancer (LNCaP) Cells. Prostate (2015) 43:205–14. doi: 10.1002/(SICI)1097-0045(20000515)43:3<205::AID-PROS6>3.0.CO;2-M
26. Wang CY, Lai MD, Phan NN, Sun Z, Lin YC. Meta-Analysis of Public Microarray Datasets Reveals Voltage-Gated Calcium Gene Signatures in Clinical Cancer Patients. PloS One (2015) 10:e0125766. doi: 10.1371/journal.pone.0125766
27. Calvo F, Ranftl R, Hooper S, Farrugia AJ, Moeendarbary E, Bruckbauer A, et al. Cdc42EP3/BORG2 and Septin Network Enables Mechano-Transduction and the Emergence of Cancer-Associated Fibroblasts. Cell Rep (2015) 13:2699–714. doi: 10.1016/j.celrep.2015.11.052
28. Francis H, Onori P, Gaudio E, Franchitto A, Alpini G. H3 Histamine Receptor-Mediated Activation of Protein Kinase Calpha Inhibits the Growth of Cholangiocarcinoma In Vitro and In Vivo. Mol Cancer Res Mcr (2009) 7:1704. doi: 10.1158/1541-7786.MCR-09-0261
29. Yu D, Zhao J, Wang Y, Hu J, Zhao Q, Li J, et al. Upregulated Histamine Receptor H3 Promotes Tumor Growth and Metastasis in Hepatocellular Carcinoma. Oncol Rep (2019) 41:3347–54. doi: 10.3892/or.2019.7119
30. Potaczek DP, Sanak M, Szczeklik A. Additive Association Between FCER1A and FCER1B Genetic Polymorphisms and Total Serum IgE Levels. Allergy (2010) 62:1095–6. doi: 10.1111/j.1398-9995.2007.01446.x
31. Potaczek DP, Sanak M, Mastalerz L, Setkowicz M, Kaczor M, Nizankowska E, et al. The Alpha-Chain of High-Affinity Receptor for IgE (FcepsilonRIalpha) Gene Polymorphisms and Serum IgE Levels. Allergy (2006) 61:1230–3. doi: 10.1111/j.1398-9995.2006.01195.x
32. Lee J-Y, Park AK, Lee K-M, Park SK, Han S, Han W, et al. Candidate Gene Approach Evaluates Association Between Innate Immunity Genes and Breast Cancer Risk in Korean Women. Carcinogenesis (2009) 30:1528–31. doi: 10.1093/carcin/bgp084
33. Chan MH, Kleinschmidt-Demasters BK, Donson AM, Birks DK, Foreman NK, Rush SZ. Pediatric Brainstem Gangliogliomas Show Overexpression of Neuropeptide Prepronociceptin (PNOC) by Microarray and Immunohistochemistry. Pediatr Blood Cancer (2012) 59:1173–9. doi: 10.1002/pbc.24232
34. Geissler K, Fornara P, Lautenschl Ger C, Holzhausen H-J, Seliger B, Riemann D. Immune Signature of Tumor Infiltrating Immune Cells in Renal Cancer. Oncoimmunology (2015) 4:e985082. doi: 10.4161/2162402X.2014.985082
35. Chevrier S, Levine JH, Zanotelli VRT, Silina K, Schulz D, Bacac M, et al. An Immune Atlas of Clear Cell Renal Cell Carcinoma. Cell (2017) 169:736–49.e18. doi: 10.1016/j.cell.2017.04.016
36. Marichal T, Tsai M, Galli SJ. Mast Cells: Potential Positive and Negative Roles in Tumor Biology. Cancer Immunol Res (2013) 1:269. doi: 10.1158/2326-6066.CIR-13-0119
37. Wan SS, Kim HJ, Kang ES, Ahn CW, Lim SK, Lee HC, et al. The Association of Total and Differential White Blood Cell Count With Metabolic Syndrome in Type 2 Diabetic Patients. Diabetes Res Clin Pract (2006) 73:284–91. doi: 10.1016/j.diabres.2006.02.001
38. Sunadome H, Matsumoto H, Izuhara Y, Nagasaki T, Kanemitsu Y, Ishiyama Y. Correlation Between Eosinophil Count, its Genetic Background and Body Mass Index: The Nagahama Study. Allergol Int (2020) 69:46–52. doi: 10.1016/j.alit.2019.05.012
39. Calco GN, Fryer AD, Nie Z. Unraveling the Connection Between Eosinophils and Obesity. J Leukocyte Biol (2020) 108:123–8. doi: 10.1016/j.alit.2019.05.012. doi: 10.1002/JLB.5MR0120-377R
Keywords: obesity, breast cancer, TCGA, GEO, prognostic model
Citation: Sun N, Ma D, Gao P, Li Y, Yan Z, Peng Z, Han F, Zhang Y and Qi X (2021) Construction of a Prognostic Risk Prediction Model for Obesity Combined With Breast Cancer. Front. Endocrinol. 12:712513. doi: 10.3389/fendo.2021.712513
Received: 20 May 2021; Accepted: 27 July 2021;
Published: 09 September 2021.
Edited by:
Andrea P. Rossi, Integrated University Hospital Verona, ItalyReviewed by:
Marco Rossato, University of Padua, ItalyChi Kin Ip, Garvan Institute of Medical Research, Australia
Copyright © 2021 Sun, Ma, Gao, Li, Yan, Peng, Han, Zhang and Qi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yi Zhang, Wlk1MzgxMEAxNjMuY29t; Xiaowei Qi, cXh3OTkwOEBmb3htYWlsLmNvbQ==
†These authors have contributed equally to this work