- 1The Key Laboratory of Carbohydrate Chemistry & Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China
- 2State Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, China
- 3Department of Breast Surgery, Fujian Medical University Union Hospital, Fuzhou, Fujian, China
- 4Department of General Surgery, Fujian Medical University Union Hospital, Fuzhou, Fujian, China
- 5Breast Cancer Institute, Fujian Medical University, Fuzhou, Fujian, China
- 6Department of Pathology, Fujian Medical University Union Hospital, Fuzhou, Fujian, China
Background: Breast cancer is one of the most frequently occurring malignant cancers worldwide. Invasive ductal carcinoma (IDC) and invasive lobular carcinoma (ILC) are the two most common histological subtypes of breast cancer. In this study, we aimed to deeply explore molecular characteristics and the relationship between IDC and ILC subtypes in luminal A subgroup of breast cancer using comprehensive proteomics and phosphoproteomics analysis.
Methods: Cancer tissues and noncancerous adjacent tissues (NATs) with the luminal A subtype (ER- and PR-positive, HER2-negative) were obtained from paired IDC and ILC patients respectively. Label-free quantitative proteomics and phosphoproteomics methods were used to detect differential proteins and the phosphorylation status between 10 paired breast cancer and NATs. Then, the difference in protein expression and its phosphorylation between IDC and ILC subtypes were explored. Meanwhile, the activation of kinases and their substrates was also revealed by Kinase-Substrate Enrichment Analysis (KSEA).
Results: In the luminal A breast cancer, a total of 5,044 high-confidence proteins and 3,808 phosphoproteins were identified from 10 paired tissues. The protein phosphorylation level in ILC tissues was higher than that in IDC tissues. Histone H1.10 was significantly increased in IDC but decreased in ILC, Conversely, complement C4-B and Crk-like protein were significantly decreased in IDC but increased in ILC. Moreover, the increased protein expression of Septin-2, Septin-9, Heterogeneous nuclear ribonucleoprotein A1 and Kinectin but reduce of their phosphorylation could clearly distinguish IDC from ILC. In addition, IDC was primarily related to energy metabolism and MAPK pathway, while ILC was more closely involved in the AMPK and p53/p21 pathways. Furthermore, the kinomes in IDC were primarily significantly activated in the CMGC groups.
Conclusions: Our research provides insights into the molecular characterization of IDC and ILC and contributes to discovering novel targets for further drug development and targeted treatment.
Introduction
Breast cancer is prevalent in women, with approximately 1.7 million cases diagnosed worldwide every year, representing a major leading cause of cancer-related death in women (1). Breast cancer is a heterogeneous disease with diverse clinical behaviors and histopathological patterns, and the heterogeneity of breast cancer biology represents major challenges to the drive for personalized treatment (2). Invasive ductal carcinoma (IDC) and invasive lobular carcinoma (ILC) constitute the two most common histological subtypes of breast cancer, ranging from 66-71% and 6-10% of cases, respectively (3, 4). IDC has played a dominant role in breast cancer, but the incidence of ILC is increasing (5, 6), and studies have evaluated the different clinical characteristics of these subtypes (7, 8). Compared to IDC patients, ILCs present with larger tumor sizes, more frequent lymph node invasion and higher probabilities of estrogen receptor (ER) and progesterone receptor (PR) positivity (9, 10). Clinicopathological analyses indicated that the overall outcome of ILC did not significantly differ from that of the common IDC type (11), and these tumors are currently treated similarly (12, 13). Moreover, studies suggested that the prognosis, long-term outcomes, aggressive and metastasis are different between IDC and ILC even with the specific molecular subgroup (14).
Recently, comprehensive genomic and transcriptomic studies of IDC and ILC have revealed differences between these two subtypes (15–17). Due to alternative splicing, RNA transcripts and proteins are not typically ‘one-to-one’ corresponding relationships (18). Additionally, a variety of protein posttranslational modifications (glycosylation, methylation, phosphorylation, ubiquitination) also contribute to proteoforms exhibiting more diversity than genes (19). Protein phosphorylation is one of the most important posttranslational modifications and plays a key role in protein activity regulation, signal transduction, and intercellular communication (20). Dysregulated protein phosphorylation usually leads to the onset of various malignant diseases, which could be developed as potential candidates for their detection, especially in cancers (21). Protein expression and modification, which directly indicates the pathological status of cancer tissues, have not been evaluated in IDC and ILC. MS-based proteomics has emerged as the most important and powerful tool for protein and phosphorylation analysis. Phosphoproteomics provides additional information, offering a method to qualify and quantify the state of kinase-dependent pathways and provides detailed posttranslational phosphorylation information (22, 23). The activity of kinases, involving the transfer of a phosphate group to a protein, and phosphatases, which remove a phosphate group from a protein, modulate these two enzymatic processes and modulate protein phosphorylation in cells in response to an external stimuli (24). To date, approximately 538 known kinases have been identified in the human genome, most of which are associated with human cancer initiation and progression (21). These kinases maintain cellular function by activating protein function, while corresponding phosphatases reverse this action (25, 26). These counter mechanisms greatly improve the plasticity of the epigenome by regulating protein activity.
In this study, we conducted a comprehensive proteomics and phosphoproteomics analysis on 10 paired luminal A breast cancer patients (5 paired IDC vs 5 paired ILC) to distinguish IDC and ILC from their protein expression levels, their degree of protein phosphorylation and kinase activity. These findings will help clarify the molecular patterns and different pathways of different luminal A breast cancer subtypes and help improve the sensitivity and accuracy of detecting these two subtypes.
Materials and methods
Patients and clinical specimens
The paired luminal A breast cancer samples used in this study were obtained from patients who underwent breast surgery (total mastectomy or breast-conserving surgery) in the Department of Breast Surgery, Fujian Medical University Union Hospital. Patients were randomly selected from October 2017 to October 2019. None of the patients underwent any neoadjuvant chemotherapy or radiotherapy prior to surgery. Surgically resected primary tumor tissues and paired noncancerous adjacent tissues (> 3 cm apart from the tumor edge) were collected from 5 patients with invasive ductal carcinoma and 5 patients with invasive lobular carcinoma. All patients were histopathologically diagnosed with the luminal subtype (ER- and PR-positive, HER2-negative; HER2 status was strictly defined according to ASCO guidelines). Clinical information was collected, including age, tumor size, status of axillary lymph node metastasis, AJCC stage and histological grade. Each specimen was collected intraoperatively and immediately transferred to a sterile tube, snap frozen in liquid nitrogen and stored at -80°C until use.
Tissue homogenization and protein extraction
Frozen tissue samples were homogenized in lysis buffer containing a protease inhibitor cocktail. The lysis buffer contained 8 M urea, 75 mM NaCl, 1 M NH4HCO3, 50 mM Tris (pH 8.0), 1 mM EDTA, 2 μg/mL aprotinin, 10 μg/mL leupeptin, 1 mM PMSF (Phenylmethanesulfonyl fluoride), 1 mM PIC2, and 1 mM PIC3. All reagents were purchased from Sigma–Aldrich. Phosphatase inhibitor cocktail 2/3 was used at a 1:100 dilution (v/v) with 20 μM PUGNAc (O-(2-Acetamido-2-deoxy-D-glucopyranosylidenamino) N-phenylcarbamate), and 10 mM NaF (sodium fluoride). The tissues were homogenized for 10 min in 25% rated power with 20 s intervals per 10 s sonication using ultrasonic crushing (Scientz, Ningbo, China). The protein concentrations were measured by a bicinchoninic acid (BCA) assay kit (Beyotime, Shanghai, China), and the extracted proteins were stored at -80°C until use.
Protein digestion by trypsin
Protein (1 mg) was reduced in 10 mM dithiothreitol (DTT, Sigma–Aldrich, U.S.) at 56°C for 45 min and alkylated by the addition of 20 mM iodoacetamide (IAM, Sigma–Aldrich, U.S.) at room temperature for 45 min in the dark. The samples were diluted 8-fold in 40 mM NH4HCO3. Trypsin (Promega, U.S.) with an enzyme/protein ratio of 1:50 (w/w) was added to the solution and incubated at 37°C overnight. All digested samples were desalted by running through C18 columns (Sep-Pak tC18 cartridge, Waters, U.S.) and dried by Speed-Vac (EYELA-UT-1000, Japan). In the experiment, we routinely obtained over 800 μg of tryptic peptide through digestion and C18 clean-up processes. From the total tryptic peptide, 10% was set apart for conventional global proteomics analysis, while the remaining sample was used for sequential IMAC enrichment. Finally, peptide (1 μg) from each group was subjected to LC−MS/MS on an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher, U.S.) (27).
Phosphopeptide enrichment
IMAC enrichment was performed using freshly prepared Fe3+-NTA agarose beads that were made by conjugating Ni2+-NTA agarose beads (Qiagen, Germany) and FeCl3 (Sigma–Aldrich, U.S.) aqueous solution. The Fe3+-NTA agarose beads were packaged to the top of C18 stagetip. Each sample was reconstituted in 80% (v/v) acetonitrile (ACN) and 0.1% (v/v) trifluoroacetic acid (TFA) binding/washing buffers and incubated with 25% (v/v) Fe3+-NTA agarose beads for 30 min at room temperature. the supernatants were collected by centrifugation. The beads were stacked on the stage tips and washed three times. The phosphopeptides were eluted in potassium phosphate buffer (500 mM KH2PO4, pH 7) and 50% (v/v) ACN and 0.1% (v/v) formic acid (FA). Then, the peptides were eluted, dried, and stored at -80°C prior to LC–MS/MS analysis of phosphoproteomics (28).
LC–MS/MS analysis
The dried global peptides and phosphopeptides were resuspended in 10 μL 2% ACN and 0.1% FA solution and then analyzed using an EASY-nLC 1200 system (Thermo Scientific, San Jose, CA) coupled with a high-resolution Orbitrap Fusion Lumos spectrometer (Thermo Scientific, San Jose, CA). Each injection volume was 3 μL. The samples were first separated on an EASY-nLC 1200 system (Thermo Scientific, San Jose, CA) in an RSLC C18 column (1.9 μm×100 μm×20 cm) packed in house. All samples were subjected to two LC−MS/MS runs on an Orbitrap Fusion Lumos mass spectrometer (Thermo Scientific, San Jose, CA). Data-dependent higher-energy collisional dissociation (HCD) fragmentation was performed on the top 20 most abundant ions. For the global peptides, the mobile phase consisting of water (A) and 0.1% FA and 90% ACN (B) was subjected to a gradient elution of 3-8% B, 8 min; 8–28% B, 80 min, 28–32% B, 22 min; 32–80% B, 5 min; and 80% B, 5 min. The flow rate was maintained at 450 nL/min. The spray voltage was in static mode. Spectra (AGC target of 4 × 10 (5) and maximum injection time of 50 ms) were collected from 350 to 2000 m/z at a resolution of 60,000, followed by data-dependent HCD MS/MS (at a resolution of 30,000, HCD collision energy 34%, stepped collision energy 5%, AGC target 5 × 10 (4), maximum injection time 35 ms and microscans 1). Charge state screening was enabled to reject singly charged ions and ions with more than eight charges. A dynamic exclusion time of 45 s was used to discriminate newly selected from previously selected ions. For the phosphopeptides, the gradient elution was 2-22% B, 88 min; 22-32% B, 22 min, 32-80% B, 5 min; and 80% B, 5 min. The flow rate was maintained at 550 nL/min. Spectra (AGC target 4 × 10 (5) and maximum injection time 50 ms) were collected from 350 to 1550 m/z at a resolution of 60,000, followed by data-dependent HCD MS/MS (at a resolution of 30,000, HCD collision energy 32%, stepped collision energy 5%, AGC target 5 × 10 (4), maximum injection time 35 ms and microscans 1).
Identification of global proteins and phosphoproteins
For the global peptides and phosphopeptides, the acquired MS/MS spectra were searched using MaxQuant software with the Homo sapiens_UniProt database updated in June 2019, which contains 20,432 proteins (29). The parameters were set as default if not otherwise stated. The enzyme specificity was set to trypsin/P (the C-terminus of Arg or Lys with cleavage at the proline bond allowed), and the maximum number of missed cleavage sites was set as two. Carbamidomethyl (C, 43.028) was set as the fixed modification. The deamidation (N, 29.018) and phosphorylation (S/T/Y, 97.977) of proteins were set as the variable modifications. Proteins and peptides were identified using a target-decoy approach in revert mode and quantified using intensity data (peak area of extracted ion chromatography) using the Andromeda search engine integrated into the MaxQuant environment. MaxQuant was searched with a fragment ion mass tolerance of 0.02 Da and a parent ion tolerance of 10.0 PPM. The false discovery rates (FDRs) of protein groups, peptides, and phosphosites were less than 0.01. The cutoff criterion for the localization probability at each phosphosite was greater than 0.75.
Data processing and visualization
To classify the data, a PCA score plot and correlation heatmap were generated using OmicStudio (https://www.omicstudio.cn/tool). Basic statistical analysis of the data was performed using Excel to obtain information such as the types of identified proteins and phosphoproteins and the number of modified sites. Quantitative analysis of data (standardized analysis and differential protein screening) was performed according to the modified global normalization formula (30). Screening of differential proteins was performed according to the following conditions: P-value < 0.05, fold change >1.50 or <0.67, and significant count pair percentage >60% (more than 3 pairs of the 5 pairs reached a fold change of 1.50 with an accordant trend). Missing values were not included in this quantitative analysis. A two-tailed Welch t-test was conducted to determine the statistical significance of the differences in protein expression or phosphorylation modification between noncancerous and cancer tissues. Volcano plots of differential proteins and phosphoproteins (P-value < 0.05, fold change >1.5 or <0.67) were generated using OmicStudio and R studio (R version 4.0.1, https://www.r-project.org/). Clustering heatmaps were generated using TBtools (https://github.com/CJ-Chen/TBtools) (31). Gene set enrichment analysis (GSEA) was performed to reveal the protein function and involved pathways of the differential proteins using ClusterProfiler (32) and Enrichplot packages (Bioconductor). Gene Ontology (GO) enrichment analysis and KEGG enrichment analysis were performed using R studio, Clusterprofiler packages and Database for Annotation, Visualization and Integrated Discovery (DAVID) (Version 6.8, https://david.ncifcrf.gov/) (33). Significantly enriched pathways were retrieved by searching against KEGG databases. Kinase-substrate enrichment analysis (KSEA) was performed to reveal the activation/inactivation of kinase using the KSEA algorithm (34, 35) using the PhosphoSitePlus database [https://www.phosphosite.org/] according to the following conditions: P-value cutoff < 0.05, substrate count cutoff ≥ 3. An activated-inactivated kinase location analysis plot was generated using the kinMAP app (http://www.kinhub.org/kinmap/citation.html) (36).
Results
Protein expression and phosphorylation between IDC and ILC breast cancer
To elucidate differences in the proteome and phosphoproteome profiles of IDC and ILC, we assembled 10 paired breast cancer clinical tissues and classified them into two groups based on their predominant subtype as described in the Materials and Methods section. Patient characteristics, including age, histological type, tumor stage and molecular subtypes, are summarized in Table 1.
Proteomics and phosphoproteomics data could provide an excellent chance to explore the relationships between protein expression and phosphorylation modification in different cancers. In the proteomics study, a total of 5,044 high-confidence proteins were identified from tumor and noncancerous adjacent tissues (NATs) of 10 paired breast cancer patients, in which the average identified protein number of NATs was lower than that of the cancer tissues (Figures S1A, B). Alterations in protein phosphorylation in breast cancer cells and tissues has revealed the heterogeneity of breast cancer (37, 38). However, differences in protein phosphorylation between IDC and ILC are still not very clear. In this study, a total of 3,808 phosphoproteins with 12,552 phosphopeptides were identified from breast cancer tissues. This result indicated that approximately 63% of phosphoproteins were identified in both cancer and NATs, similar to the proteomics results (Figure S1C). With respect to protein phosphorylation of the breast cancer subtypes, protein phosphorylation levels in ILC were higher than that of IDC, in which the number of phosphorylation sites in ILC were twofold the number in IDC (Figure S1D). The number of peptides with more than one phosphorylation site in IDC cancer tissues was almost equal to the number in NATs, while the number in ILC cancer tissues was markedly higher than that in NATs (Figure S1E).
To estimate the functional enrichment of the breast cancer tissue, proteins identified only in the cancer tissues of breast cancer were subjected to GO and KEGG analysis. The results indicated that these proteins were primarily involved in mitochondrial biogenesis gene expression, such as mitochondrial translation elongation and termination (Figure S2A). Mitochondria influence multiple processes that underpin tumor progression, including the proliferation of transformed cells, resistance to adverse microenvironmental conditions (39). Moreover, the proteins only in IDC tissues were mainly related with protein-containing complex disassembly while only in ILC tissues were mainly related with cellular component disassembly (Figures S2B, C). The KEGG analysis revealed that the proteins found only in cancer tissue were primarily involved the pathways of nucleocytoplasmic transport, endocytosis, and salmonella infection (Figure S3). In the IDC, proteins with multiple modified phosphorylation sites were primarily related to the spliceosome and other functions and were enriched only in the insulin signaling and mTOR signaling pathways. In the ILC, proteins with multiple phosphorylation sites were mainly related to various phosphorylation-related signaling pathways, such as the insulin signaling pathway, thyroid hormone signaling pathway, and AMPK signaling pathway (Figure S4).
Protein and phosphorylation features between IDC and ILC breast cancer subtypes
To quantitatively distinguish protein expression between IDC and ILC breast cancer subtypes, 1,259 proteins were identified as high-patient-coverage proteins presented in more than 60% of 10 paired luminal A breast cancer patients (Figure 1A). From the relative abundance ratio distribution (fold change of protein relative abundance between the breast cancer tissue and NATs), protein expression was globally upregulated in all cancer tissues compared to NATs, in which the ratio distribution moved to 1 (Figure S5A). Correlation analysis of the cancer and NATs proteomics revealed that the Pearson’s correlation parameters of the cancer tissue with all other cancer samples were significantly higher than those of paired NATs (Figure S5B). The PCA results showed that the cancer tissues and the NATs of luminal A breast cancer can be clearly classified, but it is difficult to globally distinguish the IDC and ILC subtype of luminal A breast cancer (Figures 1B, C and Figure S6). The heatmap of the proteins showed that cancer and NAT samples were clustered into two groups, but it was also difficult to distinguish the two subtypes (Figure 1D). All these proteins from luminal A breast cancer were enriched in protein translation and transfer biological processes and the cellular macromolecule catabolic process and mRNA metabolic process were upregulated (Figures 1E, F). While the ribonucleoprotein complex related proteins and RNA binding related proteins were also upregulated in the cancer tissues (Figures 1G, H). The KEGG analysis of proteins indicated relative upregulation of molecular metabolic and biosynthetic process proteins, as well as ribosome and spliceosome pathway proteins in the luminal A breast cancer tissues, which were associated with a decrease in cell-ECM receptor interaction and PPAR signaling pathway proteins (Figure 1I).
Figure 1 Proteome characterization of luminal A breast cancer. (A) Flowchart of data filter and normalization analysis of proteins identified from luminal A breast cancer. (B) PCA of 1,259 proteins in 10 paired luminal A breast cancer tissues. Red: tumor, and green: NAT. (C) PCA of 1,259 proteins identified from the cancer and NATs tissues. Red: IDC cancer, and green: ILC cancer. (D) Heatmap of proteins in tumors and NATs and their associated clinical information showing expression of proteins in cancer or NAT samples. (E-H) GSEA results of luminal A breast cancer tissues revealed Gene Ontology (biological process, cellular component, molecular function) results associated with important metabolic functions. (I) GSEA plots revealed KEGG pathways associated with classical cancer functions.
As shown in the volcano plot for different subtypes of breast cancer, 457 proteins in IDC and 405 proteins in ILC were significantly upregulated (fold change, FC>1.5) compared with their NATs. Moreover, 16 proteins in IDC and 40 proteins in ILC were significantly downregulated in ILC (FC<0.67) compared with their NATs. Among the top 5 differential proteins, VIGLN (Vigilin) was upregulated and SODE (extracellular superoxide dismutase) was downregulated in both the IDC and ILC subtypes (Figures 2A, B). Furthermore, the differential proteins in IDC and ILC were separated into four groups using the average FC to distinguish the two subtypes. Approximately 99.47% of differential proteins were increased or decreased in both subtypes. Only one protein (H1X, Histone H1.10) was markedly increased in IDC but decreased in ILC, while two proteins (CO4B [Complement C4-B] and CRKL [Crk-like protein]) were obviously decreased in IDC but increased in ILC (Figure 2C). Additionally, immunohistochemistry data from the Human Protein Atlas (HPA) were applied to validate expression differences between IDC and ILC (40). Compared to ILC tumor tissues in immunohistochemistry, expression of H1X in IDC was remarkably upregulated, the expression of CRKL was significantly downregulated (http://www.proteinatlas.org/). The verification results basically consistent with our proteomic results. Moreover, these three proteins may represent potential biomarkers to distinguish IDC and ILC from the PCA results (Figure 2D). The significantly upregulated proteins RAB14 (Ras-related protein Rab-14) and HCFC1 (Host cell factor 1) in IDC and ILC with the ER- and PR-positive, HER2-negative patients (cancer No. 415 and NATs No. 112) were analyzed from mRNA level based on the TCGA database. It showed that the tendency of mRNA level is similar with the protein expression between the cancer and NATs. From the survival analysis for ER- and PR-positive, HER2-negative patients using TCGA database, patients with RAB14high and HCFC1high had significantly shorter overall survival time than those with RAB14low and HCFC1low (Figures 2E, F).
Figure 2 Proteome relative abundance difference and clinical significance of IDC and ILC from luminal A breast cancer. (A), (B) Differential proteins between tumors and NATs from the IDC and ILC subtypes. Red: upregulated proteins blue: downregulated proteins (P-value < 0.05 from t-test and fold change > 1.50). (C) Intertumor comparative analysis of differential proteins from IDC and ILC. The second quadrant shows that the proteins are downregulated in IDC and upregulated in ILC, and the fourth quadrant shows the opposite. (D) PCA of selected proteins in 10 paired cancer tissues. Red: IDC cancer, and green: ILC cancer. (E-F) The mRNA expression and Kaplan-Meier survival analysis of significantly differential proteins (Ras-related protein Rab-14: RAB14; Host cell factor 1: HCFC1) in the Luminal A breast cancer using TCGA database. *, p<0.05.
To quantitatively investigate protein phosphorylation patterns in IDC and ILC tissues, the phosphoproteins derived from the IDC and ILC patient tissues were analyzed using the workflow shown in Figure 3A. A total of 560 phosphoproteins were all identified in greater than 60% of all paired luminal A breast cancer tissues with at least one phosphorylation site. These proteins were enriched in focal adhesion, MAPK and insulin signaling pathways (Figure S7). Similar to the proteomics results, differential phosphoproteins between IDC and ILC were screened by fold change (FC). In addition, 27 and 30 phosphoproteins were significantly differentially detected between IDC and ILC. Among these differential phosphoproteins, only two proteins (DCLK1 and FLNC) overlapped in both IDC and ILC, in which DCLK1, a kinase mostly involved in calcium-signaling, was significantly upregulated in IDC and downregulated in ILC, while FLNC, a large actin-cross-linking protein, was significantly downregulated in both IDC and ILC compared with their NATs (Figures 3B, C). These results indicated that differences in protein phosphorylation are more obvious than differences in protein expression. Among these significantly differential proteins/phosphoproteins, only ten differential phosphoproteins were also characterized as differential proteins by proteomics (Figures 3D, E). From the heatmap, ten proteins were clustered into three groups: (1) those that were upregulated in both ILCs and IDCs, while phosphorylation of these proteins was upregulated in IDCs but downregulated in ILCs. The PCA confirmed that these criteria clearly separates IDC and ILC (Figure 3F); (2) those that were upregulated in both ILCs and IDCs, while phosphorylation of these proteins was downregulated in IDCs but not in ILCs (Figure 3G); and (3) those that were maintained in IDCs and upregulated in ILCs, while phosphorylation of these proteins was significantly downregulated and maintained in ILCs (Figure 3H). In brief, our study established a comprehensive landscape of luminal A breast cancer for two subtypes at the proteome and phosphoproteome levels.
Figure 3 Phosphoproteome characterization of luminal A breast cancer. (A) Flowchart of data filter and normalization analysis of phosphoproteins identified from luminal A breast cancer. (B, C) Differential phosphoproteins between luminal A breast cancer tissues and their NATs. Red: Upregulated proteins, blue: Downregulated (P-value < 0.05 from t.-test and fold change > 1.50). (D) Overlap of differential proteins (Blue) and differential phosphoproteins (Red). (E) Protein expression levels and phosphorylation status of 10 selected phosphoproteins in IDC and ILC. (F) Left: Expression of SEPT2, SEPT9, ROA1 and KTN1 proteins and their phosphorylation status in IDC and ILC (P-value from Wilcoxon test). Right: PCA of these four proteins in 10 paired breast cancer tissues. Red, IDCs; green, ILCs. (G) Expression of the MAP1B, SC22B and CAVN1 proteins and their phosphorylation status in IDC and ILC (P-value from Wilcoxon test). (H) Expression of DREB, CALD1 and UTRO proteins and their phosphorylation status in IDC and ILC (P-value from Wilcoxon test). *, p<0.05; **, p<0.01.
Probing kinase activation differences between IDC and ILC
Site-selective phosphorylation of proteins is catalyzed by protein kinases (PKs), and these phosphorylation events play a fundamental role in regulating cellular functions. MS-based (phospho)proteomics provides a global and unbiased survey of kinase expression, activation and signaling. Many of the characterized protein functions in luminal A breast cancer participate in cancer-related signaling pathways, which are closely related to the phosphorylation and dephosphorylation of proteins and induce the activation or inhibition, respectively, of the signaling pathway. The phosphoproteomic shotgun experiment on luminal A breast cancer and NATs tissues detecting a total of >25,414 phosphorylation sites were examined to assess the activities of the kinases using Kinase-Substrate Enrichment Analysis (KSEA), and the KinMAP tool was used to analyze the location of kinases and to identify the location and distribution of the kinases in IDC and ILC (35, 41). The activities of 93 and 131 kinases were modulated in IDC and ILC, respectively. In addition, 89 of the kinases were commonly identified in both subtypes (Figure 4A). Kinases are categorized into ten groups based on statistical sequence analysis (42). The greatest number of identified phosphorylated protein kinases (39) belonged to the CMGC group, followed by the AGC group (26) and CAMK group (25) (Figure 4B). The activities of 13 and 4 kinases were predicted to be significantly different in IDC and ILC tissues compared to their activities in paired NATs (Figure 4C and Figure S8). In IDC, kinases MAPK13, MAPK3, MAPK8, CDK2, and GSK3B, which belong to the CMGC groups, were significantly activated, while SGK1, LATS1, KPCZ, STK38, and KS6B1, which belong to the AGC groups, were significantly inactivated. In addition, in ILCs, only one kinase, AURKB, which belongs to the AGC group, was significantly activated, while PDPK1 and SGK1 belong to the AGC group. PDK2, an atypical protein kinase, was significantly inactivated (Figure 4D). The target drugs for breast cancer were located in the CMGC group as inhibitors of CDK4/6, which are involved in the cell cycle control pathway and are checkpoints of G1/S phase. CDK2, which is also involved in the cell cycle control pathway, was significantly activated in IDC. Interestingly, the activities of PDPK1 and mTOR located in the mTOR signaling pathway were completely opposite between IDC and ILC (Figure S9).
Figure 4 Kinase regulation in luminal A breast cancer by KSEA analysis. (A) Venn diagrams of identified kinases from IDCs and ILCs. Blue: IDCs; red: ILCs. (B) The distribution of 135 identified kinases in each group of human kinases. (C) Kinase regulation characterization in IDC and ILC. Kinases with a P-value < 0.05 based on the KSEA algorithm are shown. These kinases are predicted to be activated and inactivated in cancer tissue. (D) Compendium of the kinome detected in IDC and ILC under standard conditions. The colored panes with red or green showed the identified kinases in IDC and ILC. Red means the kinase is activated while green means the kinase is inactivated. The star in the panes means the difference of kinase is significant between IDC and ILC.
Discussion
The molecular characteristics of invasive ductal carcinoma (IDC) and invasive lobular carcinoma (ILC) in luminal A breast cancer remain elusive, and it remains challenging to distinguish them, especially late distant recurrence of ILC, which displays aggressive metastatic behavior associated with stage-matched IDC. Therefore, current clinical practice guidelines recommend similar treatment paradigms for both histologic subtypes (43), but the prognosis and survival of these two subtypes are different even they are both ER/PR positive and Her2 negative. Previous studies have revealed the influence of copy number variations (CNVs) on important cancer-related genes of IDC and ILC using genomics methods and found that over half of ILCs differed from IDCs in global transcription programs (17, 44). However, the distinct profiles at the proteome and phosphoproteome levels, which directly identify the difference between IDC and ILC, were not evaluated in previous studies. In this study, approximately 60% of proteins and phosphoproteins were commonly detected between the breast cancer and NATs, while approximately 65% of proteins and phosphoproteins were commonly between the IDC and ILC subtypes. Among the significantly differential proteins, greater than 99% exhibited the same tendency, and only 3 proteins displayed opposite behaviors between IDC and ILC compared with their NATs. Combining significantly differentially proteomes with phosphoproteomes, 10 proteins were selected and clustered into three groups, which contributed to distinguishing IDCs and ILCs with respect to protein and phosphorylation status (Table 2). From previous studies, we know that phosphoproteomics, especially cancer phosphoproteomics, provides comprehensive insights regarding kinases that are typically targeted for therapeutic applications (27). In our study, approximately 12,552 phosphopeptides were matched to 135 protein kinases with nine groups. This result indicates that markedly activated kinases were primarily located in the CMGC group, which is the most productive class of drug targets.
Table 2 Differential proteins and their phosphorylation between IDC and ILC compared with their respective NATs.
The most striking proteomics and phosphoproteomics features distinguished molecular features connecting the clinical, pathological, and prognostic features of ILC and IDC. In a previous study, the E-cadherin gene was identified as a marker to distinguish between IDC and ILC breast cancer, in which ILC may achieve invasive growth through loss of E-cadherin (15). In our proteomics results, we only observed E-cadherin in IDC tissue samples. From the different gene expression pattern studies, several genes, such as cathepsin B, survival, and TP11, were identified to distinguish between IDC and ILC. Expression of these genes like cathepsin B are elevated in many but not all cancers. However, the promiscuous expression of these genes especially for cathepsin B in tumor cells raises questions related to safety and specificity (45). Overall, it is difficult to distinguish between IDC and ILC at the gene expression level. In our study, protein expression levels of histone H1.10, complement C4-B, and Crk-like protein were significantly different between the IDC and ILC. Histone H1 is highly expressed, particularly in cancers, such as prostate cancer, and serves as a target for the delivery of the therapeutic drug doxorubicin (DOX) (46). It explained the differential sensitivity of DOX in the clinical treatment of IDC and ILC. Meanwhile, complement C4-B and Crk-like protein, which are involved in the immune system, were highly expressed in breast cancer, but differences between IDC and ILC are not well characterized. Crk-like protein can bind to the plexin-semaphoring-integrin (PSI) domain of β1 integrin and activate the mitogen-activated protein (MAP) kinase pathway, leading to nuclear transcription and enhanced cell division and migration (47). Septins are cytoskeletal proteins associated with GTP binding and participate in membrane interactions, which contribute to metastatic cancer cell dissemination and invasion and are mostly associated with the RAS oncogene. It has been reported that SEPT2 may impact two parallel pathways, p53/p21 and MEK/ERK, in cancer cells and mediate proliferation via regulation of cellular metabolic proteins (48, 49). Septins are related to the phosphorylation of MEK1/2 and downstream of ERK1/2, and loss of septin phosphorylation could lead to defects in morphogenesis and cytokinesis (50). Human CMGCs and AGC are known to regulate a variety of cell growth, proliferation, survival, and anti-apoptosis activities and extensively participate in the control of cell fate decisions; thus, they have been of interest in cancer research.
IDC and ILC tumors are currently treated similarly and have similar outcomes. The inability of unsupervised clustering to distinguish between the two tumor types suggests that many proteins were expressed in common, and subtle differences in protein expression may be responsible for the phenotypic differences that exist between these two subtypes. In particular, IDC and ILC exhibited striking differences in the expression of proteins associated with cell adhesion and invasion, suggesting that they may achieve invasive growth through distinct mechanisms, similar to the gene analysis. Our research provides insights to clarify the molecular characterization of IDC and ILC and contributes to discovering novel targets for further drug development and targeted treatment. However, because of strict filtering, the sample number is not huge enough. We will verify the candidate proteins carefully in the further study.
This study revealed that the proteome, phosphoproteome and kinome possess unique features and, when appropriately integrated, convey new insights and opportunities to discover novel biomarkers and even powerful drug targets.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: MassIVE via accession ID: MSV000091530.
Ethics statement
The studies involving human participants were reviewed and approved by Ethics Committee of Fujian Medical University Union Hospital (Fujian, China). The patients/participants provided their written informed consent to participate in this study.
Author contributions
GY, X-DG, FF, and CYZ conceived and designed experiments. CYZ performed the majority of experiments. CYZ and XZ performed the majority of data and statistical analysis. GY, CYZ, and X-DG wrote and revised the manuscript. YL, HX, and MJ collected breast cancer tissues, clinical patient information and assisted data analysis. PW, Wenyan, and CRZ assisted sample processing of LC-MS. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by grants from the Program of Introducing Talents of Discipline to Universities (No. 111-2-06), National Natural Science Foundation of China (No. 21778023), Top-notch Academic Programs Project of Jiangsu Higher Education Institutions, and Natural Science Foundation of Fujian Province (No. 2021J01737) and Shandong Provincial Major Scientific and Technological Innovation Project (No. 2019JZZY011006). We thank Wenyan Chen from School of Biotechnology ,Jiangnan University for her contribution in the sample processing.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1127446/full#supplementary-material
References
1. Ginsburg O, Bray F, Coleman MP, Vanderpuye V, Eniu A, Kotha SR, et al. The global burden of women’s cancers: a grand challenge in global health. Lancet (2017) 389:847–60. doi: 10.1016/S0140-6736(16)31392-7
2. Daily K, Ho Sui SJ, Schriml LM, Dexheimer PJ, Salomonis N, Schroll R, et al. Molecular, phenotypic, and sample-associated data to describe pluripotent stem cell lines and derivatives. Sci Data. (2017) 4:170030. doi: 10.1038/sdata.2017.30
3. Krug K, Jaehnig EJ, Satpathy S, Blumenberg L, Karpova A, Anurag M, et al. Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy. Cell. (2020) 183:1436–56 e31. doi: 10.1016/j.cell.2020.10.036
4. Li CI, Anderson BO, Daling JR, Moe RE. Trends in incidence rates of invasive lobular and ductal breast carcinoma. JAMA. (2003) 289:1421–4. doi: 10.1001/jama.289.11.1421
5. Nelson RA, Guye ML, Luu T, Lai LL. Survival outcomes of metaplastic breast cancer patients: results from a US population-based analysis. Ann Surg Oncol (2015) 22:24–31. doi: 10.1245/s10434-014-3890-4
6. Wong SM, Freedman RA, Sagara Y, Stamell EF, Desantis SD, Barry WT, et al. The effect of paget disease on axillary lymph node metastases and survival in invasive ductal carcinoma. Cancer. (2015) 121:4333–40. doi: 10.1002/cncr.29687
7. Cristofanilli M, Gonzalez-Angulo A, Sneige N, Kau SW, Broglio K, Theriault RL, et al. Invasive lobular carcinoma classic type: response to primary chemotherapy and survival outcomes. J Clin Oncol (2005) 23:41–8. doi: 10.1200/JCO.2005.03.111
8. Pestalozzi BC, Zahrieh D, Mallon E, Gusterson BA, Price KN, Gelber RD, et al. Distinct clinical and prognostic features of infiltrating lobular carcinoma of the breast: combined results of 15 international breast cancer study group clinical trials. J Clin Oncol (2008) 26:3006–14. doi: 10.1200/JCO.2007.14.9336
9. Ferlicot S, Vincent-Salomon A, Medioni J, Genin P, Rosty C, Sigal-Zafrani B, et al. Wide metastatic spreading in infiltrating lobular carcinoma of the breast. Eur J Cancer. (2004) 40:336–41. doi: 10.1016/j.ejca.2003.08.007
10. Li CI, Uribe DJ, Daling JR. Clinical characteristics of different histologic types of breast cancer. Br J Cancer. (2005) 93:1046–52. doi: 10.1038/sj.bjc.6602787
11. Silverstein MJ, Lewinsky BS, Waisman JR, Gierson ED, Colburn WJ, Senofsky GM, et al. Infiltrating lobular carcinoma. is it different from infiltrating duct carcinoma? Cancer. (1994) 73:1673–7. doi: 110.1002/1097-0142(19940315)73:6<1673::aid-cncr2820730620>3.0.co;2-b
12. Peiro G, Bornstein BA, Connolly JL, Gelman R, Hetelekidis S, Nixon AJ, et al. The influence of infiltrating lobular carcinoma on the outcome of patients treated with breast-conserving surgery and radiation therapy. Breast Cancer Res Treat (2000) 59:49–54. doi: 10.1023/A:1006384407690
13. Sastre-Garau X, Jouve M, Asselain B, Vincent-Salomon A, Beuzeboc P, Dorval T, et al. Infiltrating lobular carcinoma of the breast: Clinicopathologic analysis of 975 cases with reference to data on conservative therapy and metastatic patterns. Cancer. (1996) 77:113–20. doi: 10.1002/(SICI)1097-0142(19960101)77:1<113::AID-CNCR19>3.0.CO;2-8
14. Barroso-Sousa R, Metzger-Filho O. Differences between invasive lobular and invasive ductal carcinoma of the breast: results and therapeutic implications. Ther Adv Med Oncol (2016) 8:6. doi: 10.1177/1758834016644156
15. Korkola JE, DeVries S, Fridlyand J, Hwang ES, Estep AL, Chen YY, et al. Differentiation of lobular versus ductal breast carcinomas by expression microarray analysis. Cancer Res (2003) 63:7167–75.
16. Simpson PT, Reis-Filho JS, Lambros MB, Jones C, Steele D, Mackay A, et al. Molecular profiling pleomorphic lobular carcinomas of the breast: evidence for a common molecular genetic pathway with classic lobular carcinomas. J Pathol (2008) 215:231–44. doi: 10.1002/path.2358
17. Zhao H, Langerod A, Ji Y, Nowels KW, Nesland JM, Tibshirani R, et al. Different gene expression patterns in invasive lobular and ductal carcinomas of the breast. Mol Bio Cell (2004) 15:2523–36. doi: 10.1091/mbc.e03-11-0786
18. Weigelt B, Geyer FC, Natrajan R, Lopez-Garcia MA, Ahmad AS, Savage K, et al. The molecular underpinning of lobular histological growth pattern: a genome-wide transcriptomic analysis of invasive lobular carcinomas and grade- and molecular subtype-matched invasive ductal carcinomas of no special type. J Pathol (2010) 220:45–57. doi: 10.1002/path.2629
19. An Y, Zhou L, Huang Z, Nice EC, Zhang H, Huang C. Molecular insights into cancer drug resistance from a proteomics perspective. Expert Rev Proteomics. (2019) 16:413–29. doi: 10.1080/14789450.2019.1601561
20. Arshad OA, Danna V, Petyuk VA, Piehowski PD, Liu T, Rodland KD, et al. An integrative analysis of tumor proteomic and phosphoproteomic profiles to examine the relationships between kinase activity and phosphorylation. Mol Cell Proteomics. (2019) 18:S26–36. doi: 10.1074/mcp.RA119.001540
21. Bhullar KS, Lagaron NO, McGowan EM, Parmar I, Jha A, Hubbard BP, et al. Kinase-targeted cancer therapies: progress, challenges and future directions. Mol Cancer. (2018) 17:48. doi: 10.1186/s12943-018-0804-2
22. Delom F, Chevet E. Phosphoprotein analysis: from proteins to proteomes. Proteome Sci (2006) 4:15. doi: 10.1186/1477-5956-4-15
23. Jensen ON. Modification-specific proteomics: characterization of post-translational modifications by mass spectrometry. Curr Opin Chem Biol (2004) 8:33–41. doi: 10.1016/j.cbpa.2003.12.009
24. Coussens L, Parker PJ, Rhee L, Yang-Feng TL, Chen E, Waterfield MD, et al. Multiple, distinct forms of bovine and human protein kinase c suggest diversity in cellular signaling pathways. Science. (1986) 233:859–66. doi: 10.1126/science.3755548
25. Fabbro D, Cowan-Jacob SW, Moebitz H. Ten things you should know about protein kinases: IUPHAR review 14. Br J Pharmacol (2015) 172:2675–700. doi: 10.1111/bph.13096
26. Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S. The protein kinase complement of the human genome. Science. (2002) 298:1912–34. doi: 10.1126/science.1075762
27. Abe Y, Hirano H, Shoji H, Tada A, Isoyama J, Kakudo A, et al. Comprehensive characterization of the phosphoproteome of gastric cancer from endoscopic biopsy specimens. Theranostics. (2020) 10:2115–29. doi: 10.7150/thno.37623
28. Zhou Y, Lih TM, Yang G, Chen SY, Chen L, Chan DW, et al. An integrated workflow for global, glyco-, and phospho-proteomic analysis of tumor tissues. Anal Chem (2020) 92:1842–9. doi: 10.1021/acs.analchem.9b03753
29. Tyanova S, Temu T, Cox J. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat Protoc (2016) 11:2301–19. doi: 10.1038/nprot.2016.136
30. van den Berg RA, Hoefsloot HC, Westerhuis JA, Smilde AK, van der Werf MJ. Centering, scaling, and transformations: improving the biological information content of metabolomics data. BMC Genomics (2006) 7:142. doi: 10.1186/1471-2164-7-142
31. Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, et al. TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol Plant (2020) 13:1194–202. doi: 10.1016/j.molp.2020.06.009
32. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an r package for comparing biological themes among gene clusters. OMICS. (2012) 16:284–7. doi: 10.1089/omi.2011.0118
33. Dennis G Jr., Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, et al. DAVID: Database for annotation, visualization, and integrated discovery. Genome Biol (2003) 4:P3. doi: 10.1186/gb-2003-4-5-p3
34. Casado P, Rodriguez-Prados JC, Cosulich SC, Guichard S, Vanhaesebroeck B, Joel S, et al. Kinase-substrate enrichment analysis provides insights into the heterogeneity of signaling pathway activation in leukemia cells. Sci Signal (2013) 6:rs6. doi: 10.1126/scisignal.2003573
35. Wiredja DD, Koyuturk M, Chance MR. The KSEA app: a web-based tool for kinase activity inference from quantitative phosphoproteomics. Bioinformatics. (2017) 33:3489–91. doi: 10.1093/bioinformatics/btx415
36. Eid S, Turk S, Volkamer A, Rippmann F, Fulle S. KinMap: a web-based tool for interactive navigation through human kinome data. BMC Bioinf (2017) 18:16. doi: 10.1186/s12859-016-1433-7
37. Mertins P, Mani DR, Ruggles KV, Gillette MA, Clauser KR, Wang P, et al. Proteogenomics connects somatic mutations to signalling in breast cancer. Nature. (2016) 534:55–62. doi: 10.1038/nature18003
38. Chappell K, Manna K, Washam CL, Graw S, Alkam D, Thompson MD, et al. Multi-omics data integration reveals correlated regulatory features of triple negative breast cancer. Mol Omics. (2021) 17(5):677–91. doi: 10.1039/D1MO00117E
39. Porporato PE, Filigheddu N, Pedro JMB, Kroemer G, Galluzzi L. Mitochondrial metabolism and cancer. Cell Res (2018) 28:265–80. doi: 10.1038/cr.2017.155
40. Pontén F, Jirström K, Uhlen M. The human protein atlas–a tool for pathology. J Pathol (2008) 216:387–93. doi: 10.1002/path.2440
41. Krug K, Mertins P, Zhang B, Hornbeck P, Raju R, Ahmad R, et al. A curated resource for phosphosite-specific signature analysis. Mol Cell Proteomics. (2019) 18:576–93. doi: 10.1074/mcp.TIR118.000943
42. Wilson LJ, Linley A, Hammond DE, Hood FE, Coulson JM, MacEwan DJ, et al. New perspectives, opportunities, and challenges in exploring the human protein kinome. Cancer Res (2018) 78:15–29. doi: 10.1158/0008-5472.CAN-17-2291
43. Nunes R, Sella T, Treuner K, Atkinson JM, Wong J, Zhang Y, et al. Prognostic utility of breast cancer index to stratify distant recurrence risk in invasive lobular carcinoma. Clin Cancer Res (2021) 27(20):5688–96. doi: 10.1158/1078-0432.CCR-21-0733
44. Turashvili G, Bouchal J, Baumforth K, Wei W, Dziechciarkova M, Ehrmann J, et al. Novel markers for differentiation of lobular and ductal invasive breast carcinomas by laser microdissection and microarray analysis. BMC cancer. (2007) 7:55. doi: 10.1186/1471-2407-7-55
45. Mijanovic O, Brankovic A, Panin AN, Savchuk S, Timashev P, Ulasov I, et al. Cathepsin b: A sellsword of cancer progression. Cancer Lett (2019) 449:207–14. doi: 10.1016/j.canlet.2019.02.035
46. Zhang S, Li ZT, Liu M, Wang JR, Xu MQ, Li ZY, et al. Anti-tumour activity of low molecular weight heparin doxorubicin nanoparticles for histone H1 high-expressive prostate cancer PC-3M cells. J Control Release. (2019) 295:102–17. doi: 10.1016/j.jconrel.2018.12.034
47. Ozawa MG, Cardó-Vila M, Mintz PJ, Arap W, Pasqualini R. Cracking the code for compartment-specific dual functionality proteins in cancer: The case for CRKL. Cell Cycle (2014) 9:8–9. doi: 10.4161/cc.9.1.10288
48. James NE, Cantillo E, Yano N, DiSilvestro PA, Moore RG, Ahsan N, et al. Septin-2 is overexpressed in epithelial ovarian cancer and mediates proliferation via regulation of cellular metabolic proteins. Oncotarget. (2019) 10:14. doi: 10.18632/oncotarget.26836
49. He H, Li J, Xu M, Kan Z, Gao Y, Yuan C. Expression of septin 2 and association with clinicopathological parameters in colorectal cancer. Oncol Lett (2019) 18:2376–83. doi: 10.3892/ol.2019.10528
Keywords: invasive ductal carcinoma (IDC), invasive lobular carcinoma (ILC), proteome, phosphoproteome, kinome, luminal A breast cancer
Citation: Yang G, Zuo C, Lin Y, Zhou X, Wen P, Zhang C, Xiao H, Jiang M, Fujita M, Gao X-D and Fu F (2023) Comprehensive proteome, phosphoproteome and kinome characterization of luminal A breast cancer. Front. Oncol. 13:1127446. doi: 10.3389/fonc.2023.1127446
Received: 19 December 2022; Accepted: 08 March 2023;
Published: 31 March 2023.
Edited by:
Jesús Espinal-Enríquez, National Institute of Genomic Medicine (INMEGEN), MexicoReviewed by:
Xi Chen, The Second Affiliated Hospital of Kunming Medical University, ChinaPabitra Kumar Parua, Albert Einstein College of Medicine, United States
Copyright © 2023 Yang, Zuo, Lin, Zhou, Wen, Zhang, Xiao, Jiang, Fujita, Gao and Fu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xiao-Dong Gao, eGRnYW9AaXBlLmFjLmNu; Fangmeng Fu, ZmZtQGZqbXUuZWR1LmNu
†These authors have contributed equally to this work