- 1Department of Epidemiology, College of Public Health and Health Professions and College of Medicine, University of Florida, Gainesville, FL, United States
- 2Department of Pathology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, United States
- 3Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, United States
- 4Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, United States
Background: We investigated the associations of reproductive factors known to influence breast cancer risk with the expression of breast stem cell markers CD44, CD24, and ALDH1A1 in benign breast biopsy samples.
Methods: We included 439 cancer-free women with biopsy-confirmed benign breast disease within the Nurses’ Health Study (NHS) and NHSII. The data on reproductive and other breast cancer risk factors were obtained from biennial questionnaires. Immunohistochemistry (IHC) was performed on tissue microarrays. For each core, the IHC expression was assessed using a semi-automated platform and expressed as % of cells that stained positive for a specific marker out of the total cell count. Generalized linear regression was used to examine the associations of reproductive factors with a log-transformed expression of each marker (in epithelium and stroma), adjusted for other breast cancer risk factors.
Results: In multivariate analysis, the time between menarche and age at first birth was inversely associated with CD44 in epithelium (β per 5 years = −0.38, 95% CI −0.69; −0.06). Age at first birth and the time between menarche and age at first birth were inversely associated with ALDH1A1 (stroma: β per 5 years = −0.43, 95% CI −0.76; −0.10 and β = −0.47, 95% CI −0.79; −0.15, respectively; epithelium: β = −0.15, 95% CI −0.30; −0.01 and β = −0.17, 95% CI −0.30; −0.03, respectively). Time since last pregnancy was inversely associated with stromal ALDH1A1 (β per 5 years = −0.55, 95% CI −0.98; −0.11). No associations were found for CD24. The observed associations were similar in premenopausal women. In postmenopausal women, lifetime duration of breastfeeding was inversely associated with stromal ALDH1A1 expression (β for ≥24 vs. 0 to <1 months = −2.24, 95% CI 3.96; −0.51, p-trend = 0.01).
Conclusion: Early-life reproductive factors may influence CD44 and ALDH1A1 expression in benign breast tissue.
1 Introduction
Parity is a well-established breast cancer risk factor that has been consistently associated with reduced risk of benign breast disease (1), lower breast density (2–5), and reduced breast cancer risk (6–9). Reduction in stem cell pool is currently postulated to be one of the mechanisms underlying the protective effect of pregnancy on breast cancer risk (10). Pregnancy induces a portion of stem cells to differentiate, thus reducing the pool of transformation-susceptible cells in the breast. Stem cells are responsible for subsequent regression of the mammary gland into pre-pregnant state following post-partum involution as well as for regeneration of lactation-component ductal system during subsequent pregnancies (10). It was also suggested that stem cells are at the peak of their abundance and sensitivity to carcinogenic influences in the young mammary gland and during puberty, which is supported by epidemiologic evidence of the highest absolute risk of breast cancer in women exposed to environmental factors early in life (10).
Experimental evidence suggests that early but not late pregnancy reduces the number of mammary stem cells, which supports the evidence on the protective effect of earlier age at first birth and a shorter interval between age at menarche and first birth on breast cancer risk (10–14). Pregnancy is further believed to downregulate signaling pathways known to play a role in mammary stem cell function (10). Thus, a decrease in stem cell pool size and activity with increasing parity could potentially explain its association with breast cancer and intermediate phenotypes (7, 15). However, the evidence of these associations from epidemiological studies remains extremely limited. To fill this gap, we examined the associations of several reproductive factors (parity, age at first birth, breastfeeding, and duration of the time between menarche and first birth) with the expression of well-established stem cell markers CD44 molecule (CD44), CD24 molecule (CD24), and aldehyde dehydrogenase 1 family member A1 (ALDH1A1) in benign breast biopsy samples using prospective data in cancer-free women from the Nurses’ Health Study (NHS) and Nurses’ Health Study II (NHSII) and semi-automated computational pathology method for assessment of stem cell marker expression.
2 Materials and methods
2.1 Study population
Our analysis included cancer-free women with biopsy-confirmed benign breast disease (BBD) in the NHS and Nurses’ Health Study II (NHSII) cohorts who were previously included in a nested case–control study of breast cancer (16, 17). These prospective cohorts were registered nurses in the United States who were 30–55 years old (NHS) or 25–42 years old (NHSII) at enrollment. After administration of the initial questionnaire, the information on breast cancer risk factors [body mass index (BMI), reproductive history, postmenopausal hormone (PMH) use, and alcohol use] and any diagnoses of cancer or other diseases (including BBD) was updated through biennial questionnaires, which were then confirmed via medical record review (4, 18). Details of this nested case–control study and the BBD assessment have been previously described (16, 17).
Early NHS questionnaires (1976, 1978, and 1980) asked whether the participant had ever been diagnosed with “fibrocystic disease” or “other BBD” and whether she had been hospitalized in relation to this diagnosis. Beginning in 1982, the NHS questionnaires specifically asked about a history of biopsy-confirmed BBD (fibrocystic disease or other BBD). The initial 1989 NHS II questionnaire and all subsequent biennial questionnaires also asked participants to report any diagnosis of BBD and to indicate whether it was confirmed by biopsy or aspiration.
Cases were women with biopsy-confirmed BBD who reported a diagnosis of breast cancer during 1976–1998 for the NHS and 1989–1999 for the NHSII following their BBD diagnosis. Using incidence density sampling, four women with biopsy-confirmed BBD who were free of breast cancer at the time of the matching case’s diagnosis (controls) were matched to the respective case on the year of birth and year of benign breast biopsy (19). We attempted to obtain BBD pathology records and archived biopsy specimens for all cases and controls from their hospital pathology departments; our ability to obtain biopsy blocks did not significantly differ by case and control status. Women were excluded if they had evidence of in situ or invasive carcinoma or unknown lesion type at the time of benign breast biopsy (n = 34). All cases and controls from this nested case–control were cancer-free at the time of BBD diagnosis, with an average time of 9 years between biopsy and breast cancer diagnosis date. In the current analysis, we included 439 women who had complete data on reproductive factors and staining results for stem cell markers [92% of all women who had BBD samples (n = 476) used to construct tissue microarrays as described under tissue microarray (TMA) construction]. Women with and without BBD samples were similar with respect to the distribution of breast cancer risk factors (20).
The study protocol was approved by the institutional review boards of the Brigham and Women’s Hospital and Harvard T.H. Chan School of Public Health, and those of participating registries as required. Consent was obtained or implied by return of questionnaires.
2.2 Reproductive variables
The data on age at menarche, parity, age at first birth, and breastfeeding were available from baseline and biennial questionnaires, completed closest to the date of the biopsy. Among all eligible women with stem cell marker data (n = 450), the completeness of the data on parity was 97.6% (n = 439). Among parous women with marker readings (n = 398), information on age at first birth and breastfeeding was available for 91.5% and 97.2% of the sample, respectively. Age at menarche was available for 99.3% of the sample. For three women with missing age at menarche, a median value in the study sample was imputed, as performed in previous studies (3, 21, 22).
Age at first birth was modeled both as a categorical variable (<25 years, 25–29 years, and ≥30 years) and as a continuous variable. Parity was defined both as a binary variable (nulliparous and parous) and as categorical variable (1, 2, 3, and ≥4 children). Additionally, the number of children among parous women was modeled as a continuous variable. The lifetime duration of breastfeeding (sum of breastfeeding duration across all births) was classified as none to <1 month, 1 to <12 months, 12 to <24 months, and ≥24 months. Age at menarche was modeled both as a categorical variable (<12 years, 12 years, 13 years, and >13 years) and as a continuous variable. The time interval between menarche and first birth was modeled as continuous variable.
2.3 Benign breast biopsy confirmation and BBD subtypes
Hematoxylin and eosin (H&E) breast tissue slides were retrieved for biopsy-confirmed BBD patients who gave permission to review their biopsy records. The slides were previously independently reviewed by one of three pathologists in a blinded fashion; i.e., the evaluating pathologists were blinded to the type of BBD noted on the original diagnosis (23, 24). Any slide identified as having either questionable atypia or atypia was jointly reviewed by two pathologists (23, 24). For each set of slides, a detailed worksheet was completed, and the benign breast biopsy was classified according to the categories of Page et al. (25) as non-proliferative, proliferative without atypia, or atypical hyperplasia (ductal or lobular hyperplasia) (17).
2.4 Tissue microarray construction of BBD samples
After a centralized review of H&E-stained slides, we collected archived formalin-fixed paraffin-embedded (FFPE) benign breast biopsy blocks of participants. H&E sections of the corresponding FFPE tissue blocks were re-reviewed by a single pathologist to identify areas of benign proliferative lesions and normal terminal duct lobular units (TDLUs) and to identify the areas from which the cores for the TMAs would be taken. Normal TDLUs were regions of histopathologically normal tissue that may or may not be adjacent to benign lesions (e.g., atypical ductal hyperplasia and usual ductal hyperplasia) (17). TMAs were constructed at the Dana Farber/Harvard Cancer Center (DF/HCC) Tissue Microarray Core Facility by obtaining 0.6-mm cores from benign lesions and TDLUs. For each woman, up to three cores of normal TDLU were included in the TMA blocks. We previously evaluated our TMA construction methods and confirmed a high success rate (76%) of capturing normal TDLUs in these TMA blocks (26).
2.5 Immunohistochemistry for stem cell markers
The expression of the stem cell markers was evaluated using an automated immunohistochemistry (IHC) technique that allows the quantification of markers’ expression levels and localization of the target signal to specific cells/structures. For each of the three markers, one 5-μm paraffin section was cut from a single TMA block and then stained with antibodies for CD44, CD24, and ALDH1A1 at the University of Florida Pathology Core Lab on DAKO AutostainerPlus according to the previously standardized protocol with commercial antibodies [DAKO AutostainerPlus, CD44 (DAKO, Glostrup, Denmark) 1:25 dilution; CD24 (Invitrogen, Carlsbad, CA, USA) 1:200 dilution, and ALDH1A1 (Abcam, Cambridge, UK) 1:300 dilution]. Details of this protocol have been described previously (27–29). Briefly, slides were de-paraffinized with xylene and re-hydrated through decreasing concentrations of ethanol to water, including an intermediate step to quench endogenous peroxidase activity (3% hydrogen peroxide in methanol) and transferred to 1× Tris-buffered saline–Tween (TBS-T). For heat-induced antigen retrieval, sections were heated in a steamer while submerged in Citra (BioGenex, Fremont, CA, USA) or Trilogy (Cell Marque, Rocklin, CA, USA) for 30 minutes. Next, slides were 1) rinsed in 1× TBS-T and incubated with a universal protein blocker Sniper (Biocare Medical, Walnut Creek, CA, USA) for 10 minutes (for CD44 and ALDH1A1) or 15 minutes (for CD24); 2) rinsed in 1× TBS-T and co-incubated in primary antibody ALDH1A1, CD24, or CD44 for 1 hour; and 3) rinsed in 1× TBS-T followed by application of conjugated secondary antibody [Mach 2 goat anti-rabbit horse (or mouse) radish peroxidase-conjugated, Biocare Medical, Walnut Creek, CA, USA] for 30 minutes. Detection of antibodies was achieved by incubating slides in 3′3’-diaminobenzidine (Vector Laboratories Inc., Burlingame, CA, USA) for 4 minutes. Slides were counterstained with hematoxylin (Biocare Medical, Walnut Creek, CA) 1:10 for 3 minutes and mounted with Cytoseal XYL (Richard-Allan Scientific, Kalamazoo, MI, USA). The laboratory implemented standard quality control procedures.
2.6 Image analysis
Immunoreactivity was quantified using a semi-automated image analysis system, Definiens Tissue Studio software (Munich, Germany), which quantifies tissue marker expression within the context of tissue architecture. For each core, the extent of each marker expression was assessed on a continuous scale as the percent of cells that stain positively (across all intensities) for a specific marker out of the total cell count, separately for epithelium and stroma. Briefly, TMA slides were digitalized at 20× into whole slide images using the Pannoramic Scan 150P (3DHistech, Budapest, Hungary). For each marker, the images were imported into Definiens, and an experienced operator randomly selected a representative TMA as the training TMA (30, 31). On the training TMA, the operator selected 12 training cores that were assessed as >0 to <1 (n = 3), 1 to 10 (n = 3), >10 to 50 (n = 3), and >50% (n = 3) by the pathologist to optimize a Definiens algorithm for automated IHC assessment. Definiens only allows a maximum of 12 cores for algorithm training. The minimum positive IHC staining threshold in Definiens was set using the pathologist’s manual reads as a reference. The optimized Definiens algorithm segmented each tissue core into epithelium, stroma, fat, and background; detected the number of cells; and quantitated the IHC stains. The current analysis was specifically focused on the expression of stem cell markers in normal TDLU cores for the following reasons: 1) we specifically targeted normal TDLUs in the construction of these TMAs within NHS/NHSII, and thus, the number of women with benign lesion cores was smaller and would not allow to draw meaningful conclusions; 2) in our earlier reliability study, we observed higher heterogeneity within benign lesion cores, as they were represented by various lesion types (29); and 3) we were interested in the underlying changes in the breast tissue happening early in the process of breast carcinogenesis, and thus, normal TDLUs were more relevant to address our research questions. Staining results for stroma were available for 423, 434, and 408 women for CD44, CD24, and ALDH1A1, respectively, and the staining results for epithelium were available for 396, 409, and 393 women, respectively.
2.7 Covariate information
Information on breast cancer risk factors was obtained from the biennial questionnaires closest to the date of the biopsy. Women were considered to be postmenopausal if they reported 1) no menstrual periods within the 12 months before biopsy with natural menopause, 2) bilateral oophorectomy, or 3) hysterectomy with one or both ovaries retained and were 54 years or older for ever-smokers or 56 years or older for never smokers (32, 33).
2.8 Statistical analysis
We modeled marker expression (weighted average across available cores for a woman) both as log-transformed to improve normality of residuals and as dichotomous using 10% cut-offs based on the results of our prior reliability study and distribution in our sample (29).
First, we used multivariate linear regression to examine the associations of parity, age at menarche, age at first birth, breastfeeding, time since last birth and interval between menarche and first birth with continuous expression of each of the markers, adjusted for the following covariates: age (continuous), BMI (continuous), a family history of breast cancer (yes/no), menopausal status/postmenopausal hormone use (premenopausal, postmenopausal/no hormones, postmenopausal/past hormones, postmenopausal/current hormones, and postmenopausal/unknown hormone use status), NHS cohort (NHSI and NHSII), benign breast disease subtype (non-proliferative, proliferative without atypia, and proliferative with atypia), and alcohol use (none, >0 to <5 g/day, and ≥5 g/day). Additionally, in the analysis of the association of breastfeeding, the estimates were adjusted for parity and age at first birth. In the analysis of the associations of parity and age at first birth, the risk estimates were mutually adjusted for these two variables. In the analysis of the interval between menarche and first birth, the estimates were adjusted for parity.
The analyses of all reproductive variables except nulliparity and age at menarche were limited to parous women only. Parity, age at first birth, and age at menarche were modeled as both continuous and categorical, and breastfeeding was modeled as categorical. The lowest category for parity (one child), age at first birth (<25 years), and breastfeeding (0 to <1 month) were used as the reference. To assess the overall trend for each of the categorical reproductive variables, we used respective medians within each category. The duration of the interval between menarche and first birth and the time since the last birth were modeled as continuous variables. Next, we used logistic regression to examine the associations of reproductive factors with dichotomized marker expression while using the same adjustment approaches. Next, among parous women, we also examined the associations in mutually adjusted models that included all reproductive factors except the time between age at menarche and first birth. Finally, in exploratory analyses, we also examined the associations by menopausal status. The results of this stratified analysis, however, should be interpreted with caution due to the small number of observations in some strata. The analyses were performed using SAS software (version 9.4, SAS Institute, Cary, NC, USA).
3 Results
Among 439 cancer-free women in this study, 128 (29.2%) had non-proliferative disease, 241 (54.9%) had proliferative disease without atypia, and 70 (16.0%) had atypical hyperplasia, similar to the previously reported distributions (24). The average age at the biopsy was 45 years (range 17–67 years). The sample included predominantly premenopausal women at the biopsy (68.6%). The majority of women were parous (90.7%), and of those, the majority had at least two children (89.9%) and breastfed for at least 1 month (63.1%). The average age at first birth was 24.9 years (range 15–39 years). Table 1 presents the age-adjusted characteristics of women in the study by nulliparous status. The distribution of markers’ expression by BBD subtype is presented in Supplementary Table 1.
Table 1 Age-adjusted characteristics of women at the time of the biopsy, stratified by parous status.
Age and BMI-adjusted associations of reproductive factors with stem cell markers are presented in Supplementary Table 2. In multivariate analysis (Table 2), nulliparous status and, among parous women, number of children, age at first birth, and duration of breastfeeding were not associated with CD44 expression in epithelium or stroma. Younger age at menarche was marginally associated with lower CD44 expression in stroma and epithelium (p-trend = 0.12 and 0.08, respectively), with significant estimates found for age at menarche 12 vs. >13 years (β = −1.56, 95% CI −2.82; −0.31 and β = −0.91, 95% CI −1.58; −0.25, respectively). The time between menarche and age at first birth was inversely associated with CD44 in epithelium (β per 5-year increase = −0.38, 95% CI −0.69; −0.06). Age at first birth and the time between menarche and age at first birth were inversely associated with ALDH1A1 (stroma: β per 5-year increase = −0.43, 95% CI −0.76; −0.10 and β = −0.47, 95% CI −0.79; −0.15, respectively; epithelium: β = −0.15, 95% CI −0.30; −0.01 and β = −0.17, 95% CI −0.30; −0.03, respectively). Associations of the time between menarche and age at first birth with CD44 and ALDH1A1 were similar when the models were additionally adjusted for age at menarche (data not shown). The time since the last pregnancy was inversely associated with stromal ALDH1A1 (β per 5 years = −0.55, 95% CI −0.98; −0.11). We found no associations of any of the reproductive factors with continuous CD24 expression. Finally, associations of age at menarche with CD44 expression in stroma and epithelium became more pronounced when the models were mutually adjusted for all reproductive factors among parous women (p-trend = 0.01 for both stroma and epithelium) (Supplementary Table 3). Associations of age at first birth with CD44 in epithelium and ALDH1A1 in stroma as well as associations of age at menarche with ALDH1A1 in epithelium were similar in mutually adjusted models (Supplementary Table 3).
Table 2 Associations of reproductive factors with log-transformed expression of stem cell markers in benign breast biopsy samples (β coefficients and 95% confidence intervals).
When marker expression was modeled as dichotomous, the time since last birth was positively associated with stromal CD24 expression (OR per 5 years = 1.85, 95% CI 1.05; 3.26) and inversely associated with stromal ALDH1A1 expression (OR per 5 years = 0.60, 95% CI 0.38; 0.94) (Supplementary Table 4). There was a suggestive trend of an inverse association between longer duration of breastfeeding and dichotomous stromal CD24 expression (p-trend = 0.03) (Supplementary Table 4), which, however, did not have a clear pattern.
The associations with CD44 were similar in premenopausal women (Supplementary Table 5), while in postmenopausal women, we observed positive associations of age at menarche with CD44 expression in stroma and epithelium (Supplementary Table 6). The significant trend for associations for categorical parity with CD44 in stroma and epithelium in postmenopausal women did not have a clear pattern and should be interpreted with caution given small numbers in some strata. For CD24, overall patterns were similar in premenopausal women. Additionally, we observed a positive association of age at first birth with epithelial CD24 expression (β per 5 years = 0.31, 95% CI 0.04; 0.57) and an inverse association of age at menarche with stromal CD24 expression (β per 5 years = −1.43, 95% CI −2.59; −0.27). In postmenopausal women, age at menarche was positively associated with epithelial CD24 expression (β per 5 years = 1.18, 95% CI 0.14; 2.22). Similar to overall results, in premenopausal women, age at first birth and the interval between age at menarche and age at first birth were both inversely associated with stromal (β per 5 years = −0.54, 95% CI −0.95; −0.13 and β = −0.59, 95% CI −0.98; −0.21, respectively) and epithelial (β per 5 years = −0.25, 95% CI −0.41; −0.08 and β = −0.23, 95% CI −0.39; −0.07, respectively) ALDH1A1 expression. In premenopausal women, the time since last birth was inversely associated with stromal ALDH1A1 (β per 5 years = −0.60, 95% CI −1.17; −0.03). These associations were not observed in postmenopausal women, but the time since last birth was inversely associated with epithelial ALDH1A1 expression (β per 5 years = −0.42, 95% CI −0.70; −0.14). In postmenopausal women, lifetime duration of breastfeeding appeared to be inversely associated with stromal expression of ALDH1A1 (β for ≥24 vs. 0 to <1 months = −2.24, 95% CI 3.96; −0.51, p-trend = 0.01).
4 Discussion
Among 439 cancer-free women in our study, we found inverse associations of age at first birth and the time between menarche and first birth with expression of CD44 and ALDH1A1 and positive, though marginally significant, associations of age at menarche with CD44 expression. The lifetime duration of breastfeeding was inversely associated with stromal ALDH1A1 expression in postmenopausal women. No associations were observed for nulliparous status or number of children.
It has been suggested that breast tissue changes during pregnancy may explain its long-term protective effect on the risk of breast cancer (34). Full-term pregnancy influences breast tissue in a variety of mechanisms, including hormonal signaling changes, alterations in gene methylation and expression, long-term reduction in the circulating hormone levels, and life-long decrease in the number of stem cells (7, 14, 35–39). Further, it was suggested that terminal differentiation of mammary stem cells during breastfeeding reduces the pool of breast stem cells at risk of mutation, which could explain the subsequent long-term reduction in the risk of breast cancer (40). However, the evidence from epidemiologic studies with cancer-free women remains very limited. An earlier study that investigated possible associations between the size of stem/progenitor cell population and parity by examining the percentage of ALDH1A1+ cells in breast epithelium found no significant differences in nulliparous vs. parous women (41). In our study, we did not find significant associations of parous status or the number of children with the expression of any of the three markers, though nulliparous women had suggestive evidence of greater expression (with exception of stromal CD24) of these markers. Another investigation utilizing normal adjacent tissue from women with triple-negative breast cancer found that women with a greater presence of stem cells were less likely to have breastfed or had a shorter duration of breastfeeding (42). In our study, breastfeeding duration appeared to be inversely associated with the expression of stromal ALDH1A1 among postmenopausal women. However, these results need further confirmation due to the small number of women in some of the strata. Finally, we observed an inverse association of the time since last birth with the expression of stromal ALDH1A1, overall and in premenopausal women, and an inverse association with epithelial ALDH1A1 in postmenopausal women. Whether this potential decrease in stem cell marker expression could explain the long-term reduction in breast cancer risk in parous women is yet to be determined.
It was suggested that earlier menarche and thus earlier exposure to estrogens may alter stem cell functions in the breast tissue (43). As pregnancy induces cellular differentiation, earlier age at first birth and a shorter period between age at menarche and first birth both reduce breast cancer risk by shortening the period when breast tissue, including stem cells, remains highly susceptible to various potentially carcinogenic insults (13, 44). Contrary to these mechanisms, in our study, we found positive associations of age at menarche with expression of CD44 in stroma and epithelium and inverse associations of age at first birth and the interval between menarche and age at first birth with CD44 and ALDH1A1 expression.
Our study is the first to date to explore associations of several reproductive variables with the expression of breast stem cell markers CD44, CD24, and ALDH1A1 in cancer-free women. We utilized data from the Nurses’ Health Study and Nurses’ Health Study II, established cohorts with more than 30 years of follow-up, with confirmation of benign breast disease status, and comprehensive breast cancer risk factor information. The following few limitations should be noted. Even though the data in these cohorts have been collected prospectively, measurement error for some of the reproductive variables particularly in postmenopausal women cannot be ruled out completely. For example, in the previous studies, the findings on the accuracy of recall for age at menarche have been conflicting (45–47), which could potentially influence our findings for the interval between menarche and first birth. Next, we recognize that biopsy samples come from a specific area of the breast. In our previous studies, we demonstrated that this tissue sampling method still provides strong evidence for formulating a priori hypotheses and meaningful findings for involution of breast tissue (48), identification of breast cancer risk markers (17, 49, 50), and associations with various breast cancer risk factors, suggesting only a minimal impact of this limitation on research findings (51). Only cancer-free women with routine clinical biopsy with BBD diagnosis are included in our study; however, since our analysis is focused on normal TDLUs, the findings are likely to be generalizable to all cancer-free women and not necessarily limited only to women with BBD diagnosis. Even though the positivity of the staining was calculated taking into account any staining intensity, the associations were not examined separately for each intensity level. We do not focus specifically on the intensity, as there are accumulating data indicating that staining intensity can be affected by both storage time and variability in processing (52). Because our samples were collected from across the United States over a large period of time, we do not feel that staining intensity would be a reliable measure (17). Finally, the results of the stratified analysis by menopausal status should be interpreted with caution given the small number of postmenopausal women in our sample.
In conclusion, we examined the associations of several reproductive variables with the expression of stem cell markers CD44, CD24, and ALDH1A1 in cancer-free women. Our findings suggest positive associations of age at menarche with CD44 expression and inverse associations of age at first birth and the time between menarche and first birth with expression of CD44 and ALDH1A1 as well as inverse associations of lifetime duration of breastfeeding with stromal ALDH1A1 expression in postmenopausal women. Future studies are needed to confirm these findings and to explain the potential biological mechanisms.
Data availability statement
The data that support the findings of this study are available from the Nurses’ Health Studies, however they are not publicly available. Investigators interested in using the data can request access, and feasibility will be discussed at an investigators’ meeting. Limits are not placed on scientific questions or methods, and there is no requirement for co-authorship. Additional data sharing information and policy details can be accessed at http://www.nurseshealthstudy.org/researchers.
Ethics statement
The studies involving humans were approved by the institutional review boards of the Brigham and Women’s Hospital and Harvard T.H. Chan School of Public Health, and those of participating registries as required. Written informed consent was obtained or implied by return of questionnaires. The studies were conducted in accordance with the local legislation and institutional requirements.
Author contributions
LY: Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Writing – original draft, Writing – review & editing. YH: Investigation, Methodology, Writing – review & editing. GB: Data curation, Writing – review & editing. VB-M: Investigation, Writing – review & editing. DM: Data curation, Writing – review & editing. MM: Project administration, Resources, Writing – review & editing. BR: Methodology, Writing – review & editing. RT: Conceptualization, Funding acquisition, Investigation, Methodology, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Cancer Institute at the National Institutes of Health [CA240341 to LY, CA131332, CA175080, P01 CA087969 to RT, UM1 CA186107 and, to MS, U01 CA176726 to WW], Avon Foundation for Women, Susan G. Komen for the Cure®, and Breast Cancer Research Foundation.
Acknowledgments
The authors would like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention’s National Program of Cancer Registries (NPCR) and/or the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) Program. Central registries may also be supported by state agencies, universities, and cancer centers. Participating central cancer registries include the following: Alabama, Alaska, Arizona, Arkansas, California, Colorado, Connecticut, Delaware, Florida, Georgia, Hawaii, Idaho, Indiana, Iowa, Kentucky, Louisiana, Massachusetts, Maine, Maryland, Michigan, Mississippi, Montana, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, New York, North Carolina, North Dakota, Ohio, Oklahoma, Oregon, Pennsylvania, Puerto Rico, Rhode Island, Seattle SEER Registry, South Carolina, Tennessee, Texas, Utah, Virginia, West Virginia, and Wyoming.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1354094/full#supplementary-material
References
1. Minami Y, Ohuchi N, Taeda Y, Fukao A, Hisamichi S. Risk factors for benign breast disease according to histopathological type: comparisons with risk factors for breast cancer. Jpn J Cancer Res. (1998) 89:116–23. doi: 10.1111/j.1349-7006.1998.tb00538.x
2. Modugno F, Ngo DL, Allen GO, Kuller LH, Ness RB, Vogel VG, et al. Breast cancer risk factors and mammographic breast density in women over age 70. Breast Cancer Res Treat. (2006) 97:157–66. doi: 10.1007/s10549-005-9105-8
3. Yaghjyan L, Mahoney MC, Succop P, Wones R, Buckholz J, Pinney SM. Relationship between breast cancer risk factors and mammographic breast density in the Fernald Community Cohort. Br J Cancer. (2012) 106:996–1003. doi: 10.1038/bjc.2012.1
4. Vachon CM, Kuni CC, Anderson K, Anderson VE, Sellers TA. Association of mammographically defined percent breast density with epidemiologic risk factors for breast cancer (United States). Cancer Causes Control. (2000) 11:653–62. doi: 10.1023/A:1008926607428
5. Yaghjyan L, Colditz GA, Rosner B, Bertrand KA, Tamimi RM. Reproductive factors related to childbearing and mammographic breast density. Breast Cancer Res Treat. (2016) 158:351–9. doi: 10.1007/s10549-016-3884-y
6. Ramon JM, Escriba JM, Casas I, Benet J, Iglesias C, Gavalda L, et al. Age at first full-term pregnancy, lactation and parity and risk of breast cancer: a case-control study in Spain. Eur J Epidemiol. (1996) 12:449–53. doi: 10.1007/BF00143995
7. Britt K, Ashworth A, Smalley M. Pregnancy and the risk of breast cancer. Endocr Relat Cancer. (2007) 14:907–33. doi: 10.1677/ERC-07-0137
8. Butt S, Borgquist S, Anagnostaki L, Landberg G, Manjer J. Parity and age at first childbirth in relation to the risk of different breast cancer subgroups. Int J Cancer. (2009) 125:1926–34. doi: 10.1002/ijc.24494
9. Woolcott CG, Koga K, Conroy SM, Byrne C, Nagata C, Ursin G, et al. Mammographic density, parity and age at first birth, and risk of breast cancer: an analysis of four case-control studies. Breast Cancer Res Treat. (2012) 132:1163–71. doi: 10.1007/s10549-011-1929-9
10. Dall G, Risbridger G, Britt K. Mammary stem cells and parity-induced breast cancer protection- new insights. J Steroid Biochem Mol Biol. (2017) 170:54–60. doi: 10.1016/j.jsbmb.2016.02.018
11. Pike MC, Krailo MD, Henderson BE, Casagrande JT, Hoel DG. ‘Hormonal’ risk factors, ‘breast tissue age’ and the age-incidence of breast cancer. Nature. (1983) 303:767–70. doi: 10.1038/303767a0
12. Rosner B, Colditz GA. Nurses’ health study: log-incidence mathematical model of breast cancer incidence. J Natl Cancer Inst. (1996) 88:359–64. doi: 10.1093/jnci/88.6.359
13. Li CI, Malone KE, Daling JR, Potter JD, Bernstein L, Marchbanks PA, et al. Timing of menarche and first full-term birth in relation to breast cancer risk. Am J Epidemiol. (2008) 167:230–9. doi: 10.1093/aje/kwm271
14. Ritte R, Tikk K, Lukanova A, Tjønneland A, Olsen A, Overvad K, et al. Reproductive factors and risk of hormone receptor positive and negative breast cancer: a cohort study. BMC Cancer. (2013) 13:584–4. doi: 10.1186/1471-2407-13-584
15. Isfoss B, Holmqvist B, Jernström H, Alm P, Olsson H. Women with familial risk for breast cancer have an increased frequency of aldehyde dehydrogenase expressing cells in breast ductules. BMC Clin Pathol. (2013) 13:1–9. doi: 10.1186/1472-6890-13-28
16. Collins LC, Baer HJ, Tamimi RM, Connolly JL, Colditz GA, Schnitt SJ. The influence of family history on breast cancer risk in women with biopsy-confirmed benign breast disease: results from the Nurses’ Health Study. Cancer. (2006) 107:1240–7. doi: 10.1002/cncr.22136
17. Tamimi RM, Colditz GA, Wang Y, Collins LC, Hu R, Rosner B, et al. Expression of IGF1R in normal breast tissue and subsequent risk of breast cancer. Breast Cancer Res Treat. (2011) 128:243–50. doi: 10.1007/s10549-010-1313-1
18. Tamimi RM, Byrne C, Colditz GA, Hankinson SE. Endogenous hormone levels, mammographic density, and subsequent risk of breast cancer in postmenopausal women. J Natl Cancer Inst. (2007) 99:1178–87. doi: 10.1093/jnci/djm062
19. Tamimi RM, Rosner B, Colditz GA. Evaluation of a breast cancer risk prediction model expanded to include category of prior benign breast disease lesion. Cancer. (2010) 116:4944–53. doi: 10.1002/cncr.25386
20. Kensler KH, Beca F, Baker GM, Heng YJ, Beck AH, Schnitt SJ, et al. Androgen receptor expression in normal breast tissue and subsequent breast cancer risk. NPJ Breast Cancer. (2018) 4:33. doi: 10.1038/s41523-018-0085-3
21. Yaghjyan L, Colditz GA, Collins LC, Schnitt SJ, Rosner B, Vachon C, et al. Mammographic breast density and subsequent risk of breast cancer in postmenopausal women according to tumor characteristics. J Natl Cancer Inst. (2011) 103:1179–89. doi: 10.1093/jnci/djr225
22. Moran O, Eisen A, Demsky R, Blackmore K, Knight JA, Panchal S, et al. Predictors of mammographic density among women with a strong family history of breast cancer. BMC Cancer. (2019) 19:631. doi: 10.1186/s12885-019-5855-2
23. Jacobs TW, Byrne C, Colditz G, Connolly JL, Schnitt SJ. Pathologic features of breast cancers in women with previous benign breast disease. Am J Clin Pathol. (2001) 115:362–9. doi: 10.1309/UP07-K3KD-25NL-D3M8
24. Tamimi RM, Byrne C, Baer HJ, Rosner B, Schnitt SJ, Connolly JL, et al. Benign breast disease, recent alcohol consumption, and risk of breast cancer: a nested case–control study. Breast Cancer Res. (2005) 7:R555. doi: 10.1186/bcr1039
25. Page DL, Dupont WD, Rogers LW, Rados MS. Atypical hyperplastic lesions of the female breast. A long-term follow-up study. Cancer. (1985) 55:2698–708. doi: 10.1002/(ISSN)1097-0142
26. Collins LC, Wang YH, Connolly JL, Baer HJ, Hu R, Schnitt SJ, et al. Potential role of tissue microarrays for the study of biomarker expression in benign breast disease and normal breast tissue. Appl Immunohistochem Mol Morphol. (2009) 17:438–41. doi: 10.1097/PAI.0b013e3181993d86
27. Yaghjyan L, Stoll E, Ghosh K, Scott CG, Jensen MR, Brandt KR, et al. Tissue-based associations of mammographic breast density with breast stem cell markers. Breast Cancer Res: BCR. (2017) 19:100–0. doi: 10.1186/s13058-017-0889-3
28. Yaghjyan L, Esnakula AK, Scott CG, Wijayabahu AT, Jensen MR, Vachon CM. Associations of mammographic breast density with breast stem cell marker-defined breast cancer subtypes. Cancer Causes Control. (2019) 30:1103–11. doi: 10.1007/s10552-019-01207-w
29. Yaghjyan L, Heng YJ, Baker GM, Bret-Mounet V, Murthy D, Mahoney MB, et al. Reliability of CD44, CD24, and ALDH1A1 immunohistochemical staining: Pathologist assessment compared to quantitative image analysis. Front Med. (2022) 9. doi: 10.3389/fmed.2022.1040061
30. Roberts MR, Baker GM, Heng YJ, Pyle ME, Astone K, Rosner BA, et al. Reliability of a computational platform as a surrogate for manually interpreted immunohistochemical markers in breast tumor tissue microarrays. Cancer Epidemiol. (2021) 74:101999. doi: 10.1016/j.canep.2021.101999
31. Baker GM, Bret-Mounet VC, Wang T, Veta M, Zheng H, Collins LC, et al. Immunohistochemistry scoring of breast tumor tissue microarrays: A comparison study across three software applications. J Pathol Inform. (2022) 13:100118. doi: 10.1016/j.jpi.2022.100118
32. Willett W, Stampfer MJ, Bain C, Lipnick R, Speizer FE, Rosner B, et al. Cigarette smoking, relative weight, and menopause. Am J Epidemiol. (1983) 117:651–8. doi: 10.1093/oxfordjournals.aje.a113598
33. Stampfer MJ, Willett WC, Colditz GA, Rosner B, Speizer FE, Hennekens CH. A prospective study of postmenopausal estrogen therapy and coronary heart disease. New Engl J Med. (1985) 313:1044–9. doi: 10.1056/NEJM198510243131703
34. Meier-Abt F, Bentires-Alj M, Rochlitz C. Breast cancer prevention: lessons to be learned from mechanisms of early pregnancy–mediated breast cancer protection. Cancer Res. (2015) 75:803–7. doi: 10.1158/0008-5472.CAN-14-2717
35. Belitskaya-Lévy I, Zeleniuch-Jacquotte A, Russo J, Russo IH, Bordás P, Åhman J, et al. Characterization of a genomic signature of pregnancy identified in the breast. Cancer Prev Res. (2011) 4:1457–64. doi: 10.1158/1940-6207.CAPR-11-0021
36. Peri S, De Cicco RL, Santucci-Pereira J, Slifker M, Ross EA, Russo IH, et al. Defining the genomic signature of the parous breast. BMC Med Genomics. (2012) 5:46. doi: 10.1186/1755-8794-5-46
37. Ghosh S, Gu F, Wang C-M, Lin C-L, Liu J, Wang H, et al. Genome-wide DNA methylation profiling reveals parity-associated hypermethylation of FOXA1. Breast Cancer Res Treat. (2014) 147:653–9. doi: 10.1007/s10549-014-3132-2
38. Santucci-Pereira J, Zeleniuch-Jacquotte A, Afanasyeva Y, Zhong H, Slifker M, Peri S, et al. Genomic signature of parity in the breast of premenopausal women. Breast Cancer Res. (2019) 21:46. doi: 10.1186/s13058-019-1128-x
39. Murrow LM, Weber RJ, Caruso J, Mcginnis CS, Gascard P, Borowsky AD, et al. Pregnancy and obesity modify the epithelial composition and hormone signaling state of the human breast. bioRxiv. (2020) 430611. doi: 10.1101/430611
40. Kotsopoulos J, Lubinski J, Salmena L, Lynch HT, Kim-Sing C, Foulkes WD, et al. Breastfeeding and the risk of breast cancer in BRCA1 and BRCA2 mutation carriers. Breast Cancer Res. (2012) 14:R42. doi: 10.1186/bcr3138
41. Honeth G, Schiavinotto T, Vaggi F, Marlow R, Kanno T, Shinomiya I, et al. Models of breast morphogenesis based on localization of stem cells in the developing mammary lobule. Stem Cell Rep. (2015) 4:699–711. doi: 10.1016/j.stemcr.2015.02.013
42. Atkinson RL, Yang WT, Rosen DG, Landis MD, Wong H, Lewis MT, et al. Cancer stem cell markers are enriched in normal tissue adjacent to triple negative breast cancer and inversely correlated with DNA repair deficiency. Breast Cancer Res. (2013) 15:R77. doi: 10.1186/bcr3471
43. Fuhrman BJ, Moore SC, Byrne C, Makhoul I, Kitahara CM, Berrington De González A, et al. Association of the age at menarche with site-specific cancer risks in pooled data from nine cohorts. Cancer Res. (2021) 81:2246–55. doi: 10.1158/0008-5472.CAN-19-3093
44. Biro FM, Deardorff J. Identifying opportunities for cancer prevention during preadolescence and adolescence: puberty as a window of susceptibility. J Adolesc Health. (2013) 52:S15–20. doi: 10.1016/j.jadohealth.2012.09.019
45. Koprowski C, Coates RJ, Bernstein L. Ability of young women to recall past body size and age at menarche. Obes Res. (2001) 9:478–85. doi: 10.1038/oby.2001.62
46. Must A, Phillips SM, Naumova EN, Blum M, Harris S, Dawson-Hughes B, et al. Recall of early menstrual history and menarcheal body size: after 30 years, how well do women remember? Am J Epidemiol. (2002) 155:672–9. doi: 10.1093/aje/155.7.672
47. Cooper R, Blell M, Hardy R, Black S, Pollard TM, Wadsworth MEJ, et al. Validity of age at menarche self-reported in adulthood. J Epidemiol Community Health. (2006) 60:993–7. doi: 10.1136/jech.2005.043182
48. Rice MS, Tamimi RM, Connolly JL, Collins LC, Shen D, Pollak MN, et al. Insulin-like growth factor-1, insulin-like growth factor binding protein-3 and lobule type in the Nurses’ Health Study II. Breast Cancer Res. (2012) 14:1–7. doi: 10.1186/bcr3141
49. Huh SJ, Oh H, Peterson MA, Almendro V, Hu R, Bowden M, et al. The proliferative activity of mammary epithelial cells in normal tissue predicts breast cancer risk in premenopausal women. Cancer Res. (2016) 76:1926–34. doi: 10.1158/0008-5472.CAN-15-1927
50. Oh H, Eliassen AH, Wang M, Smith-Warner SA, Beck AH, Schnitt SJ, et al. Expression of estrogen receptor, progesterone receptor, and Ki67 in normal breast tissue in relation to subsequent risk of breast cancer. NPJ Breast Cancer. (2016) 2:16032–. doi: 10.1038/npjbcancer.2016.32
51. Oh H, Eliassen AH, Beck AH, Rosner B, Schnitt SJ, Collins LC, et al. Breast cancer risk factors in relation to estrogen receptor, progesterone receptor, insulin-like growth factor-1 receptor, and Ki67 expression in normal breast tissue. NPJ Breast Cancer. (2017) 3:39. doi: 10.1038/s41523-017-0041-7
52. Atkins D, Reiffen K-A, Tegtmeier CL, Winther H, Bonato MS, Störkel S. Immunohistochemical detection of EGFR in paraffin-embedded tumor tissues: variation in staining intensity due to choice of fixative and storage time of tissue sections. J Histochem Cytochem. (2004) 52:893–901. doi: 10.1369/jhc.3A6195.2004
Keywords: benign breast disease, parity, breastfeeding, age at first child, breast cancer risk
Citation: Yaghjyan L, Heng YJ, Baker GM, Bret-Mounet VC, Murthy D, Mahoney MB, Rosner B and Tamimi RM (2024) Associations of reproductive breast cancer risk factors with expression of stem cell markers in benign breast tissue. Front. Oncol. 14:1354094. doi: 10.3389/fonc.2024.1354094
Received: 11 December 2023; Accepted: 20 February 2024;
Published: 21 March 2024.
Edited by:
Orazio Fortunato, Fondazione IRCCS Istituto Nazionale dei Tumori, ItalyReviewed by:
Gary Zirpoli, Boston University, United StatesAlisson Clemenceau, Yale University, United States
Copyright © 2024 Yaghjyan, Heng, Baker, Bret-Mounet, Murthy, Mahoney, Rosner and Tamimi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lusine Yaghjyan, bHlhZ2hqeWFuQHVmbC5lZHU=