Introduction

The majority of benign breast epithelial cells among young women are present in structures termed terminal duct lobular units, which lose epithelial content and complexity with advancing age and menopause (i.e. involution). Several studies have demonstrated that histology of benign breast epithelium is associated with breast cancer risk, with more complete epithelial involution conferring reduced risk of breast cancer [1,2,3,4]. It has been hypothesized that some well-established breast cancer risk factors (most notably mammographic density, parity, and menopause) are associated with age-related involution of epithelium, with consequences for etiology or risk [2, 5,6,7,8,9,10]. Thus, identifying the best method for quantifying involution has potential etiologic, and translational importance.

However, research on benign histology as a biomarker of breast cancer risk has been constrained by availability of biospecimens [9, 11] and uncertainty as to whether a single breast biospecimen represents a woman’s involution status. Several studies have evaluated multiple specimens from a single patient and reported high within-person agreement for measures of breast involution [1, 5, 6, 12], while others have identified variability within an individual [13, 14]. Research is needed to systematically evaluate intraindividual variability in epithelial involution status. We propose that epithelial composition may be heterogeneous throughout the breast, with higher levels in non-fatty, mammographically dense tissue, which is the background in which most breast cancers develop [15].

In the current analysis, we used a digital histology approach to measure stroma and epithelial characteristics in the same specimens. We measured percent area of stroma and epithelium, as well as epithelial nuclear density as a measure of epithelial involution status [16]. Our goals were to understand intraindividual variability, to assess the relationship between stromal content and epithelial involution measures, and to assess risk factor-histology associations after considering multiple samples. Over 400 women are included in the current analysis, which applied novel digital imaging algorithms to quantify epithelial and stromal histological features on approximately 1800 distinct hematoxylin and eosin (H&E)-stained slides. Some findings were independently validated in a separate data set from the Mayo Clinic, with local mammographic imaging data.

Materials and methods

Study population

The Normal Breast Study is a study of breast cancer microenvironment and normal breast tissue conducted at the University of North Carolina (UNC) Hospitals in Chapel Hill, NC. As described previously [16], English-speaking women ≥18 years of age undergoing breast surgery at UNC Hospitals (mastectomy, lumpectomy, excisional biopsy, reduction mammoplasty, or other cosmetic breast surgery) between October 2009 and April 2013 were contacted for participation by study personnel during a pre-surgery appointment with their surgeon. The final Normal Breast Study population included 399 women with breast cancer and 75 women with benign breast histology, for a total of 474 participants. Written informed consent was obtained for all enrolled participants, and all study protocols were approved by the UNC School of Medicine’s Institutional Review Board.

Participants completed a telephone interview to provide demographic, lifestyle, and risk factor exposure data, and medical records abstraction was performed to obtain patients’ medical history, mammographic screening, surgical details, and tumor pathology data. Age and body mass index (<25.0, 25.0–29.9, ≥30.0 kg/m2) were defined at time of surgery using medical records. Mammographic density was abstracted from medical records and defined from mammograms taken at time of breast cancer diagnosis (breast cancer patients) or the most recent mammogram prior to surgery (patients with benign breast histology) using standard Breast Imaging Reporting and Data System (BI-RADS) scores, 4th edition (1—Fatty, 2—Scattered fibroglandular densities, 3—Heterogeneously dense, and 4—Extremely dense) [17] assigned by a radiologist. For a small subset of patients (N = 33), mammographic density was defined as “dense, not otherwise specified”, as the radiologist reported breast density as “dense” without distinction between BI-RADS category 3 or 4. Pathologic data (including diagnosis status, sampled breast quadrant, and surgery type) were obtained from pathology reports at time of surgery. All other covariates were self-reported by patients completing the telephone interview and defined as follows: race (White, African American, Other), age at menarche (<13, ≥13 years), parity (nulliparous, 1–2 births, ≥3 births), age at first term pregnancy (<26, ≥26 years), time since last birth (<10, 10–19, ≥20 years), history of breastfeeding (never, ever), breastfeeding duration (<3, ≥3 months), oral contraceptive use (never, ever), oral contraceptive duration (never, ≤10, >10 years), oral contraceptive recency (never, ≤5, >5 years), and menopausal hormone therapy recency (never, ≤5, >5 years). Menopausal status was defined as pre- vs. postmenopausal: women with a history of bilateral oophorectomy, self-reported natural menopause, or over age 54 years (among ever smokers) or age 56 years (among non-smokers) were classified as postmenopausal, while women reporting pre- or perimenopausal status, having at least one intact ovary, and under age 45 years (among ever smokers) or age 47 years (among non-smokers) were classified as premenopausal. Patients who did not complete the telephone interview to assess risk factor exposure (N = 30) were excluded.

Tissue processing and slide preparation

All participants donated grossly normal-appearing breast tissue (as assessed by pathology assistants at UNC Hospitals) for research purposes. Tissues were snap-frozen and/or paraffin-embedded, and each patient donated between 1 and 17 specimens from grossly fibrous and mostly fat regions of the breast, as available. For breast cancer patients, tissue specimens were collected at specified distances from the tumor depending on tissue availability (<1, >1–2, >2–4, and >4 cm), while patients with benign breast histology donated tissue from one or two distinct breast sites as available. Frozen tissue specimens were prepared as described previously [16]. Briefly, ~100 mg specimens were cut over dry ice, and sections were collected at both ends of the specimen to construct two 20 micron slides per tissue specimen; 20 micron frozen sections were used to maximize tissue fidelity and image quality. All slides were H&E stained and histological images (1–8 per patient) were scanned into high-resolution digital images using the Aperio Scan-Scope XT Slide Scanner (Aperio Technologies) in the UNC Translational Pathology Laboratory. Slides with poor resolution after scanning or significant folded tissue (N = 16) were manually identified and excluded from analysis. After manual review of the remaining slides by a study pathologist, four breast cancer patients were found to have tissue with high epithelial content (>50%) that did not appear histologically normal and were excluded. Thus, the study population eligible for this analysis included 424 participants (364 patients with a confirmed diagnosis of breast cancer and 60 with no evidence of breast cancer).

Assessment of breast tissue composition metrics and breast cancer risk factors

A previously published algorithm was utilized to partition slides into epithelium, stroma, and adipose regions [8, 16, 18, 19]. Briefly, pathologists marked areas of epithelium, stroma, adipose tissue, and total tissue area using Aperio Imagescope V11.0.2.725, enabling measurement of tissue compartment-specific areas and percentages of total area. Aperio’s Genie Classifier was trained to partition epithelium, adipose tissue, nonfatty stroma, and glass (with glass excluded from the total area) with validation of test digital slides by two pathologists conducting semi-quantitative review of tissue component percentages. The results of these three methods (Genie, manual-annotation, and pathologist semi-quantitative visual review) showed strong correlations (Pearson correlation coefficients ranged 0.96–0.98), with the algorithm outperforming semi-quantitative visual assessment by a pathologist and matching almost perfectly with manually annotated, pathologist reviewed images [8, 16, 18, 19]. A standard, validated nuclear detection algorithm included with Genie was used to identify the number of nuclei per unit area of epithelium (defined as the general area containing terminal duct lobular units and inter-lobular ducts) and to calculate epithelial nuclear density in cells/mm2. The algorithm calculates the density of nuclei within terminal duct lobular unit regions, and therefore handles large and small ducts similarly.

Our previous work shows that the percentages of epithelial area and age are only modestly correlated, or alternatively, that the decline is obscured because of concomitant declines in stromal area and/or total breast area [16]. However, cellular density within regions of epithelium, i.e., number of nuclei per mm2 of epithelial area, did decline with age. As a result, we focused on epithelial nuclear density as a measure of epithelial involution in the current analyses. Measures of percent stroma were considered as possible modifiers of the risk factor-epithelial nuclear density associations. We compared tissue composition measures obtained from frozen and paraffin-embedded tissue sections and found a small but significant decrease in mean percent stromal area among paraffin-embedded compared to frozen tissue specimens (−5.2%, p = 0.005). Therefore, only frozen specimens were included in our analyses, and eight patients with only paraffin-embedded samples were excluded (final N = 416).

To independently validate our methods, we assessed composition metrics in a data set containing asymptomatic healthy volunteers recruited at the Mayo Clinic between 2006 and 2008. In that study, mammography and ultrasound were used to identify and sample tissue through core biopsies from regions of dense and non-dense breast tissue, as described previously [14]. The H&E slides from the dense and non-dense paraffin-embedded tissue were subjected to our classification algorithm, with minor modifications to account for slide fading, artifacts, and differences between fixation protocols. Visual inspection showed that the modified Genie Aperio Classifier was comparable to the Normal Breast Study algorithm for identification of epithelium, adipose, non-fatty stroma and glass. Further, the Classifier correlated with prior semi-quantitation of total epithelial, stromal, and adipose area (data not shown).

Statistical analysis

Technical replicates

For each tissue specimen, samples of ~100 mg size were cut and separate sections were taken from the top and bottom as described previously [16]. To compare histologic metrics within these proximal tissue sections, intraclass correlation coefficients were calculated [20], and we observed highly correlated percentage area and nuclear density measures within a small local region, implying reproducibility of our metrics [16]. Therefore, these top and bottom sections were treated as technical replicates and were averaged to create one percentage area and one nuclear density estimate per epithelial, stromal, and adipose compartment per specimen. For eight patients, one of the two replicate slides showed poor resolution after scanning; thus, histologic metrics were based on a single slide.

Intraindividual variability

To explore variability in breast composition across multiple tissue specimens from the same individual, we calculated intraclass correlation coefficients and examined stromal area in a subset of patients with multiple tissue specimens from different sampling locations (N = 141). Again, technical replicates were averaged for each specimen. Stromal percent area was dichotomized at 10% based on visual inspection of the histogram, which suggested a right-skewed distribution of stromal area (Fig. 1a). We also anchored this 10% stroma threshold to independent observations from the Mayo data set, in which dense and non-dense tissue from the same patient were sampled. A 10% stromal cutpoint could distinguish Mayo samples with high mammographic density (BI-RADS categories 3 and 4) from those without, with 91% sensitivity and 85% specificity (Table 1). Thus, independent data suggest a 10% stromal area threshold may be biologically meaningful. Based on these observations, we categorized specimens according to stromal area as follows: all replicate biological samples ≥10% stroma, all samples <10% stroma, or mixed, referred to henceforth as “adequate stroma”, “mostly fat”, and “mixed”, respectively. Polytomous logistic regression was used to relate patient and sampling factors (e.g., number of tissue samples, age, and mammographic density) to stromal area among women with multiple tissue samples (N = 141).

Fig. 1
figure 1

Distribution of stromal percentage area (a) and epithelial nuclear density according to stromal area (b) among patients in the Normal Breast Study (N = 416). Trend line in panel b obtained by linear regression, β = 68.2, p < 0.0001, adjusted R2 = 0.50

Table 1 Assessment of sensitivity and specificity using different stromal area cutpoints in Mayo data set (N = 150 samples, 75 dense/nondense tissue pairs)

Risk factor associations with tissue composition

For analyses assessing associations between histology and risk factor exposure, a single specimen was evaluated for each patient. For patients with “adequate stroma” samples, the first sampled tissue site with adequate stroma was selected; for patients with all “mostly fat” specimens, the specimen closest to tumor or first sampled specimen was included. Linear regression was used to estimate associations between breast cancer risk factors and epithelial nuclear density among all women (N = 416). All risk factors were modeled categorically using indicator terms, and linear tests for trend were conducted for continuous exposures showing a linear dose-response relationship with epithelial nuclear density (age, body mass index, and mammographic density). Univariate analyses estimated unadjusted associations between each risk factor and epithelial nuclear density, while multivariate analyses were adjusted for continuous age and type of surgery (cancer-adjacent lumpectomy/mastectomy, reduction mammoplasty/prophylactic mastectomy, and benign biopsy), as these factors were found to be associated with stromal content. Patients were not excluded on the basis of disease status, as sensitivity analyses reported previously [16] suggested results were similar when excluding patients without breast cancer. All risk factor analyses were stratified by stromal area (10% cutpoint), and heterogeneity of stratified estimates was compared to assess effect measure modification. Cubic spline models and boxplots were used to visualize the relationships between age or menopausal status and epithelial nuclear density overall and by stromal area. Spline models specified a knot at age 55 years consistent with evidence of an inflection point. Statistical significance for all analyses was defined as p < 0.05, and statistical analyses were performed using SAS software, version 9.4 (SAS Institute, Inc., Cary, North Carolina), or R, version 3.2.3 (for spline models).

Results

Intraindividual variability in breast stromal area

The stromal area showed a right-skewed distribution (Fig. 1a, b) in this study, with ~24% of samples having <10% stromal area. Among 408 women with repeated breast tissue sampling, the majority of women (81%) had at least one sample with at least 10% stroma, with 51% of all women having stromal area ≥10% for all specimens. We observed substantial intraindividual variation in the levels of stromal area across samples; Fig. 2 illustrates differences in stromal area for four representative women with repeated sampling. Intraclass correlations comparing tissue specimens from two distinct sampling sites from 141 women revealed low consistency in stromal percent area and epithelial nuclear density across different breast regions (stroma: 0.44, 95% CI: 0.32, 0.54; epithelium: 0.35, 95% CI: 0.23, 0.47). Interestingly, premenopausal women with mostly fat samples were common (35% of premenopausal women), and many patients (25% overall) had both low stroma (<10%) and higher stroma (≥10%) samples within the same breast. These results highlight substantial intraindividual variability in breast composition across women of all ages.

Fig. 2
figure 2

Intraindividual variation in breast tissue composition across repeated tissue sampling. Panels a1–d2 represent H&E slides taken at resolution ×2 from four representative women in the Normal Breast Study (two slides per woman) highlighting differences in tissue composition across distinct regions of the breast. Images from panels a1/a2 and b1/b2 represent tissue from a 58-year-old and 39-year-old woman, respectively, with mixed adequate stroma/mostly fat tissue samples. Images from panels c1/c2 and d1/d2 represent tissue from a 38-year-old and 64-year-old woman, respectively, with all adequate stroma tissue samples

To identify factors contributing to sampling reproducibility, patients were divided into three groups: all samples ≥10% stroma (adequate stroma), all samples <10% stroma (mostly fat), and mixed. Examining whether breast quadrant was related to intraindividual variability in stromal area revealed a slightly higher likelihood of mixed stroma/mostly fat tissue samples among women with upper outer quadrant sampling compared to other quadrants, though no associations were statistically significant and the precision of our effect estimates was poor (results not shown). However, the number of tissue samples greatly increased the odds of having mixed class, while women with all mostly fat samples tended to have fewer samples (Table 2). Women with mixed composition tended to be older with lower mammographic density than women who had consistently adequate (at least 10%) stromal composition. Sampling and patient characteristics appeared to significantly influence both the magnitude and intraindividual variability of stromal area.

Table 2 Patient factors contributing to intraindividual variability of stromal area within women with multiple tissue specimens in the Normal Breast Study (N = 141)

Stromal area modifies the dynamic range of epithelial nuclear density

The distribution of epithelial nuclear density comparing low stroma (<10%) to higher stroma (at least 10%) revealed marked differences in the range of epithelial nuclear density values (Fig. 3a). Low stroma was associated with low average epithelial nuclear density (mean = 3910.8 cells/mm2, SE = 117.4 cells/mm2) and little variability across patients. In contrast, the mean and dynamic range of epithelial nuclear density were dramatically greater for samples with adequate stroma (stromal area ≥10%), encompassing approximately twice the range of nuclear density values (mean = 7077.6 cells/mm2, SE = 145.2 cells/mm2). Moreover, patterns of age-dependent involution were evident only in samples with at least 10% stroma (Fig. 3b). Among those with adequate stroma (≥10%), epithelial nuclear density remained relatively stable at ~8000 cells/mm2 until age 55 years, after which epithelial nuclear density declined rapidly at a rate of 99.7 cells/mm2 per year (p < 0.0001). In contrast, women with low stroma samples (<10%) had tissue that appeared to be completely involuted, with epithelial nuclear density remaining stable at ~4000 cells/mm2 and showing no change with increasing age. For comparison with previous studies that did not stratify on stromal content, we also performed analyses considering all samples, wherein epithelial nuclear density decreased with age at an annual rate of 39.1 cells/mm2 (p = 0.04) until age 55 years, after which the rate increased and epithelial nuclear density declined 66.5 cells/mm2 per year (p = 0.0008) (Fig. 3b). There was high variability within stromal area categories (R2 < 0.1) underscoring the importance of replicate observations and large samples sizes in histologic studies of normal breast.

Fig. 3
figure 3

Epithelial nuclear density by stromal area content in the Normal Breast Study (N = 416). a Distribution of epithelial nuclear density by stromal area among pre- and postmenopausal women in the Normal Breast Study. “Mostly fat” denotes tissue samples with <10% stromal area while “adequate stroma” denotes samples with ≥10% stromal area. b Cublic spline curves for epithelial nuclear density by age stratified by stromal area (inflection point at age 55 years). R2 = 0.02, 0.09, and 0.06 for “mostly fat”, “adequate stroma”, and “all” spline curves, reflecting high variability between categories

Risk factor associations with epithelial nuclear density are influenced by stromal area

Given the observation that epithelial involution status is associated with stromal composition, we hypothesized that risk factor associations with epithelial involution may be modified by stromal content. Indeed, exclusion of samples containing low stroma showed different and stronger risk factor associations with epithelial nuclear density than those among women overall (Table 3). Among women with low stromal area, no risk factors were significantly associated with epithelial nuclear density (Table 4). However, among patients with high stroma (Table 3), we observed significant associations with risk factor exposures.

Table 3 Risk factor associations with epithelial nuclear density among women overall and with adequate stroma (stromal area ≥10%) breast tissue
Table 4 Risk factor associations with epithelial nuclear density among women with mostly fat (stromal area <10%) breast tissue (N = 99)

Epithelial nuclear density significantly varied by surgery type after adjustment for age, with women receiving reduction mammoplasties or prophylactic mastectomies having significantly lower epithelial nuclear density than women receiving surgeries related to their cancer diagnosis (difference = −891.1 cells/mm2, p = 0.03); thus, all subsequent analyses adjusted for surgery type. Postmenopausal women appeared to have dramatically lower epithelial nuclear density in univariate analyses (p < 0.0001), but not after adjusting for age and surgery type (p = 0.5). Among women with stromal area ≥10%, higher mammographic density was associated with significantly increased epithelial nuclear density in adjusted models (Table 3). Epithelial nuclear density was significantly higher among women with later age at first term pregnancy, more recent birth, longer durations of oral contraceptive use (>10 years), and more recent use of oral contraceptives (≤5 years ago) in univariate analyses (results not shown). These associations were attenuated and not significant in adjusted models, but the patterns of association were still consistent (Table 3). Race, body mass index, age at menarche, history of breastfeeding, breastfeeding duration, and menopausal hormone therapy recency were not associated with epithelial nuclear density.

Discussion

Understanding the relationship between risk factors and breast histology may inform breast cancer risk assessment [7,8,9,10]. Our findings suggest that epithelial nuclear density is sensitive to some risk factor exposures. Higher epithelial cell content (i.e., nuclear density) was associated with higher mammographic density, and, to a lesser extent, with later age at first term pregnancy, more recent time since last birth, and longer duration and more recent use of oral contraceptives. However, these associations were evident only in samples with higher (at least 10%) stromal percent area. There was substantial intraindividual variability in stromal percent area in our study, underscoring the need for systematic sampling strategies. The probability of sampling a tissue specimen with at least 10% stromal area was increased with younger age, lower body mass index, higher mammographic density, and greater number of tissue samples.

Previous studies have also assessed intraindividual variability in breast composition. Hutson et al. [13] described variability in the representation of epithelium but similarities in lobule size and composition across different breast quadrants, while Figueroa et al. [9] observed mixed adipose content in breast tissue samples. Gierach et al. [5] identified moderate agreement in the number of terminal duct lobular units, a measure of breast involution, within replicate breast biopsies, but observed heterogeneity in measures of terminal duct lobular unit size among premenopausal women. In contrast, Vierkant et al. [12], among others [1, 6], identified little heterogeneity in markers of breast involution (including involution of terminal duct lobular units and lobules as well as differences in lobule type) across multiple breast tissue specimens obtained from the same woman. Differences between these studies may reflect differences in how sampling was performed. In many studies, breast tissue sampling occurs by pathologist gross assessment of tissue specimens [1, 2, 6, 8, 12, 13, 21, 22] or by mammography- and ultrasound-guided sampling [10, 14], either of which may result in oversampling of dense, high stromal-content tissues. Our study pathologists were instructed to sample both grossly fibrous and mostly fat regions of breast, and therefore our study was sensitive to uncovering intraindividual variability. Similarly, the study by Mayo included in the present report explicitly sampled high- and low-density areas and showed similar intraindividual variability in stromal content. Other sampling strategies may explicitly or implicitly favor sampling of epithelial-rich areas and therefore may have lower risk of sampling low stroma tissue. However, our findings demonstrate conclusively that sampling strategy, and specifically stromal composition, has important implications for risk factor and histologic associations.

Previous experimental studies have documented the importance of stroma and the breast microenvironment in regulating epithelium. In experimental and model systems, stroma plays a critical role in mammary gland development, growth of epithelium, and tumorigenesis [23,24,25]. In the current study with human tissues, the limited dynamic range of epithelial nuclear density within regions of low stromal area suggests stroma is also important in maintaining human breast epithelium. We hypothesized that if a gland is fully involuted and/or lacks stroma for active regulation, it may be less responsive to other hormonal factors and risk factor-induced changes. Indeed, tissue with stromal area ≥10% showed the strongest associations between epithelial nuclear density and risk factor exposures. Stroma-poor regions of the breast may lack regulation mechanisms present in stroma-rich regions.

Some of the associations we observed among samples with high stroma have been reported previously. It is well established that both stromal and epithelial content are associated with higher mammographic density [8, 10, 14, 26, 27]. In fact, mammographic density is often interpreted as a noninvasive measure of fibroglandular content, underscoring that fibrous stroma and epithelial glandular tissue cannot be readily distinguished in mammograms. Our data emphasize that stromal content may itself be associated with higher epithelial content, suggesting a plausible explanation for why percent density and mammographically dense area both have strong associations with risk [2, 27]. Although we were underpowered to study associations using multivariate logistic regression in the Mayo dataset, those data provided independent validation of the association between stromal content and dense area as identified through breast imaging, demonstrating that both parameters vary intraindividually. Taken together, these findings show that mammographically dense area reflects variability in composition across the entire breast, and our study shows this intraindividual variability to be quite high.

Consistent with our findings, most studies examining breast epithelial content in relation to parity and other reproductive or hormone exposures (including age at first term pregnancy, breastfeeding history, and oral contraceptive use) have observed no long-term statistically significant differences in epithelial content with exposure [7, 8, 10]. Jindal et al. [28] and Russo et al. [6] reported increased epithelial area and lobule formation, respectively, during pregnancy and significant epithelial regression postpartum, but these changes were transient and were offset by postpartum involution. In contrast, we found increased epithelial nuclear density among recently parous women and recent or long-term users of oral contraceptives, similar to a study of epithelial percent area by Gertig et al. [7]. The effects of parity and other risk factors may vary in study populations according to methodological and selection factors.

Our findings should be interpreted in light of some limitations. Our study included histologically normal breast tissue sampled from women undergoing breast surgery due to cancer or other benign conditions. Although biopsy and mastectomy samples minimize risks to patients and are therefore most commonly used in previous breast histology studies [1, 6,7,8,9, 12, 14, 16, 18, 19, 21, 22], there is potential for reverse causation and selection bias, with more women participating with epithelial and/or stroma-rich tissue. Indeed, we found that epithelial nuclear density varied by procedure. However, we observed associations that were comparable to those from other studies including only volunteers with benign or healthy breast tissue [7, 9]. It has also been suggested that breast tissue response to hormone exposure may be stronger in high-risk women [29]. Thus, while some associations we detect may be different in a general population, they may be highly relevant for a susceptible group of high-risk women. Additionally, we did not use specific multivariable modeling strategies to adjust for repeated tissue samplings in our study. Rather, we selected one sampling site for inclusion in our analyses to most closely approximate the previous literature which used one sample per patient. While a strength of our study is the use of frozen tissue which avoids artifacts and cell shrinkage associated with formalin-fixed paraffin-embedded tissues, paraffin-embedded samples are more highly represented in breast tissue studies, which may complicate the comparison of findings across preservation types. However, our validation to the Mayo test data set involving paraffin-embedded samples showed that algorithm performance was comparable to the Normal Breast Study algorithm, suggesting that future comparisons may reveal similar patterns of breast tissue composition regardless of preservation method. We also did not distinguish between luminal epithelium and myoepithelium in our measure of epithelial nuclear density, and we may have occasionally misclassified endothelial and epithelial cells due to similarities between the size of nuclei and architecture of the lumen. We were unable to evaluate how tissue histology related to patient germline genetics, immune infiltrates, or physiological changes in breast tissue associated with menstrual cycling, all of which would be interesting future directions for studies of benign breast tissue. Finally, while our study included over 400 women, our analyses of some risk factors were undoubtedly underpowered given the high variation within and across participants. Larger studies with repeated sampling of the same individuals may identify subtler patterns in breast tissue composition across breast cancer risk factors.

In summary, our findings underscore the importance of stroma in modifying epithelial characteristics of breast tissue and indicate that both sampling strategies and analysis methods for breast tissue should consider epithelial and stromal components simultaneously. There is widespread interest in using epithelial characteristics of benign breast to predict risk [1, 9], and while sampling is relatively invasive, findings from this study suggest analytic strategies for risk stratification in women who have undergone biopsy following suspicious mammogram findings (false positives).