Introduction

In spite of the enormous forward leaps made in treatment regimens, breast cancer remains a leading cause of death among women worldwide, typically attributable to late detection of the disease1. In the absence of major research breakthroughs, the probability for the foreseeable future is that female breast cancer cases are going to escalate2. Metastatic disease is accountable for the majority of cancer-induced mortality.

Diagnosis of breast tumour can result from breast screening (eg in asymptomatic screening programmes), self-examination and symptomatic concerns. Despite mammography-guided fine needle biopsies and morphological/histological studies of core biopsies being the current gold standards for establishing cancerous alterations, these have limitations, not limited to human error and professional experience that can influence diagnosis3. Many studies have pointed to the limitations of mammography, offering sensitivity and specificity of 70% to 90%4, usually followed up by biopsy to determine whether the abnormality is cancerous5.

Metastastic initiation requires invasion of surrounding tissues by malignant cells via a multistep process. Breast cancer usually develops from the epithelial cells lining the terminal duct lobular unit. Disintegration of different junctions functioning to connect epithelial cells to each other allows detachment of epithelial cells and loss of apical-basal polarity. To provide epithelial cells with invading characteristics, a highly conserved cellular program called the ‘epithelial-mesenchymal transition’ (EMT) is induced, restraining epithelial traits in favour of a motile mesenchymal phenotype. The epithelial cells usually interact with the basement membrane via its basal surface to undergo multiple biochemical changes that allow it to take on a mesenchymal cell phenotype, including enhanced migratory capabilities, invasiveness and significantly increased production of extracellular matrix (ECM). As opposed to the EMT viewed as a program that distinctly demarcates cells residing in either an absolute epithelial or mesenchymal condition, the EMT is seen as a plastic program whereby cells progress towards a mesenchymal state, potentially reverting to their epithelial roots once they have invaded a secondary site. Consequently, carcinoma cells are able to re-epithelialize at the metastatic location, being vital for metastatic colonization and development of metastatic outgrowths.

Collagens are major proteins that exist in the extracellular matrix (ECM), the degeneration and penetration of the ECM seeming to be significant processes relating to the morbidity and mortality of cancer6. Specifically, association of invasive carcinoma cells with the ECM has been associated with an increase in proteins which then disrupt the ECM, permitting the invasion of carcinoma cells, proliferating steadily to surrounding tissue. Several studies have demonstrated disturbed collagen structures, in particular the degree of deterioration corresponding with invasive carcinoma development in breast7,8,9. Fibrillar collagen types I and III are dominant in ECM10, molecular alterations in these being potentially linked with invasive carcinoma. In electron micrograph studies by others, a particular focus has concerned fibrillar collagen in tumour-bearing tissues, degradation in this being observed. The suggestion has been that this could arise from a failure of the altered collagen to aggregate in such a way that would allow formation of the bundles typical of healthy tissue11,12. Further to this has been the suggestion that impaired supramolecular organization occurring during cancer progression could provide for pathways engendering invasion of neoplastic cells13,14. The organization of the collagen molecules within the tissue can be analyzed through diffraction peaks that reflect the regular packing of the collagen. As one of the prime components of breast tissue, collagen has been studied through a plethora of techniques inclusive of small angle x-ray scattering (SAXS)5,15,16. SAXS patterns have previously been analyzed in investigation of the condition of collagen fibrils in breast tissue in attempts to find distinguishable characteristics relatable to breast disease, all containing collagen structures affected by invasive cancer3,15,17. Table 1 details the parameters used in previous studies, analytical results demonstrating impact on structural components of breast tissue, particularly in collagen. Table 2 summarizes SAXS breast cancer tissue work of others.

Table 1 SAXS features and their relation to the structural components in breast tissue.
Table 2 Features studied in previous SAXS work.

It needs to be stressed that the EMT process relates to a relatively brief period of existence, occurring at the onset of malignancy. In the absence of soft tissue mass and associated density changes, detecting early stage changes via mammographic imaging is a particular challenge, evidence of spiculations, calcifcations and disturbed breast architecture being diagnostically equivocal. Further to be acknowledged are the several primary challenges in histopathological evaluation of EMT in human cancer tissue. Difficulties arise in real time tracking of individual cancer cells that have transitioned or are in transition. One issue concerns the staging of EMT, whether it has occurred, whether it has yet to undergo transition or whether the reverse process has occurred, viz the mesenchymal-to-epithelial transition (MET)18,19,20,21 Nevertheless, the prototypical spindle-shaped morphology of the fibroblasts can be used to advantageous to differentiate between epithelial cancer cells and fibroblasts. Thus said, epithelial cells that have undergone transition are poorly distinguishable from fibroblasts.

The overarching aim of present research on SAXS pattern analysis is to discriminate between different types of breast tissue, healthy, benign or malignant, focusing in particular on those that are EMT negative and those that are EMT positive. In this respect, while numbers of studies have investigated use of SAXS for tissue recognition, none to our knowledge have investigated spectroscopic diagnosis (e.g. by structural component analysis) of EMT breast cancer tissues, a matter of interest in present work. Specifically, we have utilized x-ray synchrotron radiation to gather information on the axial spacing and intensity of scattering patterns, reflective of internal collagenous structure, the scattering profiles being indicative of structural detail at the supramolecular level.

Methodology

Breast tissue sample preparation

This work examined breast tissue samples acquired by the Department of Pathology, University of Malaya, obtained from surgical excision or mastectomies. The tissues were acquired following fully informed consent from patients who were subject to surgical treatment, with approval for use in research granted by the Medical Research Ethics Committee, University Malaya Medical Centre in accordance with the International Conference on Harmonization – Guidelines for Good Clinical Practice (ICH-GCP) and Declaration of Helsinki. For the first category of investigations, primary analysis was carried out by the collaborating pathologists on all eight of the tissue samples that were acquired from five patients; these were histologically categorized as EMT-1 to EMT-8 (see Table 3). Following this, SAXS measurements were also carried out on 26 paraffin-wax embedded (subsequently dewaxed) breast tissues from 17 patients (indicated as EMT-9 to EMT-35, as shown in Table 3), the pathological state of these not being revealed to the SAXS investigators (deemed blinded samples). For all cases, the samples were dissected out, obtaining nominal dimensions 5 mm (length) × 5 mm (breadth) × 1 mm (thickness), with two sub-samples taken of each selected case, fixed in formalin solution and kept in Eppendorf tubes (at 10% formaldehyde in water). Prior to SAXS measurements, excess formalin was withdrawn from the tissues and the tissues taped onto a metal frame (10 × 10 mm) on both sides using Kapton tape, a process performed at the synchrotron site.

Table 3 The breast cancer tissue samples classified on the basis of pathology.

Preparation of breast cancer tissue into FFPE blocks

This study utilized routine patient samples submitted for histopathological examination at the University of Malaya Medical Centre (UMMC), processed as follows. Fresh mastectomy or tumour excision specimens received by the histopathology laboratory following breast cancer surgery were sliced to some 1 cm thickness and immersed in 10% neutral buffered formalin for tissue preservation. The volume of formalin to tissue was about 10:1. Fixation was carried out over a minimum duration of 6 hours, larger specimens (mastectomies included) being commonly fixed overnight. Subsequent to fixation, the specimens were sampled for tumour, surgical margins and any other areas of interest. Each sample was placed into a labelled plastic cassette and kept in 10% neutral buffered formalin to await their processing into formalin-fixed paraffin-embedded (FFPE) blocks.

The completely automated sample processing, from formalin-fixation to paraffin wax (impregnation), was availed using a Leica TP 1020 tissue processor. The sequence was dehydration by alcohol followed by the use of xylene for clearing of alcohol and subsequently replacement of xylene by paraffin wax. In detail, the processing schedule was:

  1. i)

    Two changes of 10% formalin for 81 minutes each.

  2. ii)

    Two changes of 95% alcohol for 81 minutes each.

  3. iii)

    Three changes of 100% alcohol for 81 minutes each.

  4. iv)

    Two changes of xylene for 81 minutes each.

  5. v)

    Three changes of wax for 81 minutes each.

After impregnation with paraffin, the tissue was embedded in a desired orientation in the cassette, with a metal mould as the backing upon a hot plate. The cassette was then filled with liquid paraffin and placed on a cold plate to solidify the paraffin. The paraffin block was then popped out of the mould, in so-doing creating a formalin fixed paraffin embedded (FFPE) block in a plastic cassette, which was then ready for microtome sectioning.

Dewaxing of the FFPE blocks

The cases recruited for this study were histologically-confirmed breast carcinoma and two cases of benign tumour (fibroadenoma). The EMT status of the breast carcinomas have been determined by the collaborating pathologists based on y scoring of E-cadherin (epithelial) and vimentin (mesenchymal) expression by immunohistochemistry as described in section 2.1.4. For the breast carcinoma cases, selection of study material included additional samples of normal (healthy, non-lesional) tissues from cancer free regions of mastectomies. Exclusion was made of cases with scanty tumour material in the FFPE blocks, the concern arising that future pathology review could be compromised. Dewaxing of the FFPE material for SAXS was performed manually using the following steps:

  1. i)

    The chosen FFPE blocks were melted down at 65 to 70 °C for about 3 hours.

  2. ii)

    The sample was retrieved from the melt and dewaxed with xylene for 3–4 hours.

  3. iii)

    The dewaxed sample was subjected to 100% alcohol cleansing for 30 to 60 minutes.

  4. iv)

    The sample was then washed under running water until clean.

Dewaxed and rehydrated samples were then subsampled for SAXS. Two subsamples were taken of each selected case, comprising slices of 1 mm thickness. The sliced tissues were then kept in 10% neutral buffered formalin within Eppendorf tubes to await their SAXS examination.

Formalin-fixed non-paraffinized tissue

Formalin-fixed non-paraffinized tissue were re-sampled from residual tumour, from mastectomy or tumour excisions of four recently diagnosed cases (2 breast carcinoma and 2 fibroadenoma). These were material that were left over after histopathology diagnosis had been completed and identified for disposal. Duplicate samples from each selected case were of 1 mm thickness. The sliced tissues were then kept in 10% neutral buffered formalin within Eppendorf tubes to await SAXS examination.

Determination of EMT status by immunohistochemistry (IHC)

For each study cancer case, microtomed sections from the selected paraffin blocks were stained for immunohistochemical expression of e-cadherin (1:50; Dako: Clone NCH-38) and vimentin (1:500 Dako:Clone V9) using a Ventana BenchMark automated system. Cytoplasmic membrane expressions were semiquantitated for (i) percentage of malignant cells that expressed e-cadherin or Vimentin, as 0 (0–<1%), 1 (1–10%), 2 (11–50%), 3 (more than 50%) and (ii) intensity of staining: 0 (negative), 1(weak), 2 (moderate) and 3(strong). Separately, for both e-cadherin and vimentin, the percentage and intensity scores were then multiplied to obtain a final score. Final scores of 0–4 were classified as low expression and scores of 6–9 classified as high expression. EMT was considered to be present if the cancer showed a low e-cadherin final score, or a high vimentin final score.

Determination of tumour size, histological grade, estrogen receptor (ER), progesterone receptor (PR) and HER2 status

It is the routine practice in UMMC for the maximum dimensions (cm) of all excised breast cancers to be recorded in the histopathology reports. The H&E slides of cancers for this study were retrieved and re-evaluated for histological grade based on the modified Bloom and Richardson system which incorporates an evaluation of nuclear pleomorphism, tubule formation and mitotic activity22. The presence of calcifications was also noted.

All cancers for this study had been routinely stained for estrogen receptor (ER), progesterone receptor (PR) and HER2 status by IHC using the Ventana BenchMark automated system (see Table 4). The IHC slides were retrieved and expressions for ER, PR and HER2 evaluated according to guidelines of the American Society of Clinical Oncology/College of American Pathologists (ASCO/CAP)23,24. Cancers which were ER and PR positive irregardless of HER2 expression were categorized as hormone receptor (HR) positive. Cancers which were negative for ER and PR, and were HER2 positive (i.e. HER2 IHC score 3 or IHC score 2 but confirmed amplified by in-situ hybridization) were categorized as HER2 enriched. Cancers which were negative for ER, PR and not HER2 enriched were categorized as triple negative (TN).

Table 4 Pathological characteristics of the study samples.

Small angle X-ray scattering (SAXS) measurement

The experiment was conducted at the BL1.3 W: SAXS beamline at the Synchrotron Light Research Institute (SLRI) in Thailand. The experimental setup is as in Fig. 1. An undulator radiation source was used, monochromatized using a Double Multilayer Monochromator (W/B4C), focused with a toroidal mirror to provide an X-ray beam of wavelength 1.379Å (Energy = 9 keV) and a beam size of sample of 2 × 1 mm. The sample-to-detector distance was fixed at 1837.14 mm, a 2-D detector (Rayonix SX165 CCD) providing 2048 × 2048 pixels in recording the scattering pattern. The detector resides inside a vacuum chamber in order to reduce air scattering and absorption losses. The sample-detector-distance allowed recording of momentum transfers within the range 0.06 nm−1 ≤ q ≤ 2.50 nm−1. To avoid detector saturation a beam stop of 4.5 mm diameter was centrally located adjacent to the detector, radically reducing primary (direct) beam intensity. To calibrate and normalize the SAXS patterns, silver behenate was used as a standard, obtaining the reciprocal space scale of each image25. Measurement time was fixed at 5 min, obtaining sufficient scattered photon count while avoiding detector saturation. Once imaged, the tissue samples were returned to the receptacles for continued preservation.

Figure 1
figure 1

Set-up of the Beamline 1.3 W: SAXS at Synchrotron Light Research Institute (SLRI) in Thailand.

Data processing

Each SAXS image consists of a series of arcs, each arc with a radius commensurate with the momentum transfer q, as can be seen in Fig. 2. Through radial integration, as typically performed for SAXS analysis, supported by SAXSIT software, for each sample the one-dimensional distribution of scattered intensity as a function of momentum transfer was extracted from the two-dimensional SAXS pattern. Of paramount importance in obtaining the scattering profile of the sample itself is that corrections are made to account for sample attenuation and background; Fig. 3 shows an example corrected scattering profile normalized to the incident intensity. Background subtraction is obtained from Eq. 1 and fitting was performed on background subtracted scattering profiles.

$${I}_{subbg}={\tilde{I}}_{sam}-(1-c){\tilde{I}}_{bg}$$
(1)
$${\tilde{I}}_{sam}=\frac{{\bar{I}}_{sam}-{\bar{I}}_{dark}}{{T}_{sam}\cdot {i}_{sam,IC}},$$
(1.1)
$${\tilde{I}}_{bg}=\frac{{\bar{I}}_{bg}-{\bar{I}}_{dark}}{{T}_{bg}\cdot {i}_{bg,IC}}$$
(1.2)

where \({\bar{I}}_{sam}\) is the sample pattern divided by the sample exposure time, \(\,{t}_{sam}\), \({\bar{I}}_{bg}\) is the background pattern divided by exposure time, \({{t}}_{bg}\), \({\bar{I}}_{dark}\) is the dark current pattern divided by accumulation time, \({t}_{dark}\). The X-ray intensities which is measured by the ion chamber integrated over the period of the measurements of the sample and background are classified as \({i}_{sam,IC}\), and \({i}_{bg.IC}\), respectively. The sample used herein is solid and therefore the concentration, c was set to zero. Meanwhile, \({T}_{sam}\) is the sample transmission and \({T}_{bg}\) is the background transmission. It should be noted that \({\bar{I}}_{sam}\) is normalized by the exposure time \({t}_{sam}\), sample transmission \({T}_{sam}\) and X-ray intensity measured by ion chamber integrated over the exposure time \({i}_{sam,IC}\).

Figure 2
figure 2

Example of a two-dimensional SAXS diffraction pattern as obtained in the present work.

Figure 3
figure 3

Typical one-dimensional profile extracted from the SAXS signal of non-lesional breast tissue, as obtained in present work. A fat (adipose tissue) peak is found in the momentum transfer range 1.0–2.0 nm−1 (labeled ‘Range 3’). The 3rd and 5th order peaks are found in the momentum transfer range 0.2–0.6 nm−1 (labeled ‘Range 1’).

SAXS patterns accord with Bragg’s law (Eq. (2)), coherent scattering giving rise to interference and spread as a result of structural periodicity, providing an ability to classify tissues and to identify structures as small as 0.1 nm3,25, also being commonly used to characterize macromolecules in low resolution:

$$n\lambda =2d\,sin\theta $$
(2)

d representing the separation between the periodic scatterers, θ being the Bragg angle, equal to half the total scattering angle of the incident monochromatic x-rays, 𝜆 its wavelength and n the order of the scattering maxima. In this way, SAXS studies have provided convincing contrast between healthy and diseased tissues16,25,26. In particular, oriented collagen fibrils have distinct SAXS features in the meridional direction due to the axial periodicity26, the d-spacing being easily obtained from the axial Bragg peaks. Diffraction pattern differences arise from inhomogeneities in the distribution of electrons in the material. The radial distribution of the observed scattering pattern can also be used to infer particle density and surface area of the sample constituents. Figure 4 illustrates the SAXS geometry; the elastic scattering of the incident photons are conserved and interference of the scattered photons can be sensed by the detector. Scattering data can be presented by relating the observed intensity to the modulus of the scattering vector \(\mathop{q}\limits^{\rightharpoonup }\) (see Fig. 5) as described in Eq. (3):

$$\mathop{q}\limits^{\rightharpoonup }=\mathop{{k}_{f}}\limits^{\rightharpoonup }-\mathop{{k}_{i}}\limits^{\rightharpoonup }$$
(3)
$$\mathop{|{k}_{i}|}\limits^{\rightharpoonup }=\frac{2\pi }{{\rm{\lambda }}}=\mathop{|{k}_{i}|}\limits^{\rightharpoonup }$$
(3.1)

where the incoming photon, \(\mathop{{k}_{i}}\limits^{\rightharpoonup }\) and the scattered photon \(\mathop{{k}_{f}}\limits^{\rightharpoonup }\) define \(\mathop{q}\limits^{\rightharpoonup }\). From the geometry of Fig. 4, the scattering vector can be described as in Eq. 3.2.

$$q=\frac{4\pi sin\theta }{{\rm{\lambda }}}$$
(3.2)
Figure 4
figure 4

Small angle x-ray scattering geometry. Note: figure drawn in Microsoft powerpoint.

Figure 5
figure 5

Construction of the scattering vector, \(\mathop{{\boldsymbol{q}}}\limits^{{\boldsymbol{\rightharpoonup }}}\,\)with incoming photon, \({{\boldsymbol{k}}}_{{\boldsymbol{i}}}\) and the scattered photon, \({{\boldsymbol{k}}}_{{\boldsymbol{f}}}\). Note: figure drawn in Microsoft powerpoint.

Using Eqs. (2) and (3.2), the d-spacing and scattering vector q are related by Eq. (4).

$$d=\frac{2\pi n}{q}$$
(4)

In this work, the structural components at scattering vector, q = 0.2 nm−1 to 1.5 nm−1 were investigated as this region corresponds to the region that provide the d-spacing for the collagen. From Fig. 5, it can be deduced that the 3rd and 5th order axial peak were the most intense for non-lesional tissue, benign lesion and EMT negative malignant tissue whereas the 9th and 10th order are the most intense for EMT positive malignant tissue. In regard to the peaks, the software OriginPro201927 was used to fit Gaussian curves to each significant peak of the scattering profiles. From the Gaussian peaks, parameters such as the centre position were determined for each peak. The peak centre corresponds to the x-coordinate of the peak centre, namely the scattering vector, q. The value of the Gaussian center is related to the collagen axial d-spacing, calculated through Bragg’s Law using Eq. 4.

The key advantages of the SAXS method, other than that its generally non-destructive nature, are the availability of extensive variety of sample solution conditions, including the ability to maintain coverage of a wide range of molecular sizes. In particular, it allows one to analyze the structure of native particles in near physiological state, also to examine structural changes in response to an imbalance in external conditions28.

Statistical analysis

From the corrected sample scattering profiles obtained from SAXSIT, parameters associated with structural information have been generated, fittings resulting from use of the software OriginPro2019 (v9.6)27. Fitting of the peaks to Gaussians have allowed generation of the parameters: FWHM, amplitudes, centre position and area of the individual peaks. Axial periodicities (d-spacings), extracted from Gaussian peak centres, were obtained using Eq. (4). To determine those variables differentiating between groups, discriminant analysis was performed, R software (Version 3.6.1) being used to perform the Kruskal-Wallis test, with subsequent performance of Dunn’s post hoc test to identify the pairs of tissues which differed significantly for all measurements considered, p values substantiating parameter extractions of practical consequence in constructing a diagnostic model.

Results and Discussion

SAXS Analysis of 16 classified breast tissues

Figure 6 depicts the scattering profile of each of the tissue groups analyzed in this work, including non-lesional, benign and malignant, EMT negative and EMT positive. Based on the scattering profiles, a number of features were analyzed, as follows: (i) the 3rd, 5th, 9th and 10th order Bragg peaks located between 0.175–0.200 nm−1, corresponding to the axial organization of the collagen fibrils; (ii) the peak at 1.44 nm−1 for EMT positive and EMT negative lesions; (iii) 1.46 nm−1 for non-lesional tissue, corresponding to the scattering of triglycerides acknowledging this to be one of the most significant of the constituents of lipids. Fat peaks are not present in any of the benign tissue, stroma not containing triglycerides. Finally is a fourth category, a diffuse scattering component manifests a rapidly decreasing, resulting from disordered collagen fibril structures and other constituents of the extracellular matrix.

Figure 6
figure 6

Typical scattering profiles of four groups of breast tissue analyzed in present work.

Observed is that the scattering intensity is greatest in EMT positive malignant tissue, followed by EMT negative malignant tissue and subsequently benign tissue, moderate in intensity, the lowest intensity being for non-lesional tissue. This arises from the propagation of cancer and invasion of normal tissues, leading to changes in collagen fibril structural organization, then causing the endogenous collagen to degenerate and new collagen of diminished structural organization to emerge8. Notably, EMT positive lesions manifest at the 9th and 10th order while other tissues manifest in 3rd and 5th order peaks, resolved to be due to the loss of E-cadherin expression in tissues suffering invasion and metastasis29. E-cadherin functions to yield mechanically strong adhesive links between cells in the tissue and is crucial in maintaining epithelial morphology30,31. It is then replaced by mesenchymal specific N-cadherin and integrins specific for a more transitory adhesion, activating the EMT, invasion and metastasis. In addition, matrix metalloproteinases (MMPs) emerge to degrade the basement membrane, leading to the migration of cancer into the surrounding stroma.

Table 5 shows analysis of the axial spacing, peak area and FWHM (mean and range). Figure 7 demonstrates appreciably larger d-spacing for EMT positive malignant lesion compared to the other tissue types, a consequence of cancer propagation into the breast tissues and increase of N-cadherin expression, the tissue losing cell-cell adhesion as previously mentioned, causing the long-range intermolecular bonding rigidity of collagen to diminish. Further apparent are smaller values of axial peak areas in benign tissue and malignant lesions compared to normal tissue (Table 5 refers). The reduction of the structural order of collagen within lesions and consequent decrease in intensity can be associated with the activity of collagenases, a group of collagen degrading enzymes32. Present study has observed benign lesions to be associated with a larger FWHM in the third-order peak than that for negative EMT lesions, also for normal tissue and even positive EMT at the 9th and 10th order (see Table 5), a result similar to that found by previous researchers16. Larger FWHM is expected on the grounds that greater variability is to be found in the fibril packing structure of benign lesions.

Table 5 Summary peak parameters for 16 classified breast tissues.
Figure 7
figure 7

Individual axial d-spacing (nm) points for the 16 groups of classified tissues, including non-lesional, benign, EMT negative and EMT positive. Note: the data points are offset vertically for clarity.

SAXS Analysis of 51 blinded breast tissues

In regard to differences in axial d-spacing between non-lesional and malignant breast tissues, as seen in the summary results of the 16 classified breast tissues investigation (Fig. 7), this is associated with the disruption of collagen fibrils structure. Note that collagen is a long-range periodic structure, repeated along the fibril axis, containing molecules packed laterally to each other with hydrogen bonds, binding to each other longitudinally in a staggered arrangement. Tissue remodeling is an essential step during the malignant process, transformation of epithelial cells being associated with metalloproteinases in collagen degradation by synthesis of fibrillary and non-fibrillary matrix proteins33. Extensive alteration of the ECM has been observed, collagen derangement being attributed to enzymatic degradation and altered neosynthesis. It is evident that at the supramolecular level changes in collagen in tissue invaded by cancer can be described quantitatively from the SAXS patterns, paving the way for tissue classification and improved understanding of mechanisms of cancer growth.

In this section, using a similar experimental SAXS setup to that above, the study has been repeated for 51 blinded breast tissues (FFPE (dewaxed)), seeking to distinguish differences between the tissues groups, non-lesional and malignant (including EMT negative and EMT positive breast tissue). Biological tissue SAXS studies3,17,25,34 often place emphasis on odd-ordered Bragg peaks in obtaining the periodicity within the collagen present in the sample. It is apparent that the odd peaks are larger than the neighboring even peaks, revealing a qualitative feature of the structure. The nth order Bragg peak derives from the nth Fourier component of the electron density unit cell35, contributing to the odd cosine waves with feature that the amplitude is of the opposite sign at the centre of the unit interval than it is at the edges. This is a known feature of the structure of long-chain hydrocarbons exist in extracellular matrix (ECM).

Highly-ordered healthy collagen exhibit sharp and prominent peaks from the 3rd to the 10th order; conversely, axial ordering gradually disappears in invaded regions, most particularly for EMT positive tissues, as in Fig. 8, demonstrating greater degradation in diseased tissues than surrounding healthier tissue. Variations in axial periodicity result from a change in crimping angle36, also if the normal type I collagen molecule is replaced by the trimer in which the three chains are identical8 and high concentration of type III is ruled out36. It has been suggested that the contact of epithelial cells with collagen I contributes to increased cell motility accompanying EMT, critical in disease progression37. For invaded regions the scattering intensity has been observed to be significantly greater, the triglyceride peak also being broader, EMT positive breast cancer tissue in particular where collagen fibrils can suffer “peeling off”, corresponding to large increase in the specific surface area per unit volume of scatterers, agreeing with the findings of Fernandez et al.25.

Figure 8
figure 8

Wide scattering profile of blinded samples, including non-lesional, malignant; EMT negative and EMT positive breast tissues.

Fibrils orientation was determined from the maximum intensity of the collagen peaks; the calculated axial peak area and FWHM are detailed in Table 6. Small but systematic differences are seen in axial d-spacing, the period of collagen being slightly greater in fibrils surrounding invasive carcinoma cells than in non-lesional areas, interpreted as an increase in axial periodicity with malignancy. It has also been reported38 that breast tumours display aberrant collagen bundle organization in the tumour stroma. The FWHM and peak amplitudes show significant differences between diseased and non-lesional breast tissues. In the case of collagen, the FWHM is related to the distribution of axial d-spacings in the fibrillar collagen and the amount of variability in the distribution of particle sizes in the sample. The FWHM of the investigated samples show a general trend of decreasing FWHM with severity of disease, in particular EMT positive. A larger width would depict more variability in the distance between the collagen fibrils compared to a peak with a smaller width. Here, it is important to keep in mind that the peak area is the incorporation of the peak width and peak amplitude. The axial peak area describes the extent of the long-range order in the packing structure of the collagen, arising from the fact that x-ray scattering involves constructive interference, whereby the larger the area, the greater the degree of long-range order of the samples. In Table 6, it can be observed that the axial peak area decreases with malignancy. The reduction of the peak area is possibly caused by the action of collagen-degrading enzymes known as collagenases as found in other work32. In the case of a greater degree of ordering (i.e. non-lesional tissue) this will be portrayed as a combination of broader width and greater amplitude, thus of larger axial peak area, vice-versa in the less ordered malignant tissue. It should be noted, as stated in Table 1, that peak area depends on the abundance of material within sample, thickness and composition in particular. The dissected-out samples are of nominal dimensions 5 mm (length) × 5 mm (breadth) × 1 mm (thickness), assumed to be sufficiently equal across samples to allow use of the peak area as parameter for quantitative comparisons.

Table 6 SAXS peak parameters for the 51 blinded individual breast tissue cases, each in FFPE (dewaxed) condition. Pathology reporting was performed after conduct of the experiment.

While 5th, 9th and 10th axial order are present in both EMT negative and EMT positive, an observed effect of degradation in the poorly ordered collagen of EMT positive is reduction of FWHM and axial peak area (Table 6). Comparison can also be made with the earlier work on the classified 16 tissues, the d-spacing being within the same range. Special care needs to be taken to include all key parameters, indicators arising from change in collagen fibril packing (axial peak d-spacing) and the amount of collagen in the tissue (integrated intensities and area), correlating with morphological features and cell types of the tissues. For instance, while EMT-17, EMT-18 and EMT-31 produce typical scattering spectra for non-lesional tissue, the axial area of EMT-17 for all d-spacing orders are much less than non-lesional and EMT negative tissues; additionally, the FWHM of all d-spacing orders for EMT-18 are much less than non-lesional tissue while EMT-31 shows greater d-spacing than non-lesional tissue. Accordingly, EMT-17 is classified as EMT positive while EMT-18 and EMT-31are classified as EMT negative. As with other studies, present studies find collagen fibril d-spacing, axial peak amplitude and the area of the lateral peaks equatorial peak features to be distinguishing parameters.

Table 7 summarises the peak parameters of the several tissue types investigated. As expected the FWHM of triglyceride peak is greater in EMT positive tissues compared to non-lesional tissues. Of interest are triglycerides, fat being a major source of energy for growth and membrane synthesis in solid tumours39,40, the proportions of fat and fibrous tissue present in breast being associated with enhanced cell migration and aggressiveness, characteristic of metastatic cells, suggestive of activation of de novo lipogenesis in EMT induced cells41.

Table 7 Summary peak parameters for the 51 blinded breast tissues, each in FFPE (dewaxed) condition.

Lewis et al.5 acquired SAXS images by averaging over the whole tissue sample while SAXS images acquired by Fernandez et al.25. resulted from the selecting of several points on the tissue sample (1–0.25 mm intervals), each image considered a discrete observation relative to the sample pathology. The axial d-spacing of collagen in benign and non-lesional breast tissues reported by Fernandez et al.25. are 65.3 ± 0.2 nm and 65.0 ± 0.1 nm, respectively. On the other hand, the d-spacing of collagen found by Lewis et al.5 for invasive carcinoma and normal tissues is within the range 64.7 nm–65.2 nm, whereas a smaller d-spacing was obtained for benign tissues between 64.4 and 64.7 nm. The present study found the d-spacing in invasive EMT positive, at 64.5 ± 0.2 nm, to be larger than in histopathologically invasive EMT negative at 63.8 ± 0.1 nm and non-lesional d = 63.6 ± 0.1 nm tissues, as illustrated in Fig. 9 (the error bars in the graph represent the standard deviation). It should be noted that the formalin fixation samples of present investigation are found to change and reduce the absolute d-spacing of the collagen fibril in comparison to previous research, although it does not alter the relative differences between non-lesional and malignant tissues, being of great value since the transport and restoration of samples is greatly facilitated by formalin fixation. The larger standard deviation of the d-value in invaded regions (EMT positive) can be explained by the weak Bragg reflections and higher background.

Figure 9
figure 9

Mean axial d-spacing (nm) for the 51 blinded groups of tissues; non-lesional and malignant; EMT negative and EMT positive. The number of samples are shown in the brackets. Note: error bars represent standard deviation. Note: the data points are offset vertically for clarity.

Kruskal-Wallis test has been performed to examine if the four types of tissues; non-lesional, benign, EMT negative and EMT positive differed significantly by considering d-spacing, axial area, FWHM, FWHM triglyceride and axial area triglyceride (see Table 8). It has been found that differences exist between the four tissues for d-spacing, axial area and FWHM. Dunn’s post hoc test was subsequently performed to identify the pairs of tissues which differed significantly for all five measurements considered (see Table 9). The p-values for axial area triglyceride and FWHM triglyceride showed insignificant differences for all four tissues. The EMT positive – non-lesional pair showed significant differences for d-spacing, axial area and FWHM while EMT negative – EMT positive appeared to be significantly different for axial area.

Table 8 Kruskal-Wallis test for differences in non-lesional, benign, EMT positive and EMT negative tissues according to measurements.
Table 9 Dunn’s post hoc test to assess the significant differences in measurements between non-lesional, benign, EMT negative and EMT positive tissues.

In present work the healthy and unhealthy tissues from the same patient were considered as independent. The Cook’s distance was calculated and two outliers were detected, belonging to patients 10 and 30 – both with non-lesional tissues in the FFPE (dewaxed) condition. Data from patients 10 and 30 were omitted before further analysis was performed. General trends in the data from measurements on the tissues were studied via principal component analysis (PCA) (See Fig. 10) and the methods used, as described below:

  1. 1)

    Non-lesional - FFPE (dewaxed) (N = 9); Formalin fixed (N = 2)

  2. 2)

    Benign - Formalin fixed (N = 2)

  3. 3)

    EMT negative - FFPE (dewaxed) (N = 8); Formalin fixed (N = 2)

  4. 4)

    EMT positive - FFPE (dewaxed) (N = 9); Formalin fixed (N = 1)

Figure 10
figure 10

PCA plot of all studied samples; non-lesional, benign, EMT negative and EMT positive tissues.

Figure 10 indicates the presence of two clusters along the PC1 axis, one in the direction of negative PC1 (EMT positive) and another in the opposite direction (non-lesional). Two observations with non-lesional tissues belonging to patients 14 and 26 are seen in the EMT positive cluster. The PCA plot explains 60.2% of the total variation where 35.0% of the total variation is described by axis 1 while 25.2% is described by axis 2. An extreme observation belonging to patient 31 (EMT negative) whose D-spaceB measurement failed to show peak readings is seen in the PCA plot. Patient 24 (non-lesional) also appeared to be an outlier due to high readings of axial peak area and FWHM. These two observations were retained in the data as they did not affect the results of the Kruskal-Wallis test and the Dunn post hoc test.

Conclusion

It is shown that the SAXS technique is able to produce rapid high-quality interpretable diffraction (structural) data from breast excision specimens, employed herein to seek differentiation between normal, benign and malignant breast tissues, the latter consisting of EMT negative and EMT positive cancers. From the Kruskal-Wallis and Dunn post hoc tests, we have found that the difference between the d-spacing of the EMT positive lesion and non-lesional tissue is highly significant (p < 0.008). The respective d-spacings for the various tissues are: for EMT positive lesion, in the range 64.5–64.7 nm; for EMT negative lesions, FFPE (dewaxed) and formalin fixed, 63.8 nm and 64.3 nm, respectively; for benign lesions 64.0 nm; for non-lesional tissues FFPE (dewaxed) 63.7 nm and; for non-lesional tissue (formalin-fixed) in the range 63.4–63.6 nm. For EMT positive breast tissue, the disturbed collagen structure is most notable in the 9th and 10th order. Thus said, present study has found that axial d-spacing on its own does not offer the sufficiency needed in determining early breast disease. Instead, other features, such as, the amplitude of the third, fifth, nine and tenth-order axial Bragg peaks are needed, together with the magnitude of the integrated intensity and the full-width at half-maximum of the fat peak, offering significant differences between tissue types. As such, the presence of disease is thought to be best represented by a combination of factors, rather than any single specific trait. Of particular note in regard to formalin fixation of samples is that no alteration is observed to occur in the relative differences in collagen d-spacing between non-lesional and malignant tissues. This is a matter of great importance given that preserved-sample and also retrospective study of samples is greatly facilitated by formalin fixation. Present findings indicate that molecular structure characteristics of breast tissue, obtained at the SAXS (supramolecular) level, can be used as markers of disease progression, SAXS analysis offering potential as a viable diagnostic procedure in support of breast clinics.