Introduction

Bone derives its unique resistance to fracture, also known as toughness, from its hierarchical structure spanning multiple length scales. This hierarchical structure includes bone’s basic building blocks, primarily comprising collagen and mineral phases at the nanoscale, and osteonal structures (cylindrical features of concentric lamellae layers around a central Haversian canal) at the microscale1 (Fig. 1a). Features at every length scale in the complex, composite structure of bone contribute to its characteristic high toughness and light weight2,3.

Fig. 1: Overview of bone’s hierarchical structure and bone’s toughening mechanisms.
figure 1

a Bone’s hierarchical structure shows features ranging from the macroscale to the nanoscale. b Bone’s extrinsic toughening mechanisms are illustrated using a 3D SRμCT image acquired during a toughness test. Extrinsic toughness mechanisms occur in the wake of the crack (green) and are attributed to microscale features in the bone. c Bone’s intrinsic toughening mechanisms, which are attributed to nanoscale features, occur ahead of the crack and are primarily attributed to aspects of collagen structure and composition. Fibrillar sliding and uncoiling of the collagen structure are mechanisms that contribute to intrinsic toughness in bone. There are additional mechanisms that impact bone’s intrinsic toughness, such as collagen cross-linking and sacrificial bonds. Collagen cross-linking at the nanoscale is known to influence mechanical properties34. Sacrificial bonds form between collagen fibrils creating a glue-like layer that contributes to bone’s resistance to fracture32,36. Figure adapted from Zimmermann et al.115, with permission from The Proc. Natl. Acad. Sci. USA.

Of the many factors influencing toughness in bone, only changes in mineral content of the bone matrix are traditionally used to assess and predict fracture risk. While new, encouraging methods and aids for fracture risk assessment are being used in clinics such as trabecular bone score, bone material strength index, and computer-based algorithms (e.g., FRAX), the primary current clinical indicator of fracture risk is bone mineral density4,5,6. However, there is evidence that the mineral components mostly confer bone’s elastic properties and strength7,8 whereas collagen components predominantly contribute to bone’s post-yield properties and failure energy9,10,11. Degraded and denatured collagen is prevalent in various conditions12,13, including age11,14, cancer15, osteogenesis imperfecta16,17,18,19,20, osteoarthritis21, osteoporosis22,23,24, diabetes25,26, and mechanical damage27. In these conditions, degraded collagen molecules usually result in a local unfolding or unwinding of the collagen triple helix. This unfolding might be caused by the breaking of a small number of consecutive hydrogen bonds28,29,30. Collagen damage increases the risk of fracture in these populations and reduces bone’s resistance to crack formation and crack growth14. Therefore, the role of factors beyond bone mineral density, such as collagen damage, should be considered when studying fracture in bone.

Fracture resistance in bone originates from both intrinsic and extrinsic toughening mechanisms31,32. Intrinsic (material) mechanisms act at the nanoscale and are known to impede crack initiation and growth in the plastic zone located around the crack tip31 (Fig. 1c). This material plasticity around the crack tip is conferred by the collagen. Collagen’s influence on bone’s mechanical properties is influenced by many factors such as both enzymatic and non-enzymatic cross-linking33,34,35 and molecular level bonds that depend on calcium ion content36. In contrast, extrinsic mechanisms act at length scales of tens to hundreds of micrometers in the wake of the crack37 (Fig. 1b). Extrinsic toughening mechanisms use microstructural features to dissipate energy via crack deflection38, crack bridging, and microdamage accumulation39. The most dominant extrinsic toughening mechanism is crack deflection in the transverse direction37,40. Crack deflection is controlled by the direction of applied stresses driving crack growth and the anisotropy in bone’s microstructure (e.g., highly mineralized cement line at the border of osteons in bone)41,42,43. In summary, intrinsic toughness absorbs energy via material properties and plastic deformation of collagen while extrinsic toughness dissipates energy primarily via crack deflection around microstructural features.

Extrinsic toughening has been shown to dissipate more energy than intrinsic toughening40. Therefore, changes in bone structure, such as canal diameter and density, would be the predominant contributor to toughness. However, this raises the questions of (1) whether bone’s toughening mechanisms act independently or cooperatively; and (2) whether changes in material properties at the nanoscale, such as collagen damage, affect extrinsic toughening mechanisms at the microscale? Little is known about the contribution of collagen damage, present in many bone fragility diseases, to extrinsic toughness.

One advantageous modality for examining extrinsic toughening mechanisms in bone at the microscale is synchrotron radiation micro-computed tomography (SRμCT). This technique uses high-flux x-rays to reconstruct an image stack corresponding to a scanned sample’s three-dimensional (3D) structure and material composition44,45. Previous studies have shown that SRμCT is able to reveal the crack path of a fractured bone sample41,46,47. With short acquisition times and high-resolution imaging, dynamic SRμCT is well suited for in situ mechanical testing to determine how microstructure resists crack growth under realistic loading and failure conditions.

Deformation in bone during loading and failure conditions can be characterized in near real-time using digital volume correlation (DVC) with SRμCT20,48,49. Presently, DVC has been more widely explored in trabecular bone, or porous bone, due to the unique, varied structure of the trabeculae50,51,52,53. In cortical bone, DVC can instead be performed by tracking the speckle-like osteocyte lacunae between different SRμCT scans to estimate strain fields. Combining in situ mechanical testing in cortical bone with SRμCT and DVC to obtain datasets of crack propagation with near real-time strain states would further elucidate crack growth mechanisms at the microscale.

However, in situ SRμCT mechanical testing poses a fundamental challenge: overirradiation of bone during high-resolution SRμCT imaging. Radiation impairs bone’s mechanical properties when exposures reach a cumulative dosage of 35 kGy and above54,55,56. Acquiring lower-quality scans using a reduced radiation dose can aid in maintaining mechanical properties53. The preservation of mechanical properties with low-dose scans comes at the cost of a reduced signal-to-noise ratio56. In turn, features in these low-dose images cannot successfully be segmented, eliminating the possibility of performing DVC. Our solution to this issue is leveraging deep learning to process low-quality images prior to DVC.

In recent years, deep learning has shown promise in noise removal and enhancing the signal of images57,58,59. Specifically, convolutional neural networks (CNNs) have had success in denoising and segmenting images in the biomedical engineering field60. Performing these tasks with CNNs provides an unprecedented opportunity to enable SRμCT imaging with lower radiation and accelerate image analysis. To our knowledge, the use of CNNs in conjunction with low-dose SRμCT imaging to enable damage-mitigated DVC has not previously been performed.

This work aims to determine the role of collagen deformation capacity on the fracture resistance of bovine cortical bone. Our hypothesis is that alteration of the collagen network, often associated with aging and fragility diseases, results in decreased crack deflection and reduced bone toughness. To test this hypothesis, we used a heat-treatment model since the rate of local unfolding of collagen and cleavage at the unfolded sites rapidly is known to increase with temperature16 in a similar way to unfolding progression with pathological conditions14. We examined changes in whole-bone fracture resistance, microscale deformation, and crack path for different levels of collagen denaturation. To achieve this aim, we have developed the promising technique of in situ SRμCT testing, which combines low-dose synchrotron imaging of cortical bone with CNNs for near real-time insight into crack evolution during mechanical testing. We discovered that damaging the collagen through heat treatment decreased macroscale toughness and post-yield mechanical properties in the transverse direction of bovine bone. Decreases in macroscale toughness due to collagen damage were evident in reduced amounts of crack deflection. These deflections tended to be informed by the internal strain state estimated using DVC. This work concludes that collagen damage, induced via heat treatment, at the nanoscale profoundly affects both microscale crack deflection and macroscale mechanical properties.

Results

Collagen damage and fragility induced through heat treatment

A set of bovine bone samples (N = 8–11/group) were dedicated for use in a biochemical assay to quantify collagen denaturation via local unfolding and the associated decrease in flexural strength. Since the rate of local unfolding increases with temperature16, heat denaturation is a good model to induce a similar local triple helix unfolding to pathologic conditions. Samples were heat-treated for two hours at temperatures in the range known to denature mineralized collagen (100 °C to 190 °C). Heat treatment can either be reversible when the collagen molecule regains its original structure at normal temperature, or permanent with irreversible unfolding or damage of triple helical structure into an uncoiled structure28.

Samples heat treated above 130 °C sustained permanent collagen damage while samples treated at, or below, 130 °C were significantly less damaged. Denatured collagen chains, which are non-helical and unfolded, are susceptible to cleavage by proteases, including trypsins61,62,63, whereas intact triple helix collagen chains are not15. Therefore, we used a trypsin assay to digest and quantify the amount of damaged collagen (DC) combined with hydroxyproline assay to quantify the amount of undamaged collagen (UC) to calculate the percentage of DC in each bone sample64. Minimal collagen damage (≤3%) was present in samples at or below 130 °C, whereas considerably more damage was detected in samples treated at 160 °C or above (Fig. 2a). These results confirm that collagen damage onset occurs between 130 °C and 160 °C. As such, bones treated at 130 °C and below were considered in this study as UC groups and bones treated at 160 °C and above can be considered as DC groups.

Fig. 2: Collagen damage is evident in results from biochemical, fracture toughness, and mechanical tests.
figure 2

Based on the trypsin-hydroxyproline assay, two general categories of collagen damage were defined. The two groups were undamaged collagen (UC), including samples from RT, 100 °C, and 130 °C, and damaged collagen (DC), including the 160 °C and 190 °C samples. All bar-chart data are represented by mean ± standard deviation. a The trypsin-hydroxyproline assay (N = 8–11/group), which measures the percent of damaged collagen, shows an increase in collagen damage with increases in heat-treatment temperature. Very little collagen damage was present in RT, 100 °C, and 130 °C groups. The 160 °C and 190 °C groups possessed much more damage than the lower-temperature groups. A Tukey multiple comparison tests showed that all groups were significantly different from one another. b The crack-resistance curves (R-curves) from the in situ SRμCT toughness test (N = 2/group) shows a marked decrease in toughness from the UC groups to the DC groups. c Flexural strength tests (N = 6–8/group) were performed to obtain ultimate strain, work to fracture, and Young’s modulus. The ultimate strain was reduced from the UC groups to the DC groups. A Tukey test revealed that all temperatures were significantly different from each other except the RT and the 130 °C. d Work-to-fracture was greatly reduced from the UC groups to the DC groups. Tukey test showed that all groups were significantly different from each other except the RT and 100 °C, and the 130 °C and 160 °C. e Young’s modulus is largely unaffected by heat treatment; there was no significant difference between the UC and DC groups. A Tukey test revealed that the groups that were significantly different from each other include the RT and 100 °C, and the RT and 130 °C.

We assessed the effect of heat treatment on bovine bone’s mechanical properties with a flexural strength test using a set of bovine bone samples (N = 6–8/group) dedicated for mechanical testing divided into the same heat-treatment groups (Fig. 2c–e). For ultimate strain and work to fracture, the 160 °C and 190 °C groups were significantly lower compared to all other groups. We observed a significant reduction in ultimate strain and work to fracture in the DC groups (p < 0.001); however, no change was observed in Young’s Modulus (p = 0.889) between the UC and DC groups (Fig. 2c–e). The exact values, standard deviations, and statistical significance for each group are shown in Supplementary Table 1. These findings indicate that collagen damage mostly affects post-yield properties. Results from the flexural test further confirm that significant collagen damage occurs in temperatures between 130 °C and 160 °C.

High-quality SRμCT imaging achieved with a low-radiation dose using deep learning

The repeated imaging involved with fracture analysis necessitated acquiring images at lower doses of radiation to preserve bone’s mechanical integrity. Low-dose radiation imaging produced noisy images that would not be feasible to use in further analysis. The quality of these images was improved with deep learning using a mixed-scale dense CNN65. The CNN accepted low-quality images as input and used high-quality images as the target (Fig. 3a, b). The trained CNN was able to greatly reduce the difference between the low- and high-quality images, resulting in improved image quality (Fig. 3g).

Fig. 3: The CNN successfully denoised and segmented SRμCT images acquired with low radiation.
figure 3

ac Example of images related to the regression, or denoising, functionality of the CNN. Left to right these images include a the low-quality image used as input to the CNN, b the corresponding target image that is used during training, and c the denoised output from the CNN. Each image has its corresponding histogram included in the top right corner. df Example images related to the segmentation capabilities of the CNN. Left to right these images include d the denoised image used as input for segmentation, e the manual segmented image referenced during training, and f the segmentation performed by the CNN. g MSE was used to assess how well the CNN removed noise from the input image. In both the validation and the test datasets the CNN output was much more similar to the target than the input low-quality image. h Segmentation performance was evaluated with the Dice coefficient. This coefficient ranges from 0 to 1 where 1 is perfect segmentation. The network achieved high Dice coefficients for both the validation and the test datasets. All bar-chart data are represented by mean ± standard deviation.

Images denoised with the CNN increased in similarity to the high-quality images. Mean-squared error (MSE) was used to measure how different images were from the high-quality images, and peak signal-to-noise ratio (PSNR) was used to measure how similar images were to the high-quality images. The CNN denoising decreased the low-dose image’s MSE by 85% (Fig. 3) and increased the low-dose image’s PSNR by 62% on the holdout test dataset (Supplementary Figure 1). Reviewing the images qualitatively, we observed that images processed with the CNN had increased contrast between the bone and the background compared to the low-quality and the high-quality images. The corresponding image grayscale intensity histograms further illustrate this observation. The histograms for the low- and high-quality images both have a single peak, while the histograms for images processed with the CNN have two distinct peaks, one small peak for the background and one larger peak for the bone (Fig. 3a–c). Having the voxels distributed into these two peaks allows for more precise and consistent image segmentation. The denoised CNN output appears to have a higher signal-to-noise ratio than the high-dose target images used during training. This behavior could be caused by noise in the high-dose target images being uncorrelated to the noise in the low-dose input images, which is similar to the explanation for the denoising performance of self-supervised deep-learning methods58,66.

We made further use of CNN to expedite image analysis with semantic segmentation. Segmenting the crack path from the background and other features in the scan is a necessary process for analyzing the crack properties. This process was accelerated using CNN. The network accepted denoised images as input (Fig. 3d) and used a manual segmentation of the crack as a target image (Fig. 3e). The CNNs performance was evaluated by calculating the Dice similarity coefficient67. The Dice coefficient indicates how well the segmentation corresponds to the ground truth, where 0.0 is a completely incorrect segmentation and 1.0 is a perfect segmentation. The CNN performed the segmentation task well with the mean Dice coefficients for the network’s segmentation being close to 1.0 (0.96 validation, 0.93 test) (Fig. 3h). Qualitative inspection of the segmentation also indicated that the network was performing the task appropriately (Fig. 3d–f). An example of crack segmentation for each step in toughness testing is shown in Supplementary Movie 1 for a 160 °C sample.

Fracture toughness is impaired by collagen damage

Fracture toughness was evaluated using a set of bovine bone samples (N = 2/group) dedicated to in situ SRμCT toughness testing. Crack-resistance curves, or R-curves, are reported in terms of crack-driving stress-intensity factor, K, with respect to stable crack extension, Δa. R-curve data revealed that crack initiation toughness is markedly reduced in the DC groups compared to the UC groups, with DC groups having between two to three times lower crack initiation toughness than UC groups (1.39 MPa  m1/2 and 3.15 MPa  m1/2, respectively, see Supplementary Table 1). Crack-growth toughness (slope of the R-curve) of bone samples from DC groups was reduced to one-third the value of UC groups (Fig. 2b). The toughness of UC groups at Δa = 0.4 mm is almost triple its crack initiation toughness value (2.5–7 MPa  m1/2), whereas it is only double in DC groups (1–2 MPa  m1/2). The R-curve behavior and trypsin-hydroxyproline assay results exhibited here imply that heat treatment beyond 130 °C reduces macroscale fracture toughness in the transverse direction of bone through collagen damage.

Crack deflection is impaired in collagen-damaged bone via heat treatment

Through denoising and segmentation of low-radiation-dose images using the CNN, we examined microscale crack-growth mechanisms with the in situ SRμCT image data. Crack deflection reduction was observed via SRμCT images (Fig. 4a, b) in DC group with reduced R-curve toughness behavior. Crack morphology was also quantified with several parameters. Crack deflection density (Df.Dn), i.e., the number of large crack deflections per millimeter of crack extension, was significantly higher in the UC groups than in the DC groups (Fig. 4c). A concurrent significant increase in average deflection angle (Df.An) was also observed in UC groups compared to the DC groups (Fig. 4d). Through Df.Dn and Df.An, the extrinsic toughness mechanism of crack deflection is unmistakably impaired with heat treatment beyond 130 °C. The reduced presence of crack deflection in the DC groups is accompanied by a decrease in overall toughness, exemplified by the R-curve data (Fig. 2b), reinforcing the importance of crack deflection as a salient toughening mechanism in bone.

Fig. 4: The amount of crack deflection was reduced due to damaged collagen.
figure 4

a, b Segmentation of bone samples after toughness testing, with canals colored in yellow and the crack path colored in green for both 100 °C and 160 °C samples. The red arrow indicates a large crack deflection in a 100 °C sample. c Crack deflection density, number of crack deflections per length of crack extension (Df.Dn), was reduced in the DC groups compared to the UC groups. d The average angle of crack deflection (Df.An) shows that UC groups exhibit more salient crack deflection than DC groups. In graphs, c, d the bars for each temperature represent the mean and each data point is shown as a black dot (N = 1–2/group). The bars for the UC and DC groups show the mean and standard deviation (N = 3–5/group).

Internal strain state dictates the evolution of crack propagation

Observing the importance of crack deflection led to using our in situ toughness testing technique to determine how internal strain influences crack deflection during a loading scenario. We performed DVC by tracking the displacement of osteocyte lacunae on SRμCT images acquired at various steps during the three-point bend fracture toughness test, revealing bone’s internal strain state for varying levels of heat treatment. Although all groups had two characteristic lobes of high-strain concentration near the tip of the notch, the strain extended laterally into the bone sample, further away from the crack, in the UC samples (Fig. 5b) compared to the DC samples (Fig. 5h).

Fig. 5: The internal strain state, revealed by DVC, informs the trajectory of crack propagation.
figure 5

This is visualized by overlaying the strain (ϵxx) on images of the notch and crack. Left to right, all images show an image of crack initiation, the image of crack initiation with the corresponding strain state overlaid, and the strain state at crack initiation overlaid on the image of the next step of crack propagation. In the 3D images the notch is colored in gray and the crack is colored in green. ac Examples of 2D DVC results for a room temperature sample. df Corresponding examples of 3D DVC results for a room temperature sample. gi Examples of 2D DVC results for a 160 °C sample. jl Corresponding examples of 3D DVC results for a 160 °C sample.

Visualizing the internal strain of a bone sample during a fracture toughness test after crack initiation is beneficial alone, and overlaying this strained state into context with the next step in crack propagation provides additional insight into fracture mechanisms. We visualized the relationship between strain state and fracture path by overlaying the strain state calculated at crack initiation with the following step of crack propagation (Fig. 5c, f, i, l). We observed that the crack propagated through regions of high strain in each image slice, and when rendered in 3D. Because in the UC samples the internal strain was high in regions further from the initial crack, more deflections towards these regions occurred, resulting in a tortuous crack path (Fig. 5c, f). In contrast, the crack path in the DC samples tended to propagate directly in the direction of mode I fracture, or fracture in the opening direction of the crack, following its corresponding prior strain state.

Discussion

Bone fragility is a complex pathology caused by a variety of conditions. Traditionally, fractures were attributed to a reduction in bone mineral density; however, this metric describes less than half of bone fractures68,69. This discrepancy illustrates one of the reasons an increased emphasis has been placed on understanding bone quality in both research and clinical settings70,71. The goal of this study is to evaluate the unique contribution of collagen damage to bone quality, specifically on bone’s properties that resist fracture. For this purpose, we used heat-treatment to model bone fragility in bovine cortical bone based on different levels of collagen molecular damage associated with molecular unfolding at different temperatures (Room Temperature, 100 °C, 130 °C, 160 °C, and 190 °C)72,73,74. This model was chosen to identify collagen damage’s independent contribution to bone’s resistance to fracture. To capture the evolution of microcracks in bone during mechanical loading, we developed an in situ SRμCT toughness testing technique for bone and other biological tissues. This research is, to our knowledge, the first dynamic experimental analysis quantifying toughening mechanisms, specifically crack deflection, and deformation evolution within bone samples with different levels of fragility during mechanical testing. We leveraged a CNN for denoising and segmentation to work in tandem with low-radiation dose imaging to mitigate the destructive effects of overirradiation due to high-flux synchrotron x-rays. This technique revealed that unfolding and damage of collagen at the nanoscale affect the predominant toughness mechanisms at the microscale (i.e., crack deflection), reducing the macroscale toughness and bone quality. Understanding the contribution of collagen damage provides insight into the origins of bone fragility and collagen quality.

In this work, we found using a trypsin-hydroxyproline assay that a large increase in the amount of (mineralized) collagen damage in bone was observed in the 160 °C and 190 °C groups (Fig. 2). These findings are consistent with the literature, as denaturation of mineralized collagen in cortical bone has been shown at temperatures between 150 °C and 160 °C72,74,74,75,76,77 whereas non-mineralized collagen in soft tissues denatures through unfolding around 60 °C. Wang and colleagues report the start of collagen damage occurring at temperatures of 120 °C also using trypsin-hydroxyproline assay; however, the largest increase in DC was found at 160 °C in their work78. Heat treatments below 200 °C do not alter the composition and the crystal structure of the mineral phase78,79,80. X-ray diffraction has shown that the inorganic phase of bone is not significantly altered at temperatures ranging from 100 °C to 200 °C72,81,82. Therefore, the heat treatment groups only induce collagen network damage without altering the mineral phase which confers bone’s elastic properties. This was confirmed by the fact that Young’s modulus did not change significantly between room temperature and 190 °C in the present work and in the work by Wang and colleagues78. Although the inorganic phase of bone is likely unaffected by heat treatment, heat treatment also dehydrates bone tissue. Dehydration is known to reduce bone’s mechanical properties when not rehydrated83,84. Granke et al.84 find that water eliminated at temperatures below 200 °C is the loosely- to tightly bound water in collagen. Additionally, rehydration of bovine bone in a saline solution that has been dehydrated in alcohol seems to restore its mechanical properties85. Therefore, rehydration of bone samples at least 12 hours before testing preserves the majority of bone’s mechanical properties.

Our results indicate that collagen damage within bone decreases bone’s material properties at the macroscale. This decrease in mechanical properties was shown by significant reductions in fracture toughness, ultimate strain, and work to fracture in groups that had substantial collagen damage. For example, in situ SRμCT toughness testing on notched samples with different levels of collagen damage showed a 50–70% decrease in crack initiation and crack growth in bone’s with >3% DC. For bone in the UC groups (<3% DC), our R-curve toughness values are consistent with the ones found in the literature for toughness testing in bovine bone. Crack initiation toughness was observed at 3.2 MPa  m1/2 in a study by Sakakura and Pezzotti, very similar to values in the RT group; however, the crack growth toughness (R-curve slope) in our study was notably higher than the slope in the Sakakura and Pezzotti study86. Similarly, work by Lucksanasombool et al. reports crack initiation toughness at 3.48 MPa  m1/285. Other studies report crack initiation toughness in the range of 2.4–5.2 MPa  m1/2 in the transverse direction, falling in the range of values observed in the present study87,88. Crack initiation and crack growth toughness observed in our study are comparable to past studies using bovine bone in the transverse direction.

We observed that collagen damage induced by heat treatment not only reduced the bone’s resistance to crack initiation but also the salience of crack deflection during crack growth. Reduction in bone’s resistance to crack initiation due to collagen damage is an expected intrinsic toughening effect since heat treatment impairs bone’s postyield, or plastic, properties. Osteonal microcracking also influences fracture in bone prior to crack initiation89; however, once the crack has been initiated, bone’s resistance to crack growth is expected to be largely unaffected by changes to plastic properties because bone’s predominant toughening mechanism during crack growth in the transverse direction is crack deflection32,37,39. In the transverse direction, cement lines around osteons act as surfaces of least resistance for crack growth, causing a propagating crack to suddenly change direction, dissipating energy during crack deflection. We observed that collagen damage induced by heat treatment resulted in a trend towards less frequent and less severe large crack deflections. Increased collagen damage was associated with three times fewer crack deflections and a 15° reduction in crack deflection angle. Past studies have attributed crack deflection solely to changes in micro- and macrostructure, such as increased osteonal density and orientation90. While these factors are integral to crack deflection and other extrinsic toughness mechanisms in bone, they don’t address material properties below the microscale.

Fantner et al.82 showed that heat-treatment-induced collagen damage drastically reduces bone’s ability to maintain fibrillar connections between fracture surfaces, or collagen fibril bridging, in trabecular bone. This past work helps to corroborate that heat treatment-induced collagen damage reduces bone’s capability to impair fracture at the microscale. The present work builds on previous work primarily by constructing an R-curve using in situ toughness tests, and by investigating the toughening mechanism of crack deflection, as opposed to collagen fibril bridging. The measures of Df.Dn and Df.An (Fig. 4) indicates a drop in the saliency of crack deflection with heat treatment above 130 °C. Fantner et. al also show a marked change crack growth behavior in heat-treated bone, but from the perspective of collagen fibril bridging; heat-treatment at 200 °C revealed no collagen fibril bridging compared to an untreated sample with many collagen fibrils spanning the crack. The reduced influence of collagen fibril bridging at 200 °C in trabecular bone found by Fantner et al. and the reduced salience of crack deflection in cortical bone samples heated at 190 °C in this work show that different toughening mechanisms are impaired in collagen-damaged trabecular and cortical bones. Willet et al.91 also observed that collagen plays a role in both intrinsic and extrinsic toughening mechanisms. Isolating collagen damage identified a unique effect of collagen intrinsic mechanisms on extrinsic toughening mechanisms. This effect indicates that intrinsic and extrinsic toughness may act cooperatively and that changes in collagen plasticity affect both intrinsic and extrinsic toughening mechanisms significantly.

By visualizing the strain distribution using DVC, we observed that the prior strain state informed the trajectory of crack propagation during the next step of loading. In the heat-treated samples, the strain state appeared more confined to the region around the crack tip compared to the groups without much collagen damage. This difference could be related to the reduction in crack deflection observed in the groups with collagen damage. The reduced crack deflection, or a smooth fracture profile, in the DC groups, is one of the indicators used by Zioupos et. al to categorize bone samples as brittle92. Recently, the application of DVC to reveal strains in trabecular bone has received much attention51,52,53. Conversely, little work has been published on applications of DVC in cortical bone93,94,95. In this work, we further develop the application DVC in bone by imaging at a resolution that allows for analysis of tissue level strains within cortical bone while still addressing the issue of overirradiation.

Our study has several limitations starting with low sample size. Because access to a synchrotron beamline is provided based on merit in 24-72 hour shifts, performing an experiment with five groups and acceptable sample size for statistical analysis is challenging; our sample size was achieved in a 48-hour beamtime shift. Future studies using fewer groups could achieve higher statistical power with this in situ toughness testing technique. To achieve a reasonable sample size for comparison, groups in this study were combined into UC and DC groups based on the trypsin-hydroxyproline assay results. Additionally, while heat treatment was deemed a simple and effective method of targeting the collagen structure to pilot the in situ toughness testing technique used here, the manner in which heat treatment denatures collagen is not a direct analog for the complex pathways that disease disrupts collagen. The heat treatment (at temperatures at and above 160 °C) not only unfold the collagen molecules but might also affect the bonding and cross-linking in the collagen environment and weaken the mineral-collagen interface72,78. By showing the capability of this in situ SRμCT testings here, future work can be focused on using this technique on bone tissue damaged by disease. It is also worth noting that cortical bone from adult bovine is a mix of Haversian and plexiform whereas cortical bone from adult humans is mostly Haversian96,97,98. We worked with the posterior femoral mid-diaphysis which has a Haversian-dominated microstructure with well-formed osteons. However, there is the possibility that regions of plexiform combined with Haversian bone could be present, as these intermixed regions have been observed in the work of others96,97. Finally, although DVC provided useful visualizations of the strain distribution within bone, it is possible that some of the strain values are exaggerated due to microcracks occurring below our image voxel size of 1.6 μm89. The accuracy of the strains calculated with DVC also remains to be validated. Despite these limitations, DVC provides unique insight into displacements and strains resulting from cortical bone mechanics, and shows promise in informing crack deflection and brittle fracture in bone.

The work performed here in cortical bone demonstrates how damage to collagen at the nanoscale adversely affects crack deflection at the microscale and reduces overall toughness in the transverse direction at the macroscale. These findings were only achieved through the development of an in situ SRμCT mechanical testing technique that combines low-dose imaging with deep learning denoising and segmentation. The valuable insight provided by this new technique can be applied to study pathologies known to impact bone toughness. This technique is especially useful in studying pathologies that impact bone quality while not affecting bone mass, such as diabetes. Our findings relating collagen damage via heat treatment directly to crack deflection show how bone’s most fundamental constituent at the nanoscale affects toughness at each subsequent length scale in the hierarchical structure of bone.

Methods

Sample preparation

All samples used in this study were machined from the mid-diaphysis of 18-month-old female bovine femurs. Whole bone was cut into four quadrant sections of ~40 mm long, under constant DI water irrigation. The posterior quadrant, presenting the most osteons, was kept for further cutting process. Cortical bone samples were cut with a low-speed diamond saw along the longitudinal axis of the femur. The posterior quadrant was fixed following the bone’s long axis, then bone samples were cut with precise parallel cuts. Samples were ground and polished under constant DI water irrigation into dimensions of 1 mm × 2 mm × 10 mm. Samples used in toughness testing were notched to a depth of 1 mm and a root radius 5 μm using a custom-made razor micro-notcher under irrigation with a 0.05 μm diamond suspension.

To induce collagen network denaturation without damaging the mineral phase of bone, processed samples were divided into heat treatment groups99. Collagen is a triple helix, which, when subjected to heat, denatures through unfolding. Non-mineralized collagen in soft tissues denatures through unfolding ~60 °C, and mineralized collagen in bone denatures between 100 °C and 190 °C but mostly around 150 °C72,74,76,77. Heat treatments below 200 °C do not alter the composition and the crystal structure of the mineral phase78,79,80. Heat treatment was applied to each sample using an oven for 2 hours in five groups (Room Temperature, 100 °C, 130 °C, 160 °C, 190 °C) to induce different levels of collagen denaturation. Following preparation, samples were submerged in Hank’s Balanced Salt Solution (HBSS) for at least 12 hours prior to mechanical testing. The rehydration step reverses bound water loss during heat treatment85,100, and serves as an important step for preserving mechanical properties.

Trypsin-hydroxyproline assay

The trypsin-hydroxyproline assay was used to determine the amount of collagen damage that resulted from heat treatment. Bone samples (N = 8–11/group) were demineralized in 10% ethylenediaminetetraacetic acid at pH 8 for 1 week as recommended by Kiviranta et al.101. The samples were digested by 500 μL trypsin (T0303 Sigma-Aldrich) for 24 hours. The supernatant containing the denatured collagen was transferred to new vials, and the remaining collagen was left in their original vials and filled with 500 μL of buffer solution. The remaining peptides in the solution were acid-hydrolyzed by 12N HCl and heated at 110 °C for 18 hours. Then, as reported by Brown, chloramine-T solution and p-DAB reagent were added to react with the hydroxyproline residues to provide a colorimetric reading64. Finally, hydroxyproline content was obtained using a microplate reader. Absorbance was read at 561 nm and converted to micrograms using a standard hydroxyproline kit (MAK008 Sigma-Aldrich). The amount of denatured collagen was measured from the supernatant vials, and the original vials contain the remaining collagen. The percent of denatured collagen is calculated by comparing the amount of denatured collagen to the total collagen content. The amount of denatured collagen was measured from the supernatant vials, and the amount of intact collagen was measured from the original vials. Then, the percent of denatured collagen was calculated by dividing the amount of denatured collagen over the total collagen content.

Flexural strength tests

To assess macroscale mechanical properties, bovine bone samples (N = 6–8/group) were tested for failure under a 3-point bending flexural strength test in accordance with ASTM D790102. Although standards for assessing the material properties of structural materials exist, there are few comprehensive guides for biological materials; as such, ASTM standards for appropriate materials are often adopted in bone research103. Flexural strength tests were performed with a span of 25.4 mm, a displacement rate of 0.05 mm/min, and data were recorded at 20 Hz with a lowpass filter cutoff of 100 Hz in a similar way to Acevedo et al.46. All bone samples were tested in the transverse orientation, perpendicular to the direction of osteons. To ensure sample hydration, drops of phosphate-buffered saline were poured on a gauze wrapped around each sample. Each test was performed under displacement control using a Psylotech testing device equipped with a 222 N load cell. A preload of 0.5 N was applied prior to testing. During each test, force and displacement data were recorded. All original force-displacement curves are shown in Supplementary Figure 2. Using these data and sample geometry, a strain-stress curve was plotted in MATLAB104 using equations 3 and 5 from the ASTM D790. From these curves, ultimate strain, work to fracture, and Young’s modulus was calculated.

In situ SRμCT imaging and toughness testing

Imaging and toughness testing was performed on bovine bone samples (N = 2/group) in a custom three-point bending chamber with a span of 6-mm at the Advanced Light Source beamline 8.3.2. - microtomography at Lawrence Berkeley National Laboratory to track crack propagation in near real-time. Prior to any toughness testing, a reference high- and low-quality scan was taken of each sample with 3937 and 657 projections respectively for training the CNN. An incident x-ray energy of 24 keV was selected with an exposure time of 100 ms. Image voxel size was 1.6 μm/voxel. A 1-mm Aluminum shield was used to limit the radiation dose in the bone sample. The radiation dose was calculated following a similar procedure to Barth et al.54 and was found to be 10 kGy for the reference high-quality scan and 1–2 kGy for each low-quality scan, keeping the total radiation dose underneath the detrimental limit of 35 kGy for the majority of crack propagation55. This indicates that 1 high-dose scan and 11 low-dose scans can be taken before the maximum recommended radiation dose of 35 kGy is exceeded54,55. Radiation dose calculations were performed in accordance with estimations performed by Barth et al.54. Their used ultraviolet resonance Raman spectroscopy to show that changes in plastic properties can be associated with collagen degradation. This is also supported by other studies on gamma radiation105,106,107.

The testing chamber allowed for sample immersion in HBSS bath during scanning and mechanical testing to ensure the bone tissue remained in physiological conditions. The three-point bending fracture toughness test was performed in accordance with ASTM E1820108 with constant displacement while simultaneously monitoring the force-displacement output of the load cell in the testing chamber. Bones were mechanically loaded in the transverse direction, normal to the direction of osteons. During the fracture toughness test, the force-displacement curve was observed to determine when crack growth occurred, shown in Supplementary Figure 3. Crack growth was expected when there was a drop in the force-displacement curve. Following a drop in the curve, the displacement was paused and a low-quality scan of the sample was acquired. Following the acquisition of the low-quality scan, the displacement resumed, and the test continued. This procedure of pausing the displacement and acquiring a low-quality scan was repeated for each suspected increment of crack growth until material failure. Each sample took approximately 2 hours to test, with time in between each sample to swap samples and prepare the next test. During each in situ toughness test, the time between steps of suspected crack propagation was ~7–9 minutes. Reconstruction of SRμCT data was performed with the Python open-source package Tomopy109 and visualization was performed in Dragonfly 2020.2 (Object Research Systems (ORS) Inc, Montreal, Canada) and ImageJ110.

R-curve toughness calculation

Data from the in situ SRμCT imaging and toughness test were used to calculate bones resistance to fracture at the various levels of heat treatment (N = 2/group). The procedure and equations used for these calculations can be found in ASTM E1820108. The change in crack length, Δa, was measured using SRμCT images for each crack growth step. Nonlinear fracture mechanics analysis was used to determine the J-based crack-resistance curves (J as a function of Δa). The applied load and Δa values were used to measure the nonlinear strain-energy release rate with elastic and plastic contributions (J = Jel + Jpl) and to calculate the stress-intensity factor (K) for each growth step108. From the K, the J elastic was calculated. We assumed a mode I (tensile opening) stress intensity, KI since mode II (shear) fracture toughness was shown to be <25% of mode I fracture toughness111. The load-displacement data were used to calculate the area under the force-displacement curve (Apl) at each step. Following the ASTM E1820, J plastic was calculated using Apl, the sample thickness, and the width of the remaining ligament. J total was calculated by adding the J elastic and the J plastic. The J total was converted back into the J-integral stress intensity (KJ) for nominally mode I fracture. KJ values were plotted against the Δa values for each group to generate R-curves. The Young’s modulus used in these calculations were obtained from preliminary flexural tests on similar samples (as described above). To ensure the validity of the method in plane strain conditions, the sample thickness B (1 mm in this study) was chosen to meet the condition \(B \; > \; 2.5{(K/{\sigma }_{Y})}^{2}\) which is ~0.25 mm (for an average initiation toughness K of 1 MPa  m1/2 and yield stress of σY of 100 MPa. In addition, the crack extension is only valid when it is greater than the plastic zone size, \({r}_{y}=1/6{(K/{\sigma }_{Y})}^{2}\) in plane strain conditions. In this case, the plastic zone size is ry = 0.01 mm. Calculations were performed in MATLAB104 and Microsoft Excel.

Mixed-scale dense CNN

Image denoising and segmentation were performed using a CNN. Denoising improved the quality of images acquired at low radiation, making them usable in further analysis. Segmentation was used to expedite portions of the analysis. The specific network architecture used was the mixed-scale dense CNN developed by Pelt et al.65. This network architecture is unique in its ability to capture multi-scale features through mixed-scale dilated convolutions. Here, we present, for the first time, the application of such networks to denoise and segment SRμCT images of a real-world in situ experiment, using a dedicated experimental setup for acquiring the required training data. The network was defined with a depth of 100 layers for both denoising and segmentation tasks. The training was performed on a workstation with an Nvidia GeForce GTX 1080Ti GPU on CUDA 10.1.

To train the network for the denoising task, the paired high- and low-quality SRμCT reference images were used as training data. These scans were taken before the in situ experiments were performed on each sample. In preparation for training, the images were cropped to a dimension of 1024 × 1024 pixels using ImageJ110. This cropping reduced the image size and narrowed the region of interest to make training more efficient. Approximately 650 image slices were used from each viable reference high- and low-quality scan. Of the ten samples tested, three samples were not used during training due to imaging artifacts present in high-quality data or the inability to reconstruct image data. Therefore, a total of seven image volumes were used during training, validation, and testing. A training, validation, and test split of ~70-15-15 was performed on the cropped images, yielding 3240 training images, 680 validation images, and a holdout test scan of 662 test images. The training images were converted to 3D slabs containing five image slices. These slabs included two consecutive slices before and two slices after the image was denoised to provide the network additional contextual information. The data were input to the network in batch sizes of one and the order of the images was shuffled after each epoch. The training data were augmented using both horizontal and vertical flipping with a probability of 0.5 to mitigate overfitting. Training occurred over 44 epochs using MSE as the loss function and Adam as the optimizer.

Semantic segmentation capabilities of the MS-D net were used in crack segmentation. A training dataset was obtained through manual segmentation of the crack using Dragonfly 2020.2 (Object Research Systems (ORS) Inc, Montreal, Canada). The final time point of crack growth from each of the seven viable samples was manually segmented for ground truth labels used during training. This segmentation was completed using a combination of thresholding and manual selection. A threshold was used to select the background and exclude the bone. Then, using the snap and paint tools in Dragonfly, the regions of the background we identified as cracks were selected. An intermediate network was trained and used to provide a better starting point for the manual segmentation; however, the predictions of the intermediate network were also manually corrected. All segmentations were performed and reviewed by the same two researchers. The segmentations were overlaid on the original image to ensure good segmentation. Approximately 625 image slices were manually segmented from the viable samples. Denoised images of dimension 1024 × 1024 pixels were used as input to the network. The training, validation, test split of ~70-15-15 was maintained with 3124 training images, 625 validation images, and a holdout test scan of 632 images. Again, input images were arranged into slabs consisting of five images that were input to the network in batch sizes of one. The images were shuffled during training following each epoch. The training data were augmented through horizontal and vertical flipping with a probability of 0.5. To learn the segmentation task, cross-entropy was used as the loss function. Network performance was further evaluated by calculating the Dice similarity coefficient for the segmentation67.

Crack deflection quantification

Since crack deflection is the dominant extrinsic toughening mechanism, the amount of crack deflection in each sample was quantified (N = 1–2/group). Crack deflection was measured for each sample at the final time point when the sample had fractured completely. Quantification of crack deflection was a multi-step process calculated using a custom python code (see code availability section). The process begins with the binary image of the crack segmented by the MS-D network. Each segmented crack surface was converted to a skeletonized representation using the median position value between the top and bottom surfaces of the segmented crack, i.e., the median position through the thickness of the crack. The gradient of the skeletonized crack surface in the direction of crack propagation (mode I) was calculated. The gradient of the crack was calculated to determine the points on the crack where a change in direction occurred in the direction of crack growth. This gradient was then smoothed using a rolling average. Following smoothing, large crack deflections were filtered using minimum magnitude and angle thresholds of 50 microns and 30 degrees, respectively. Two samples with insufficient crack propagation or excessive crack bifurcation were excluded from the calculations (100 °C, 160 °C). To eliminate small crack deflections, the OPTICS clustering algorithm in sci-kit learn was used to group larger sections of crack deflections112. The detailed workflow used for this study is shown in Supplementary Figure 4 and in the crack deflection GitHub repo. After large crack deflection events are found using the magnitude and angle thresholds, the average number of deflections normalized by crack length (Df.Dn) and average crack deflection angle (Df.An) are calculated for each sample. Because the sample size is not sufficient for a comparison between each heat-treatment temperature, no statistics were performed on the individual groups. Instead, comparisons were performed on the UC and DC groupings of temperatures using an independent t test.

Digital volume correlation

The internal strain state of samples was visualized and evaluated using DVC (N = 1–2/group). DVC was performed using the Python-based package TomoWarp2113 to compare the unloaded reference volume to the deformed volume at different steps of crack growth. Scans that had been denoised using the CNN were cropped to 400 × 300 × 300 voxels to focus on the region of bone ahead of the crack tip, and exclude regions containing the crack. The cropped scans were segmented into binary images, separating the features of bone (lacunae, canals, and notch) from the bone matrix. These features were tracked during DVC. For each sample, a correlation window of 25 voxels in all directions and a node spacing of 10 voxels in all directions were chosen. The search window was chosen to include all possible deformations. Data filtering was carried out using functionality in Tomowarp2113. Both the outlier and median filters were applied to the displacement data. The strain was calculated using these processed data according to the Green strain functions included in Tomowarp2113. The strains were upsampled with first-order, linear interpolation to match the original image dimensions using SciPy114. Upsampled strains were written to 32-bit image files for visualization in Dragonfly 2020.2 (Object Research Systems (ORS) Inc, Montreal, Canada)

Statistics

The data were compiled and statistical analyses were performed using R. Appropriate statistical methods were used to determine significance. When comparing two groups an unpaired two-tailed Student’s t test was used. For multiple comparisons, an analysis of variance with a Tukey post hoc test was used. In both cases, statistical difference was determined with an α = 0.05. In all bar charts, the data are presented as mean ± standard deviation.