Introduction

As metallic implants are susceptible to corrosion after implantation, corrosion resistance testing is an important preclinical assessment which helps ensure long-term implant durability in the in vivo environment1,2,3,4,5. While there are multiple types of corrosion that can occur in medical devices such as galvanic, crevice, fretting, pitting, and metal ion release, this study will focus on pitting corrosion as determined by the standard test method, ASTM F2129 “Standard Test Method for Conducting Cyclic Potentiodynamic Polarization Measurements to Determine the Corrosion Susceptibility of Small Implant Devices”6 that is well defined and is an FDA recognized consensus standard. Pitting corrosion results from a failure of the material’s protective passive film, which leads to a localized breakdown at isolated sites7. The breakdown potential, Eb, is commonly used to assess corrosion resistance with higher Eb values suggesting increased resistance to corrosion8. Corrosion can have serious consequences to patients with implanted devices as metal ion release during corrosion can cause adverse health effects and corrosion byproducts can be transported throughout the body through blood flow. Nickel has been shown to cause metal ion toxicity, formation of pseudotumors and kidney failure at high enough doses9 and there is no known safe threshold to prevent a hypersensitivity response. While corrosion can affect the biocompatibility assessment10,11, biocompatibility is multifactorial with extensive and well established test standards to support it12,13. Specific biocompatibility concerns are not addressed in this manuscript. Corrosion on an implant could also lead to structural integrity problems as corrosion pits have been observed to initiate fatigue cracks in vitro14.

Nitinol is an example of a passive metal alloy that has been widely used in the biomedical field due to its unique properties including shape memory and psuedoelasticity1. While nitinol has been most commonly used in cardiovascular devices like stents, endovascular grafts, inferior vena cava filters, heart valves, etc., it is now being utilized in other areas, including respiratory, neurological, spinal, and orthopedic. As experience with nitinol has grown over the years and as nitinol devices are being used in different locations within the body, the importance of surface finish to its corrosion resistance has become recognized throughout the medical device community15,16. Highlighting the importance of surface finish to corrosion susceptibility, Sullivan et al.17 evaluated nitinol stents made with the same base material, but with five different surface finishes. The authors found that the breakdown potential varied from an average of −117 mV for an oxidized tubing finish to no breakdown observed up to 1000 mV on stents with an electropolished (EP) finish.

ASTM F2129 details a number of experimental parameters (test temperature of 37 ± 1 °C, deaeration of the test solution with nitrogen gas, measurement of the open circuit potential of the test specimen for 1 h prior to initiating the potentiodynamic portion of the test, scan rate of the potentiodynamic portion of the test, etc.) to ensure repeatability and reproducibility among laboratories6. Multiple studies have investigated the effect of these parameters on test results. Increasing immersion time in phosphate-buffered saline (PBS), for example, has been found to shift the rest potential, Er, to higher values with the amount of the shift being modulated by the exposure time and nitinol surface finish5,18,19,20. These same studies similarly found increases in Eb with longer immersion time which depended again on surface finish and whether breakdown was observed after the shorter immersion time. Previous research in our laboratory has additionally evaluated whether damage to the oxide layer through fatigue or fretting prior to testing could affect corrosion susceptibility19,21. Both studies found that such damage did not affect the observed breakdown potential of nitinol with differing surface finishes per ASTM F2129 which may have been due to repassivation of damaged regions. ASTM F2129 recommends PBS to be used as the standard test solution, although other solutions are allowed, and several formulations are provided in Appendix X2 of the document. Studies have examined the use of bovine blood, simulated gastric fluid, and PBS containing amino acids as a test solution and found that the test solution, in addition to the immersion time and surface area, can all affect the observed Er and Eb values22,23,24.

Although many metallic implants are available in several sizes to accommodate varying patient anatomies, ASTM F2129 does not currently provide recommendations on what device size to test except to say that the upper limit on specimen size may be limited by the current delivery capability of the test apparatus. For a given device and assuming uniform processing, a larger surface area may increase the likelihood that a flaw exists that would initiate pitting. This hypothesis is based off of a previously published model on the corrosion performance of stents which combined the probability of the stent pitting with the probability the breakdown potential would be below the acceptance criteria20. The statistical model for if a stent would pit was based off the exposed surface area and the number of occurrences per unit surface area. While the overall model matched the data in the study, it was limited to a single stent size and did not assess the influence of device size on the corrosion performance. Although the implant size selected for an individual patient depends primarily on anatomic and mechanical considerations, the question of how specimen surface area might affect corrosion susceptibility and which implant size to select for pre-clinical testing remains open. To that end, with this study, we attempt to characterize how surface area affects corrosion susceptibility per ASTM F2129 as well as the interplay of surface area and nitinol surface finish.

Results

Corrosion tests

Figure 1 shows a plot of potential on y-axis and surface area on the x-axis of the average breakdown potentials of amber oxide (AO), mechanically polished (MP), chemically etched (CE), and EP wires as well as laser-cut stents. Statistical analysis (Table 1) showed that for AO, MP, and CE samples, there is a statistically significant difference (p < 0.05) in Eb between the smaller surface areas (0.05–2.00 cm2) and larger surface areas (5.00–10.0 cm2).

Fig. 1: Average breakdown potential vs. surface area.
figure 1

Plot of the average breakdown potential of AO, MP, CE, and EP wires and modified stents at all surface areas tested. The vertex potential of 1000 mV was used for samples that did not breakdown. Error bars represent ±1 standard deviation.

Table 1 Log rank test.

Figure 2 shows a bar graph of the percentage of the number of samples that experienced a breakdown at each surface area. It is worth noting that AO, MP, and CE all have a samples size of N = 10 at each surface area, while EP wires and the stents were only tested at two surface areas with a sample size of N = 6. The difference in the sample size is the reason for the representation in percentage instead of the actual sample count. The stent with the larger surface area was labeled as long stent, while the smaller one was labeled as short stent.

Fig. 2: Percent of Samples with Breakdown.
figure 2

Percentage of AO, MP, CE, and EP wires and modified stent samples with breakdown at all surface areas tested. Note that only two different surface areas were tested for both EP wires and stents.

Figure 3 shows the distribution of breakdown potentials. This bar graph shows how many samples exhibited pitting corrosion at each potential. AO samples show a peak at about 600 mV, while for CE and MP samples are more widely distributed.

Fig. 3: Distribution of breakdown potential.
figure 3

Bar graphs showing the distribution of breakdown potentials of a AO, b CE, and c MP wires over all surface areas tested.

The average breakdown potential in Table 2 is tabulated in two categories: “w/NB” and “w/o NB” where NB stands for No Breakdown. Thus, the “w/NB” category average includes all specimens (specimens which reached the vertex potential without exhibiting breakdown were scored at 1000 mV). The “w/o NB” category average excludes specimens which reached the vertex potential without exhibiting breakdown. As previously stated, samples that did not breakdown are assumed to have a breakdown potential of 1000 mV.

Table 2 Average breakdown potentials.

Poisson distribution

Figures 46 show the plots of the probability of a certain number of pit occurrence of AO, CE, and MP at selected surface areas. The curve is the Poisson model while the red markers represent the actual number of counted pits. The λ values were calculated based on the rate of pit occurrence. In the case of AO, for example, the average number of pits at 10 cm2 is 89.90 pits. This gives an average of 8.99 pits/cm2. To get the λ values of the smaller surface areas, the average number of pits per surface area was used. The average number of pits per 1 cm2 of surface area for CE is 3.84 and 14.74 pits/cm2 for MP.

Fig. 4: Poisson distribution model of amber oxide samples.
figure 4

Poisson distribution of amber oxide samples at a 0.05, b 0.10, c 1.00, and d 10.0 cm2.

Fig. 5: Poisson distribution model of chemically etched samples.
figure 5

Poisson distribution of chemically etched samples at a 0.05, b 0.10, c 1.00, and d 10.0 cm2.

Fig. 6: Poisson distribution model of mechanically polished samples.
figure 6

Poisson distribution of mechanically polished samples at a 0.05, b 0.10, c 1.00, and d 10.0 cm2.

Table 3 shows the chi-square analysis between the Poisson model and observed number of pits on AO, MP, and CE samples at different surface areas. Chi-square analysis usually requires a sample size greater than 20, since this study has a sample size of 10, the calculated chi-square values had to be divided by the William’s correction factor. This new value was then compared to the critical value at α = 0.05 obtained from the chi-square table. The Poisson model was deemed acceptable when the chi-square value with William’s correction is less than the critical value.

Table 3 Chi-square analysis.

Discussion

This work set out to study the effect of surface area on pitting corrosion susceptibility. To account for differences in surface finish, three surface finishes of nitinol (AO, CE, and MP) had their pitting corrosion susceptibility assessed from 0.05 to 10 cm2, and a fourth surface finish (EP) assessed for pitting corrosion susceptibility at 5 and 10 cm2. To account for geometry differences between straight wire and the complex shape of medical devices, as well as any differences between nitinol wire and laser cut nitinol tubing, two different lengths of nitinol stents with a thermally grown oxide were also assessed for pitting corrosion susceptibility. While this study did not investigate every metal, surface finish, or design used in medical devices, it did investigate a range of surface areas that extends above and below the average surface area of cardiovascular devices17,25 and materials whose breakdown potentials also match the mean breakdown potential of materials used in cardiovascular devices26.

As shown in Fig. 1, surface area had an observable effect on the pitting corrosion susceptibility on all surface finishes of nitinol. While this trend was expected, it was illuminating to observe how the trends differed across the surface areas. The different surfaces had differing critical surface areas above which corrosion was likely to occur. For AO and CE samples, the average breakdown potentials appeared to plateau to a minimum value at surface areas greater than 5 cm2; however, this result may not be generalizable to all implants made with these surface finishes. No such plateau was found in MP wires tested. Plateauing in the average breakdown potential was also evident in the statistical analysis shown in Table 1. The breakdown potentials observed at smaller surface areas were compared with the largest surface area. No statistical difference was found for AO and CE samples between surface areas of 5 and 10 cm2. Interestingly, when comparing how many specimens experienced electrochemical breakdown (Fig. 2), the CE and MP surface finishes had similar numbers across the differing surface areas, while the AO surface finish experienced electrochemical breakdown more readily than the other two surface finishes. The EP surface finish experienced far fewer electrochemical breakdowns at 5 and 10 cm2 compared to the other surface finishes which is why smaller surface areas of EP nitinol wires were not investigated in this study. This also demonstrates the importance of surface finish on what the critical surface area is for a material to be susceptible to pitting corrosion.

Returning to the mechanics of pitting corrosion, pitting corrosion is expected to occur at a flaw or weak spot on the oxide of the metal surface. This theory can explain Fig. 2 as each surface finish will result in a different distribution of surface flaws/defects in the protective titanium oxide layer surrounding the nitinol wire. While there is some variability observed in Fig. 2, the general trend shows an increasing number of samples experiencing electrochemical breakdown with increasing surface area. To assess the statistical variability, these results were plotted against the Poisson distribution calculated based on the number of pits per surface area of the largest samples (Figs 46). Based on the chi-square test (Table 3), these figures show a generally good fit at lower surface areas with more divergence at larger surface areas. While this may be a limitation of using a sample size of 10 for each surface area, the practical utility of this approach is to assess what the critical surface area is for each material surface finish when pitting corrosion is likely to occur (when the probability of one or more pits becomes likely).

While the Poisson distribution reasonably accounts for the differences between the surface finishes seen in Fig. 2, it does not account for the differences in breakdown potential seen in Fig. 1 between AO, CE, and MP. This led to the supposition that if there is a distribution of flaws in the surface of the nitinol, what does the distribution of the breakdown potentials for those flaws look like? This is shown in Fig. 3 where for each surface finish a bar graph of all breakdown potentials for all surface areas is shown. ASTM F2129 does not allow for the determination of the breakdown potential for each pit on a surface, instead it measures the breakdown potential of the flaw with the lowest breakdown potential on the exposed surface. Assuming the breakdown potential for the flaws is a normal distribution, it is expected that Fig. 3 would show either a normal distribution or a distribution skewed to lower breakdown potentials given the left censoring of ASTM F2129. Indeed, AO surface appeared to have a normal looking distribution, while CE and MP surface finishes had a flatter and broader distribution, and both seemed to have a median slightly lower than the AO surface finish. This raises an interesting question about the AO surface finish as it appears in this study to have more flaws in the oxide layer, but those flaws seem to have a higher breakdown potential with less variability than the MP and CE surface finishes. While this would explain why the MP surface finish does not appear to plateau in Fig. 1 while the AO surface finish does, it does not explain why the CE surface finish converges when it has the same flaw distribution as the MP finish. Still, as ASTM F2129 measures the worst of the flaws, the number of flaws has a greater influence on the overall corrosion resistance than the median breakdown potential of the flaws. On a similar front, each finish had a threshold surface area above which electrochemical breakdown was observed for all specimens (with the exception of one specimen at 2.0 cm2 AO). For AO, the threshold surface area was approximately 0.5 cm2, while for MP and CE, the threshold surface was approximately 5.0 cm2. Therefore, the minimum surface area required to assess the pitting corrosion susceptibility in a data set in which all specimens would be expected to experience an electrochemical breakdown increases with improved surface quality.

It is generally known that nitinol corrosion resistance depends highly on the surface finish and oxide layer15,16,17,27 and our results here confirm the importance of surface finish to breakdown potential. Although pitting corrosion can initiate from any number of surface flaws, inclusions have been the subject of recent investigations into pitting corrosion and may help explain why corrosion resistance decreased with increasing surface area in our study. One previous study that investigated the number of surface inclusions as nitinol was processed into tubes reported an increase in corrosion resistance with decreasing surface inclusion size28, which suggests that inclusions may be an important factor for the initiation of pitting corrosion. Sun et al.29 examined corrosion susceptibility in nitinol stents and found an inverse relationship between inclusion fraction and breakdown potential. Another recent study furthermore suggests that pitting initiation in EP nitinol was associated with Ti2NiOx inclusions and that titanium depletion in the surrounding alloy matrix may be the mechanism that renders the area more susceptible to pitting corrosion30. While it was not possible with our study design to analyze the dynamics of corrosion pit initiation nor was it within scope to conduct a systematic analysis of inclusions for each surface finish, it is logical that the differing surface finishes we tested would have varying distributions (size and area fraction) of surface inclusions that could have acted as corrosion pit initiation sites. As the surface area was decreased in our test specimens, the likelihood of each specimen containing a large inclusion (or other flaw) would have decreased as well resulting in higher recorded breakdown potentials as other (possibly smaller) flaws were able to initiate the first corrosion pit.

While some cardiovascular medical devices are made from wire, it is possible that wire and wire surface finishes would not be representative of the performance of non-wire based devices. While limited, the results from the laser-cut stents with a thermally grown oxide revealed that when a single stent ring was exposed during corrosion testing, half of the samples did not experience electrochemical breakdown, while in samples where 40 stent rings were exposed, all exhibited electrochemical breakdown. Although a limited sample size was used for our stent testing, the results seem to imply that the trends observed on wire specimens can be applied to non-wire devices. These findings indicate that when it comes to medical device testing, for all surface finishes, the largest device will be the representative of worst-case condition for corrosion testing, while for some surface finishes intermediate sized devices could serve as a worst-case test condition. Thus, for all surface finishes, there was a significant effect of surface area on the pitting corrosion susceptibility indicating that even for surface finishes that do seem to converge to a breakdown potential at large surface area it may be important to ensure that smaller sized devices do not fall below the critical surface area. Overall, the data suggests that the largest device size should be considered as the worst-case for assessing pitting corrosion susceptibility when performing ASTM F2129.

In conclusion, this study has shown that surface area affects the breakdown potential based on ASTM F2129 with larger surface areas having lower breakdown potentials. The effect is modulated by the surface finish. For some nitinol surface finishes such as AO and CE, there was a surface area threshold above which the average breakdown potential plateaued. The observed trends were consistent across wires with different surface finishes as well as with laser cut stents. Poisson distribution model was a good fit for predicting the number of pits at smaller surface areas; however, more deviation was found at larger surface areas. The above conclusions support the need to consider the effect of surface area when selecting test specimens for pitting corrosion susceptibility testing.

Methods

Materials

Four different surface finishes [AO, CE, MP, and EP] of Nitinol were selected. The nitinol wires all have a diameter of 0.5 mm. Wires were tested as received. Nitinol wires were cut to length to achieve the desired surface area. For AO, CE, and MP surface finishes 10 samples were tested (N = 10). For EP surface finish six samples were tested (N = 6).

Commercially available laser-cut nitinol stents with a diameter of 7 mm and length of 100 mm were selected. Prior to corrosion susceptibility testing, the stents (which had a nominal EP finish) were subjected to air furnace heating at 540 °C for about 2 h to thermally grow the oxide layer (Fig. 7a, b) to increase pitting corrosion susceptibility. A sample size of six (N = 6) was utilized for each test condition.

Fig. 7: Sample preparation and test set up.
figure 7

Optical microscopy image of stents a before and b after subjecting to air furnace heating to thermally grow an oxide layer. Sample preparation for c 0.05–2.00 cm2 and d 5.00–10.0 cm2 after they were coated with a conductive silver paint and nonconductive lacquer. f Representative curves of the cyclic potentiodynamic polarization tests for each kind of specimen performed on e a three-electrode cell. Digital microscopy images of post-corrosion amber oxide samples with g individual and h cluster of pits.

Corrosion testing

Wires were tested as per ASTM F21296 “Standard Test Method for Conducting Cyclic Potentiodynamic Polarization Measurements to Determine the Corrosion Susceptibility of Small Implant Devices” in deaerated PBS at a constant temperature of 37 °C. Prior to testing, PBS was deaerated for 30 min by sparging nitrogen gas at 150 mL/min. An Interface 1000 (Gamry, Warminster, PA) potentiostat, which was in compliance with ASTM G531 “Standard Reference Test Method for Making Potentiodynamic Anodic Polarization Measurements”, was used to perform the corrosion test. Saturated calomel electrode (SCE) was used as reference electrode, while a graphite rod was used as counter electrode (Fig. 7e). The ASTM F2129 test started with an hour of open circuit potential to determine the rest potential, then followed by a cyclic polarization test that has a vertex potential of 1 V vs. SCE and back down to the rest potential at a scan rate of 1 mV/s. Figure 7f shows representative curves of the cyclic potentiodynamic polarization test for each kind of specimen and Fig. 7e shows the three electrode test cell used for testing. Breakdown potential (Eb) was defined as shown in Fig. 7f with a sudden increase in current.

The surface areas tested for AO, CE, and MP wires were 0.05, 0.1, 0.5, 1.0, 2.0, 5.0, and 10.0 cm2. For EP wires, the surface areas tested were 5.0 and 10.0 cm2. The samples were then attached to a sample holder with conductive silver paint (Fast Drying Silver Paint, Ted Pella, CA). The areas of attachment were coated twice with a non-conductive lacquer (Microstop, Tolber Chemical, AK) and dried overnight (Fig. 7c, d).

For the stent specimens, two different surface areas were tested. The smaller surface area of 0.34 cm2 was created by cutting the stent such that only one continuous ring of stent was exposed to the testing solution. The larger surface area of 13.5 cm2 had nearly 40 continuous rings of stent exposed. As with the wire samples, the stents were attached to a sample holder with conductive silver paint and twice coated with a nonconductive lacquer and dried overnight.

Poisson distribution

In addition to performing corrosion tests, the study also attempted to predict when corrosion will happen with a Poisson distribution model showed by Eq. (1).

$$P(k) = e ^\wedge ( - \lambda )\lambda ^\wedge k/k!$$
(1)

P(k) models the probability of k number events occurring in a given interval. Lambda (λ) is the expected number of occurrences. This model will be applied by counting the number of pits in each surface area of the wire. In this case, P(k) will be the probability that k number of pits would occur per surface area, while λ is the average number of pits per surface area. The model distribution will be then compared against the observed number of pits. This statistical approach has been used previously to establish a statistically significant sample size for corrosion testing20.

In order to count the number of pits, after the corrosion test, the wires were removed from the sample holder, rinsed with deionized water, and dried. The pits found in the entire length of the wire were counted under a stereo microscope (Nikon SMZ-10). It is expected that the pits will not appear uniformly, so for efficiency, clusters of pits were counted instead of individual ones (Fig. 7g). A cluster of pits (Fig. 7h) is considered as a group of multiple pits that appeared inside another pit.

Statistical analysis

Statistical analysis was performed using Minitab (Minitab Inc., State College, PA). The breakdown potentials, Eb, of all the samples at each surface area were compared through a Log Rank test with right censoring. This is because when the samples did not experience a breakdown, Eb was assumed to be 1000 mV. Statistical significance was set at α < 0.05. Additionally, a chi-square (Χ2) test performed in MS Excel (Microsoft Corp., Redmond, WA) was used to determine the goodness-of-fit between the Poisson distribution model and the observed number of pits. Since the sample size is small, William’s correction factor was used.