Reinforcement learning for patient-specific optimal stenting of intracranial aneurysms

Hachem, E.; Meliga, P.; Goetz, A.; Rico, P. Jeken; Viquerat, J.; Larcher, A.; Valette, R.; Sanches, A. F.; Lannelongue, V.; Ghraieb, H.; Nemer, R.; Ozpeynirci, Y.; Liebig, T.

doi:10.1038/s41598-023-34007-z

Download PDF

Article
Open access
Published: 02 May 2023

Reinforcement learning for patient-specific optimal stenting of intracranial aneurysms

E. Hachem¹,
P. Meliga¹,
A. Goetz¹,
P. Jeken Rico¹,
J. Viquerat¹,
A. Larcher¹,
R. Valette¹,
A. F. Sanches²,
V. Lannelongue¹,
H. Ghraieb¹,
R. Nemer¹,
Y. Ozpeynirci² &
…
T. Liebig²

Scientific Reports volume 13, Article number: 7147 (2023) Cite this article

1281 Accesses
2 Citations
Metrics details

Subjects

Abstract

Developing new capabilities to predict the risk of intracranial aneurysm rupture and to improve treatment outcomes in the follow-up of endovascular repair is of tremendous medical and societal interest, both to support decision-making and assessment of treatment options by medical doctors, and to improve the life quality and expectancy of patients. This study aims at identifying and characterizing novel flow-deviator stent devices through a high-fidelity computational framework that combines state-of-the-art numerical methods to accurately describe the mechanical exchanges between the blood flow, the aneurysm, and the flow-deviator and deep reinforcement learning algorithms to identify a new stent concepts enabling patient-specific treatment via accurate adjustment of the functional parameters in the implanted state.

In-silico trial of intracranial flow diverters replicates and expands insights from conventional clinical trials

Article Open access 23 June 2021

A Deep Learning Framework for Design and Analysis of Surgical Bioprosthetic Heart Valves

Article Open access 06 December 2019

Adaptive wireless millirobotic locomotion into distal vasculature

Article Open access 01 August 2022

Introduction

An estimated 2–3% of the population harbour intracranial aneurysms (IAs)^1,2, a pathological, localized sac-like outpouching of the arterial wall, whose rupture is the leading cause of nontraumatic subarachnoid haemorrhage, associated with a high rate of morbidity and mortality and a significant economic burden³. The increased frequency at which unruptured IAs are being diagnosed, due to the widespread use of cross-sectional neuroimaging in routine clinical practice, poses a persistent dilemma for physicians. This is imputable to the lack of definitive guidelines for optimal management, which is due to the high prevalence of aneurysms along with low rupture rates (with the annual occurrence of subarachnoid haemorrhage being about 10 per 100.000 persons⁴), and preventive treatment carrying risks of adverse complications⁵.

Aberrant vascular remodelling occurring through abnormal hemodynamic stress on blood vessels is believed to be a major factor in intracranial aneurysms pathophysiology, i.e., formation, growth, and stabilization or rupture⁶. The stress distribution, as determined by the blood flow and aneurysm geometry, elicits vascular remodelling via cell-mediated biologic pathways. This modifies the geometry, the stress, and drives further biologic processes, with rupture occurring when the stress on the aneurysm wall exceeds the yield strength of the material^7,8. This biomechanical approach has proven relevant in assessing rupture risk, with hemodynamic indices such as flow-induced pressure (the stress normal to the vascular wall) and wall shear stress (WSS, the viscous frictional force exerted parallel to the blood flow) identified as potentially significant determinants of aneurysm natural history^9,10,11,12.

Preventive treatment of unruptured intracranial aneurysms consists in occluding the sac to prevent blood from flowing directly into the aneurysm, which in turn helps reduce the stress on its wall. The two main options have long been surgical clipping and coiling¹³. Clipping is invasive, as it requires performing a craniotomy and exposing the aneurysm before placing surgical clips across the neck. While highly effective, clipping is constrained to easily accessible aneurysms and operations generally bear a substantial complication risk. Endovascular procedures, such as stenting and coiling, minimize the operational risk by avoiding open skull surgery. The latter approach involves filling the aneurysm sac with flexible platinum wires that dampen out ingoing blood jets and contribute towards the occlusion of the bulge. Since the wires are contained by the sac, wide neck IA or fusiform IA cannot be treated this way, due to the high risk of embolic disease and coil detachment.

In recent years, the implantation of flow-diverter (FD) stents has gained increasing acceptance among the interventional and neurosurgical communities as an effective alternative treatment option¹⁴. Such an approach consists in the endovascular deployment of flexible, highly conforming braided mesh devices along the parent artery and across the neck. The blood flow into the aneurysm is damped and redirected by the low porosity layer of FD wires covering the neck, reducing the overall circulation in the sac. The blood stagnation that follows a successful deployment causes a thrombus formation in the aneurysm cavity and a subsequent endothelialization of the neck¹⁵. In some cases, the completely occluded aneurysm is progressively reabsorbed by the parent vessel, precluding regrowth by hemodynamic mechanisms. Flow diverter stents have started a breakthrough in the endovascular management of intracranial aneurysms (including many wide-necked and fusiform aneurysms that were previously considered untreatable) but their mechanism of action is not thoroughly understood, as about 5–25% of aneurysms remain with circulation even after multiple-layer implantations¹⁶.

A substantial body of work is ongoing to improve aneurysm treatment outcomes by increasing the flow-diversion effect of the implanted stent^17,18. The functional performance is largely dependent on implantation (e.g., sizing, landing zone) and geometrical features (e.g., braid angle, wire density, wire diameter) with wire material properties also being an important contributor. Hemodynamically, we believe that the pore density is a key parameter, as it must be high enough to occlude the aneurysm sac satisfactorily¹⁵, but not so high that it would trigger inflammatory remodelling associated with low-WSS values¹⁹. Nonetheless, there is currently a lack of empirical evidence supporting the superiority of one design over the others, meaning that the type of stent used for each patient is often based on the length of the lesion and the personal preference of the physician (even availability of stock). Therefore, the ability to design novel stent concepts from fast and accurate identification of patient-specific functional parameters is of utmost importance to provide clinical insight, optimize treatment decision-making, and improve prognosis. This has never been done before.

In order to make progress towards this objective, the present study combines multi-physics computational fluid dynamics (CFD) and deep reinforcement learning (DRL) to prove the applicability of such an optimization workflow for patient-specific stent design. On the one hand, CFD has risen to a prominent position in the endovascular research community due to its potential for rupture risk prediction via objective, quantitative, and mechanism-based parameters^20,21, and its contribution to the design, development and evaluation of endovascular management methods^22,23. On the other hand, DRL has been shown to perform with unprecedented efficiency in several areas, e.g., language processing²⁴, robotics^25,26, autonomous driving²⁷, finance²⁸ or healthcare management^29,30, including recent inroads in computational biomechanics³¹.

The efforts for coupling CFD and DRL are developing rapidly, with a handful of pioneering studies providing insight into the performance improvements to be delivered in shape optimization^32,33,34 and flow control^35,36,37; see³⁸ for a review. This is largely ascribed to the sustained efforts and commitment of the machine learning community, which has allowed expanding the scope from computationally inexpensive, low-dimensional model reductions^39,40,41 to complex two- and three-dimensional Navier–Stokes systems^{42,43,44,45,46,47,48,49,50}. Nonetheless, DRL has never been applied to hemodynamics computations (let alone biomedical flow computations in patient-specific geometries), even though we believe the field has matured up to the point where a breakthrough may be in reach for targeted control of unruptured intracranial aneurysms.

Results

Pre-stent hemodynamics

Direct numerical simulations of the three-dimensional, incompressible Navier–Stokes equations performed with the Carreau–Yasuda rheological model of blood are used to investigate two patient-specific models of unruptured intracranial aneurysm. In the first place, the focus is drawn on the geometry labelled A, whose vascular information is provided in Fig. 1. It is a side-wall, wide-neck aneurysm of the supraclinoid internal carotid artery (ICA), proximal to the ICA bifurcation into the anterior cerebral artery (ACA) and the middle cerebral artery (MCA); see Fig. 1 for provision of the detailed vascular information. The posterior communicating artery (PComA) is neglected, as it branches off well past the aneurysm. The ophthalmic artery (OA) is also neglected, although it branches off at this section of the ICA, in the vicinity of the aneurysm. Nonetheless, we do not anticipate any significant effect on the hemodynamics given its patient-specific smallness ($\sim 0.5$ mm in diameter), plus this outflow is often neglected in numerical simulations as it is a common clinical practice to let flow diverters occlude it if no other viable option is present⁵¹. The simplified model therefore ultimately features a single source of inflow (the ICA) and two outflows (ACA/MCA).

The vessel walls, taken to be impermeable and rigid, are treated with body-fitted, unstructured adapted grids (see Fig. 2a). The numerical solutions are customized to the patient (also labelled A) specific physiology using vascular geometries reconstructed from three-dimensional rotational angiography (3D-RA) images and pulsatile volumetric inflow rates adjusted to two-dimensional phase-contrast magnetic resonance imaging (2D-PCMRI) measurements. All quantities of interest, including velocity, WSS (the local, instantaneous, intra-saccular wall shear stress of pivotal importance in this context), SAWSS (the instantaneous WSS spatially averaged over all intra-saccular positions), and TAWSS (the local WSS averaged over a cardiac cycle), are computed from parallel hemodynamics simulations run over two cardiac cycles (representing approximately 1.6 s of physical time) with a reference inflow sequence of consecutive pulses starting in the end-diastolic state. This is because the solution has settled into regular, sinusoidal oscillations by the end of the first cycle, as obtained from preliminary comparison of WSS data over up to ten cardiac cycle.

The peak-systolic streamlines in Figs. 2 and 3 show that the flow remains close to parabolic in the inflow segment (hence reminiscent of Hagen–Poiseuille flow), but quickly becomes helical because the curved vessel geometry acts as a source of flow instability, as recently assessed in patient-specific geometries⁵³. Most of the blood enters the aneurysm at the proximal part of the neck in the form of a high-speed jet ($\sim$ 0.73 m/s in velocity magnitude), impinging on and reflecting off the aneurysm wall, rolling up into complex vortical structures and finally swirling out of the bulge and to the outflow segments. This creates a strongly heterogeneous WSS pattern, with most of the distal part of the dome sustaining high WSS classically associated with aneurysm growth and rupture, as clearly illustrated in Fig. 3a, b. Maximum local, instantaneous WSS values of more than $\sim$ 169 dyne/cm$^{2}$ have been measured near the impaction zone, which is about 8 times normal WSS in cerebral arteries⁵².

Virtual stenting

Endovascular treatment is modeled by wrapping a distribution of identical, cylindrical wires around a toroidal envelope inscribed in the arterial segment containing the aneurysm (half of them clockwise and the other counter-clockwise). In order to achieve heterogeneous functional parameters (in the sense that the pre-deployment stent structure must have variable pore density and porosity), the proximal end section of the envelope is divided into four quadrants, each of which with a specific (possibly different) number of uniformly distributed wires.

The parametrization foresees the modification of six design variables: the number of wires in each group, their radius and a winding factor (the same for all wires) that controls the local braiding angle between wires. Given the difference in scales between the vascular vessels (about a few mm in diameter) and the stent strut thickness (about a few ten $\upmu$m), we rely on a hybrid meshing approach wherein the stents are embedded in the body-fitted vascular grid^54,55. Anisotropic adaptation in the vicinity of the stent envelope, as shown in Fig. 2a. We then use the monolithic immersed volume method (IVM⁵⁶) together with anisotropic mesh adaptation in the vicinity of the stent envelope to solve the interaction between the blood flow and the stent material. This allows easy handling of any complex device whose struts may be in contact with, or form very small gaps with the vessel walls, without additionally conforming the vascular grid to the stent geometry.

The proposed approach is first used to assess the ability of the method to capture numerically the flow diversion effect using a standard homogeneous stent strut distribution made of 24 wires (12 in each braiding direction, 3 in each group) with a radius set to 60 µm. We believe this is a reasonable compromise between desirability and feasibility, as thinner wires mirroring more accurately those of real medical stents (whose radii are in a range from about 15 to 30 µm) would escalate the CPU time and memory requirements (due to the need to embed large stent meshes and to additionally refine the vascular grids). All wires are braided with winding factor 25, that yields a braiding angle of 75$^\circ$ and a porosity of about 68.5%, (pore density of 3.1 pores/mm²), all values estimated from a pre-deployment, cylindrical stent structure.

Visualizations of the inflow jet, intra-saccular flow pattern (Fig. 4a) show that the stent significantly disrupts the blood flow, more of which is diverted away from the aneurysm and flows directly to the outflow segments. Nonetheless, the flow organization inside the aneurysm is essentially reminiscent of its unstented counterpart, with blood entering at the proximal part of the neck and swirling in and out after impinging on the distal wall. The key difference lies in the inflow jet having lower velocity (about 0.60 m/s in velocity magnitude) and weaker shear, hence less vorticity, and ultimately less shear stress on the aneurysm wall. This is further illustrated by the TAWSS distributions in Fig. 5a, c, where the instantaneous WSS peaks at about $\sim$ 120 dyne/cm$^{2}$ . This represents a reduction of about 30% with respect to the unstented case illustrated in Fig. 3, although we notice the persistence of a heterogeneous WSS pattern over the distal part of the dome.

DRL optimization

The optimization objective considered herein consists of bringing back the post-operative value of MWSS (defined as the maximum of SAWSS over a full cardiac cycle) to a setpoint of half the pre-operative value, hence the reward

$$\begin{aligned} r&= -\left| \text {MWSS} - \text {MWSS}_{\text {ref}} \right| \qquad \text {with}\qquad \text {MWSS}_{\text {ref}}=\frac{\text {MWSS}_{\text {0}}}{2}\,, \end{aligned}$$

(1)

where the 0 subscript denotes a pre-stent quantity. This choice is intended to reduce high WSS associated with aneurysm growth and rupture, while avoiding low WSS conditions that might initiate apoptotic pathways via undesired vascular remodelling¹⁹. In practice, the MWSS₀ for patient A is 76.6 dyne/cm$^{2}$ (roughly half the maximum local, instantaneous value reported above), which yields a setpoint of 38.8 dyne/cm$^{2}$ .

Single-step PPO, a reinforcement algorithm intended for situations where the optimal policy is independent of state⁴⁸, is used to evolve five (out the six) design parameters, as the wire radius has been set to 60 µm to keep the computational cost affordable, although it is a free parameter that could also be learnt from data. For each learning episode, the DRL agent (a deep neural network) therefore outputs five discrete values: four values $\{n_{j\in 1\dots 4}\}$ between 3 and 7 in step of 1 for the numbers of wires (one per set of equally spaced wires) and one value k between 20 and 35 in steps of 5 for the winding factor, hence 2500 parameter combinations comprising between 24 and 56 wires in total. The generated stent configurations have nominal braiding angles in a range from 65$^\circ$ to 95$^\circ$ and porosities in a range from 30.5 to 68.5% (pore densities between 3.1 and 22.7 pores/mm²). all values estimated from pre-deployment, cylindrical stent structures. The reward evaluation proceeds from hemodynamics simulations run over two cardiac cycles, with MWSS computed over the second cycle, after which the network is updated for 32 epochs using 8 environments and 2 steps mini-batches.

A total of 68 episodes have been run for this case, which represents 544 simulations, each of which lasts 20 min using 32 cores, hence 5,760 h of total CPU cost (equivalently, 45 h of resolution time). A moving average reward is also computed as the sliding average over the 100 latest values to assess convergence a posteriori (see Fig. 6). The reward convergence history in Fig. 6 evidences the successful convergence of the PPO algorithm coupled with patient-specific hemodynamic simulations. After 50 episodes (representing 400 simulations, hence 400 out of the 2500 possible designs), the DRL agent indeed starts to systematically pick the specific stent shown in Fig. 6d, whose red patch singles out the region of interest located in front of the aneurysm neck. The latter is made of 34 wires (17 in each braiding direction, distributed into four groups of 5, 3, 5 and 4 wires, respectively) braided with winding factor 25. This yields in a nominal average porosity of 55%, with pore densities (in deployed state) ranging between 2.3 and 16.0/mm² in the neck region facing the aneurysm.

Such a design is meant to be optimal for the patient specific aneurysm geometry and pulse, which is assessed now by comparing numerically the post-operative hemodynamics treated with the optimal stent designed by DRL (that earns a MWSS of 37.8 dyne/cm$^{2}$ differing from the intended setpoint by 1%) and with the standard homogeneous stent considered so far (that earns a MWSS of 54.7 and is clearly inferior for the chosen reward). The peak-systolic streamlines in Fig. 4 illustrates the different flow deviation effect of the optimal stent. The latter successfully and adequately cuts down the inflow jet (that was inducing high WSS values on the distal part of the bulge, whose velocity magnitude is now about 0.47 m/s) while substantially altering the intra-saccular flow organization, found to involve fewer vortex structures, more parallel streamlines, and less swirling. We note that the optimal stent also substantially reduces the blood velocity at the exit of the aneurysm: 0.26 m/s, to be compared to 0.35 m/s without stent and using the standard, homogeneous stent. The result on the WSS distribution is even more patent, as Fig. 5 shows that the DRL stent has completely eliminated the unstented area of maximum WSS (the local, instantaneous WSS now peaks at about 91 dyne/cm$^{2}$ , which represents a reduction of about 45% with respect to the unstented case.) while restoring an almost homogeneous WSS pattern, which the homogeneous stent had failed to achieve.

Generalizability study

For the sake of generalization (and in order to assess suitability for various aneurysm configurations), we apply now the DRL framework to a second patient-specific model of untreated, unruptured intracranial aneurysm (labelled B) whose vascular information is provided in Fig. 1. It is a saccular, multilobulated aneurysm located on the ICA, at the junction with the ophthalmic artery (OA). The latter is thus retained in the model (as it cannot be cleanly removed), yet not occluded numerically, as we voluntarily let blood flow from the ICA into the OA across the stent to explore the model ability to handle more complex intra-aneurysmal flow conditions.

Following the same steps as for the previous case, the pre-operative hemodynamics of this patient has been analyzed from numerical simulations customized to his/her physiology (both in terms of vascular geometry and inflow pulse) carried out over two cardiac cycles (representing 1.8 s of physical time). The peak-systolic streamlines shown in Fig. 7a show that the case has many similarities to that of patient A, as the flow quickly becomes helical, and most of the blood enters the aneurysm at the proximal part of the neck in the form of a high-speed jet (0.72 m/s in velocity magnitude), that traverses the entirety of the primary bulge, impinges on its distal wall, rolls up into complex vortical structures and finally swirls out to the outflow segments (including the OA). As illustrated in Fig. 8a, c, this again yields a strongly heterogeneous WSS pattern, with high WSS values up to $\sim$ 146 dyne/cm$^{2}$ in the vicinity of the neck and in most of the distal part of the dome, but low WSS in the daughter sac, that turns to be barely exposed to the blood flow environment.

The DRL optimization has then been run in similar fashion over a total of 58 episodes (464 simulations) using a setpoint of 36.3 dyne/cm$^{2}$ (MWSS₀ for this patient is 72.6 dyne/cm$^{2}$ ,) and evaluated after convergence (accounted for when the agent outputs a majority of one specific design over several episodes). The reward convergence history in . 9 evidences good convergence after 45 episodes (representing 360 simulations, hence 360 out of the 2500 possible designs). The DRL agent then starts to sample the specific stent shown in Fig. 9d, whose red patch singles out the region of interest located in front of the aneurysm neck. The latter earns a MWSS value is 37.2 dyne/cm$^{2}$ , which differs from the intended setpoint by 3%. It is made of 30 wires (15 in each braiding direction, distributed into four groups of 5, 3, 4 and 3 wires, respectively) braided with winding factor 25. This yields in a nominal average porosity of 61%, with pore densities (in deployed state) ranging between 4.1 and 11.2/mm² in the neck region facing the aneurysm. We note that the convergence is slightly less good than for patient A, which may be because the neck of aneurysm B is larger, but the same ranges of design parameters have been used for both patients. This offers more leeway by allowing more threads to fit into the region of interest of patient B, which, combined to the fact that only a small number of discrete winding factors are evaluated, increases the sharpness of the reward, known to be detrimental to the conservative policy updates of the PPO algorithm⁴⁸.

Finally, the efficiency of this design (in fact different from the optimal determined for patient A) has been assessed by comparison of the pre- and post-operative hemodynamics computed after treatment with the optimal stent. The peak-systolic streamlines in Fig. 7b show that the blood flow is adequately diverted away from the neck and into the parent vessel. The blood velocity is reduced at the entry (0.38 m/s in magnitude) but also at the exit of the aneurysm (0.23 m/s, to be compared to 0.43 m/s without stent), while swirling is essentially suppressed, as the inflow jet now merely slides along the distal wall. This considerably reduces the high distal values of WSS, as the maximum local, instantaneous WSS is now about 80 dyne/cm$^{2}$ (again a reduction by about 45% with respect to the unstented case). We also note an almost homogeneous WSS pattern is restored in Fig. 8b, d, save for the low WSS region in the daughter sac, meaning that the latter could still grow on its own via inflammatory and apoptotic remodelling.

Discussion

Optimal stents feature a gradient of porosity

Reported results highlight the potential of DRL shape optimization for endovascular stenting of intracranial aneurysms. It should be emphasized the optimal designed generated by our DRL agent, all relative to the chosen reward functions, feature varying porosities in the region of interest (ROI) facing the aneurysm neck. This, by itself, could represent a breakthrough in the stent manufacturing industry, where such designs do not yet exist (to date, variable porosity can be achieved locally only by superposition of several flow-diverters, which increases the risk and the cost of the surgery). More importantly, the optimal porosity gradient differs for both aneurysms, which paves the way for developing novel devices tailored to the patient specific aneurysm, including (but not limited to) its geometry and pulse.

The results show that the optimal stents successfully cut down the blood velocity at the entry and the exit of the aneurysm while also altering the swirling flow inside the aneurysm, either by subtly modifying the swirling direction (patient A), or by suppressing swirling altogether (patient B). Nonetheless, the complexity of the correlation between local porosity distribution, flow deviation and hemodynamics makes it difficult to unravel the exact physical mechanism behind the efficiency of this or that design, although it is common knowledge that the stent must allow blood in and out to avoid the occurrence of too-low WSS values, while sufficiently impeding the blood flow associated with the highest values of WSS in the aneurysm. From this perspective, the DRL approach is beneficial in two important respects: first, it is efficient, even though the parameter spaces are large and it may be costly to identify optimal designs from simple parametric searches. Second, and more significantly, it succeeds in discovering optimal designs from unforeseen parameter combinations, without any priori knowledge or assumptions about hemodynamics concepts.

CFD modelling assumptions and limitations

Computational blood flow modelling in intracranial aneurysms has tremendous potential, yet limited applicability in a clinical context because of the simplifying assumptions that are traditionally (and often implicitly) made^57,58. Chief among them is the fact that walls are almost always assumed rigid, while arteries are compliant vessels, i.e., they deform under the shear stress of blood flow, with possibly large displacements impacting the WSS estimates (the authors in⁵⁹ report 10-30 % WSS reductions compared to rigid wall simulations). A two-way coupled fluid-structure interaction (FSI) analysis is thus necessary to solve accurately the mechanical exchanges between the blood flow and the arterial tissue, while also encompassing the stent deformation occurring under load conditions (by the blood flow and/or the arterial tissue), another important factor that may alter the porosity at the neck and impair the long-term efficiency^60,61. One ongoing debate regarding the need to include the effects of compliance comes down to whether the uncertainties or inaccuracies in the data needed to model its effects may mask any perceived benefit of doing so: on the one hand, it has been acknowledged that improved computational models should incorporate patient-specific, spatially varying wall thicknesses, as uniform wall properties and thicknesses based on literature values will fail to represent inter- and intra-individual variations^62,63. On the other hand, it is feasible to measure individual wall properties by imaging and inverse modelling techniques, but such non-linear analyses introduce substantial uncertainties, for instance imaging can distort wall thickness measurements in a way that can be difficult to detect or correct⁶⁴. In this regards, it is reasonable and expedient to use a rigid wall model for the present purpose of showcasing the use of DRL techniques for image-based CFD hemodynamics optimization (without any consideration of being directly applicable real medical cases), while leaving to future research to more fully address this issue and close the methodological gap of providing high-fidelity hemodynamic data. Finally, achieving the fine deployment of the stent in the arterial vessel was not in the scope of this study. A realistic deployment, along with a more versatile stent parametrization would expand the possible configurations and surely lead to fascinating results, which could drastically impact the medical community.

DRL reward function

There are two main aspects worth discussing regarding the DRL reward function used herein. First, the design of a feasible reward function is one of the challenges in reinforcement learning problems, but one that is barely discussed in the available literature. In the absence of best practice guidelines, it is essentially a trial-and-error exercise, with a human expert defining an initial reward function based on his/her knowledge of the problem, observing how the agent performs, then tweaking the reward function to achieve greater performance. We use here a reward function aligned with the objective function, meaning that when the agent is learning to maximise this reward, it is also learning to minimize the distance between the post-stent maximum value of MWSS over a cardiac cycle and the setpoint of half the pre-stent value. This allows reducing WSS while preventing the occurrence of very low WSS values, which is consistent with the expected outcome of a stenting operation (in the absence of further quantitative information or reduction objectives). A more sophisticated approach to pursue in future work could be to force the WSS to remain in a physiological range (that could be defined from patient-specific data) at every point in the bulge, using for instance a local reward function defined as

$$\begin{aligned} r = \int _S r_{loc}({\varvec{x}}) \,ds\quad \text{ with }\quad r_{loc}({\varvec{x}}) = {\left\{ \begin{array}{ll} \text {WSS}({\varvec{x}}) - \text {WSS}_{inf} &{} \text {if }\;\; \text {WSS}({\varvec{x}})< \text {WSS}_{inf},\\ 0 &{} \text {if }\;\; \text {WSS}_{inf} \le \text {WSS}({\varvec{x}}) \le \text {WSS}_{sup},\\ \text {WSS}_{sup} - \text {WSS}({\varvec{x}}) &{} \text {if }\;\; \text {WSS}_{sup} < \text {WSS}({\varvec{x}})\,. \end{array}\right. } \end{aligned}$$

(2)

Second, the reward uses MWSS as the sole predictor of aneurysm rupture, which implicitly assumes that a brief exposure to extended regions of high WSS is key towards predisposing the aneurysm wall to weakening and rupture. On the one hand, this suffices to lay the foundation for future research in this field, given the wide acceptance of WSS as a key factor in the physiological and pathological response of cerebral arteries. On the other hand, a gap of knowledge remains on this issue (for instance, both high or low WSS have been separately correlated to aneurysmal formation and growth^{9,19,20,65,66}), and enriched reward functions (encompassing the time and space-dependent influence of blood dragging at the aneurysm wall, both in magnitude and in direction) are likely needed to improve clinical relevance. In this regards, it is worth insisting that the presented framework is highly generalizable, in the sense that it can assess new concepts of flow-deviator stents with respect to any or any combination of the markers of disturbed blood flow that have surfaced in recent publications (WSS gradient, oscillatory shear index, relative residence time, to name a few), that reflect different assumptions being made about the hemodynamic conditions driving the progression of intracranial aneurysms toward rupture^{53,67,68,69,70}. This falls under the scope of multi-objective DRL for which there are two main approaches. The most common way is to use a linear function to transform the multi-objective problem into a standard single-objective problem. Another interesting (but very costly) strategy is to explicitly separate the individual components of the reward function, in order to better understand the policy trade-off (the related methods, based on the Pareto optimum, are not yet frequently applied to DRL problems).

DRL algorithm

Future work should aim at further improving the flexibility of the proposed framework by allowing more realistic stent geometries (in terms of wire radius, number of wires), thus increasing the number of possible stent designs. Having a more continuous optimization space (by increasing the number of winding factors) will also undoubtedly improve the convergence of the PPO algorithm. From this standpoint, it should to be emphasized that this is a proof-of-concept study and that convergence and efficiency (i.e., the number of stent designs that need to be evaluated to reach convergence) could be accelerated by hyper-parameter tuning or using pre-trained deep learning models (as is done for instance in transfer learning). Generally, the rather simplistic PPO framework could be substituted by a more elaborated algorithm, for instance Policy-based Optimization (PBO)⁷¹, another single-step reinforcement algorithm that samples actions from full covariance matrices, and is theoretically better suited to represent higher order logic and to handle complex parameter interactions.

Future research directions

The purpose of this study is to lay out the foundation for future research in this field. We anticipate that large-scale studies with long-term follow-up will allow developing more reliable risk-prediction models. As envisioned by Meng et al.¹⁹, we picture that intracranial aneurysms could be sorted into different categories associated with different predictors reflecting different growth and rupture mechanisms (say, high WSS and positive WSS gradient for narrow-necked aneurysms vs. low WSS and high fluctuations of WSS orientation for wide-necked aneurysms), at which point a high-fidelity DRL-CFD hemodynamics framework accurately modelling the elastic deformation of the parent artery will be instrumental in providing clinically relevant, patient-specific stent designs (except for the unpredictable delivery manipulations and variations of vessel geometry occurring during the intervention that still might impact the stent implantation). By then, it is reasonable to expect that further developments in the fast-moving field of deep reinforcement learning will allow for faster convergence and lesser execution load (using, e.g., auto-encoders and systematic state compression, or on-the-fly generation of surrogate models with uncertainty level prediction). This should set up a framework fast enough to inform design in a matter of hours rather than days, which in turn will reliably augment the current clinical diagnostics capabilities. Another reason to push DRL forward in this context is the ability of neural networks to transfer knowledge from previous experiences, to quickly adapt to different environments (i.e., different patient-speciffic numerical models of intracranial aneurysms, corresponding to new patients in practical applications) and effectively learn new tasks (i.e., different rewards, to achieve further refinement of risk prediction). We expect that this will be a key feature to reduce learning time and improved neural network performance, as progress are made towards realizing the clinical utility of CFD for assessment of intracranial aneurysm rupture.

Methods

Clinical and imaging data

Images obtained from 3 Tesla MRI (magnetic resonance imaging) and 3D-DSA (digital subtractiob angiography) to create the adequate geometry for the simulation and the pulse to impose the flow. All images have been acquired at the University Hospital - LMU Munich.

Stent model

Virtual stenting relies on a naive stent generator inspired by⁷², in which 2n wires are wrapped around the toroidal envelope parametrized by

$$\begin{aligned} ((r\cos \theta + R)\cos \frac{s}{R}\,,(r\cos \theta + R)\sin \frac{s}{R}\,,r\sin \theta )\,,\qquad (\theta , s)\in [0;2\pi ]\times [0;l]\,, \end{aligned}$$

(3)

where r and R are the minor and major radii of the torus, and l is its centerline length. The wire centerlines follow hhelical curves generated from a circular basis, that in turn provides the scaffold for the struts. Circular profiles are then extruded along the splines to generate the final wires with diameter $d=60\,\mu m$. Nominal heterogeneous functional parameters (yet homogeneous within a given group of wires) functional parameters, e.g., braiding angle, porosity (the percentage ratio of the wire-free surface area) and pore density (the number of pores per unit surface area) are obtained by braiding two by two parallel wires from $n_{j}$ initial positions uniformly distributed in each quadrant of the proximal end section (labelled counter-clockwise, adjusting the origin of azimuthal angle for the first quadrant of the cylinder to be mapped into the upper outer quadrant of the torus). In practice, all wire paths are actually computed under a slowly varying envelope approximation using

$$\begin{aligned} r(s)=r_{prox}\left( 1-\frac{s}{l}\right) + r_{dist}\frac{s}{l}, \end{aligned}$$

(4)

to fit the weak variations in the minor radius caused by the irregular patient-specific vascular geometry. This is because the variations for the cases documented herein (by about 11% relative to the average value) have been found to be well modelled by affine transformations, but more complex analytical functions can be specified as well. Examples of generated stents are given on Fig. 10.

Computational domain and mesh

The medical imaging data of both patients is segmented using the 3D Slicer software. The entire proximal portion of the parent artery visible in the images is reconstructed, for which 3D Slicer outputs point coordinates and connectivity of the centerline, together with the corresponding vessel radius (as defined by the minimal distance from the centerline to the vessel boundary) and curvature. A non-shrinking filter⁷³ is used as an additional step of shape regularization to obtain smooth surface triangulation of the aneurysm lumen and connected vessel walls (which helps mitigate the effect of inner surface roughness). All vessels are truncated at some distance from the aneurysm bulge and extended with straight cylindrical pipes closed perpendicularly to their axis, to allow for flow development and ease the subsequent application of inflow/outflow boundary conditions.

For each patient, three-dimensional unstructured isotropic meshes of the vascular domain and stent devices are generated with the Gmsh software⁷⁴, after which the vascular grid is finely and anisotropically refined with the method described in⁷⁵. This allows evaluating any stent design sampled by DRL on the same vascular mesh, and thus yields considerable saving in the overall computational time. For the stent devices, 4 mesh points are allocated across any wire diameter, which is a reasonable compromise to assess feasibility while producing qualitative results to build on, as it yields a number of mesh elements that stands well below the few ten million elements reported in previous studies^54,55. This in turn keeps the computational cost affordable, which is mandatory given that optimization requires evaluating the performance of hundreds of stent designs.

Computational hemodynamics framework

The blood flow is mathematically modelled after the three-dimensional incompressible Navier–Stokes equations

$$\begin{aligned} \nabla \cdot \varvec{u}=0\,,\qquad \qquad \rho (\partial _{t}\varvec{u}+ \varvec{u}\cdot \nabla \varvec{u})= \nabla \cdot (-p{{\textbf {I}}}+2\mu \varvec{\varepsilon }(\varvec{u}))\,, \end{aligned}$$

(5)

where $\varvec{u}$ is the velocity field, p is the pressure, $\varvec{\varepsilon }(\varvec{u})$ is the rate-of-deformation tensor, $\rho =$ 1050 kg/m$^3$ is the constant blood density, and $\mu$ is the non-Newtonian blood viscosity evaluated from the Carreau–Yasuda law using zero-shear rate viscosity $\mu _0=0.0456$ Pa.s, infinite-shear rate viscosity $\mu _\infty =0.00320$ Pa.s, relaxation time $\tau =10.03$ s, power law index $n=0.344$ and transition parameter $a=1.25$ (all values for a hematocrit of 40 % and a temperature of 37 $^\circ$C). The instantaneous wall shear stress whose peak value over a cardiac cycle is used for reward evaluation is computed as

$$\begin{aligned} \text {WSS}=\frac{3n+1}{4n}\mu \dot{\gamma }\delta _{\text {sac}}\,, \end{aligned}$$

(6)

where $\dot{\gamma }=(2\varvec{\varepsilon }(\varvec{u})\!:\!\varvec{\varepsilon }(\varvec{u}))^{1/2}$ is the wall shear rate defined as the second invariant of the rate-of-deformation tensor, $\delta _{\text {sac}}$ is a boolean representation of the aneurysm surface (obtained by embedding a portion of the adapted body-fitted grid truncated to remove the extra-aneurysmal domain), and the prefactor is the Weissenberg–Rabinowitsch correction for shear-thinning effects⁷⁶. Since the elastic motion of the arterial wall is overlooked as a first approximation, simple open flow conditions are used, that consist of no-slip conditions at the solid nodes, zero-stress outflow conditions, and pulsatile, parabolic inflow condition

$$\begin{aligned} \varvec{u}=\frac{2Q(t)}{\pi r^2}\left( 1-\frac{||{\varvec{x}}||^2}{r^2}\right) \varvec{n}, \end{aligned}$$

(7)

where $||{\varvec{x}}||$ and $\varvec{n}$ are respectively the distance to the centerline and the normal vector in the inlet section of the parent artery, and Q is the time-dependent, volumetric flow rate adjusted at each time step to 2D-PCMRI measurements of the patients cross-sectionally averaged blood velocity (using linear regression from the two closest data points whenever the simulation and acquisition times do not coincide).

Variational multiscale modeling

A stabilized weak form of Eq. (5) is solved with a finite element variational multiscale method (VMS^77,78,79). Such an approach consists in splitting the solution into coarse and fine-scale components, each corresponding to a different level of resolution. Only the large scales are fully represented and resolved at the discrete level. The fine scales are approximated in a way such that their effect into the large-scale equations is modelled after consistently derived source terms proportional to the residual of the resolved scale solution. Exhaustive details in⁸⁰ regarding the derivation of the stabilized formulations lead to the following weak form for the large scale

$$\begin{aligned}&(\rho (\partial _t^{}\varvec{u}+\varvec{u}\cdot \nabla \varvec{u})\,,\,\varvec{w})+(2\mu \varvec{\varepsilon }(\varvec{u})\,,\,\varvec{\varepsilon }(\varvec{w})) -(p\,,\,\nabla \cdot \varvec{w})+(\nabla \cdot \varvec{u}\,,\,q)\nonumber \\&\quad =\sum _{K\in \mathcal {T}_h}\left[ (\tau _M\mathcal {R}_M\,,\,\varvec{u}\cdot \nabla \varvec{w}+\nabla q)_K +(\tau _C\mathcal {R}_C\,,\,\nabla \cdot \varvec{w})_K \right] , \end{aligned}$$

(8)

where $(\,,\,)$ is the $L^2$ inner product on the computational domain, $(\,,\,)_K$ is the inner product on element K, $\varvec{w}$ and q are relevant test functions for velocity and pressure, $\mathcal {R}_{C,M}$ are the governing equations residuals

$$\begin{aligned} -\mathcal {R}_C=\nabla \cdot \varvec{u}\,,\qquad \qquad -\mathcal {R}_M=\rho (\partial _t^{}\varvec{u}+\varvec{u}\cdot \nabla \varvec{u})+\nabla p\quad \end{aligned}$$

(9)

and $\tau _{C,M}$ are ad-hoc mesh-dependent stabilization parameters (comparable to local coefficients of proportionality) defined in^81,82.

We solve Eq. (8) with an in-house VMS solver whose accuracy and reliability is assessed in a series of previous papers, see^82,83 for a detailed mathematical formulation of the IVM in the context of finite element VMS methods, and^84,85 for applications to non-Newtonian flows in complex geometry. Equal order, linear interpolation is used for spatial discretization of the velocity and pressure variables (as the inf-sup condition does not need to be satisfied due to the additional stabilization terms). Time-stepping is first-order accurate and combines explicit (for the VMS stabilization parameters), implicit (for the viscous, pressure and divergence terms), and semi-implicit integration schemes (for the time derivatives, convection terms and VMS source terms, using backward differentiation formula and Newton–Gregory backward polynomial). The time-step is set to 0.02 s, which allows distributing 40 and 46 points per cardiac cycle for aneurysm A and B, respectively. All linear systems are preconditioned with a block Jacobi method supplemented by an incomplete LU factorization, and solved with the GMRES algorithm, with tolerance threshold set to $10^{-6}$.

Computational hemodynamics framework with deep reinforcement learning

The stent design is optimized solving a decision-making problem with reinforcement learning (RL), a process by which an agent learns to earn rewards through trial-and-error interaction with its environment. At each turn, the agent observes the state $s_t$ of the environment and takes an action $a_t$, that prompts both the transition to the next state $s_{t+1}$ and the reward received $r_t$. This repeats until the agent has learnt the succession of actions maximizing its cumulative reward over an episode (i.e., the reference unit for agent update, best understood as one instance of the scenario in which it takes actions). In the present context, the environment is a patient-specific CFD simulation of aneurysm hemodynamics after implantation of flow-diverting stent, that uses the computational hemodynamics framework described above. The agent is a policy represented by a deep neural network (a collection of artificial neurons that learns to represent a non-linear relation between input and output spaces, hence deep RL or DRL) trained with a RL algorithm, as reviewed in the next sections. The environment and the agent are coupled two-way, as illustrated in Fig. 11 : on the one hand, the actions sampled by the DRL agent (a set of five variables corresponding to four number of wires and a winding factor ) are used to generate the stent meshes immersed in the CFD simulation. On the other hand, the reward function needed by the agent to learn (here, the maximum value of MWSS) is obtained by post-processing of the CFD data.

DRL agent

A fully connected neural network is used, whose neurons are stacked in layers, each of which maps the biased weighted sum of their inputs through an activation function to produce their outputs and propagate the information forward from the input to the output layer via “hidden” layers (we use here 2 such hidden layers, each with 4 neurons feeding hyperbolic tangent activation functions). The network is trained with the single-step PPO algorithm, that learns a five-dimensional (four numbers of wires plus a winding factor) multivariate normal distribution whose mean and variance depend on the network weights and biases. Single-step PPO is a variation of the proximal policy optimization algorithm (PPO²⁵) intended for situations where the optimal policy is independent of state, whose relevance for open-loop flow control is assessed in⁴⁸. Just like PPO, it uses gradient ascent to maximize the surrogate loss

$$\begin{aligned} {\mathbb {E}}_{a\sim \pi _\theta } \left[ \min \left( \frac{\pi _\theta (a)}{{\pi _\theta }_{old} (a)} , 1+\epsilon {{\,\mathrm{sgn}\,}}{\left( \widehat{A}^{\pi _\theta }(a)\right) }\right) \widehat{A}^{\pi _\theta } (a)\right] \,, \end{aligned}$$

(10)

where $\pi _\theta (a)$ is the policy, i.e., a probability distribution of actions $\pi _\theta (a)$ parameterized by a set of free parameters $\theta$ (here the weight and biases of the deep neural network) that determines the agent behaviour, $\widehat{A}^{\pi _\theta }$ is a biased estimator of the advantage function $A^{\pi _\theta }$ measuring the gain of taking action a over the average value (here its normalization to zero mean and unit variance), and $\epsilon$ is a clipping range defining how far away the new policy is allowed to go from the old. A positive (resp. negative) advantage increases (resp. decreases) the probability of taking action a, but always by a proportion smaller than $\epsilon$, otherwise the min kicks in (10) and its argument hits a ceiling of $1+\epsilon$ (resp. a floor of $1-\epsilon$). This conservatism inherited from the parent algorithm ensures that the current and new policies behave similarly (which prevents the agent from falling off a cliff and restarting with a locally bad policy, in which case the performance may collapse drastically and never recover). Another trait shared by the two algorithms is the lack of necessity for assumptions regarding the optimization problem to be solved and for fine-tuning of the network hyper-parameters (i.e., those parameters not learnt from data).

Where the two methods differ is that PPO seeks the optimal set of actions $a^\star$ earning the largest possible reward, while single-step PPO seeks the optimal state-action mapping $f_{\theta ^\star }$ such that $a^\star = f_{\theta ^\star } (s_0)$, where $s_0$ denotes some input state consistently fed to the agent for the optimal policy to eventually embody the transformation from $s_0$ to $a^\star$. Starting from a random mapping $f_{\theta _0}$ from $s_0$ to the policy determined by the free parameters initialization, the agent gets one attempt per episode at finding the optimal (i.e., it interacts with the environment only once per episode) before updating the policy. Another subtle difference is that PPO is actor-critic, i.e., it features an actor network that learns the policy, and a critic network that learns to estimate the advantage. Single-step PPO works without knowledge of the critic evaluations (and is thus not actor-critic) because the trajectory of state and actions consists of a single pair. The discount factor adjusting the trade-off between immediate and future rewards can thus be set to $\gamma =1$, in which case the advantage reduces to the whitened reward⁴⁸.

Our single step PPO method is based on the default open-source implementation of Stable Baselines (https://github.com/openai/baselines/tree/master/baselines/ppo2), for which a custom OpenAI environment has been designed with the Gym library⁸⁶. We have updated and connected the original code with our CFD library for simple reading and writing of the results (the code is shared publicly using the following link : https://github.com/jviquerat/pbo). The convergence properties are illustrated in Fig. 12 for a minimization test problem of two- and five-dimensional Rosenbrock functions, whose global minimum is notoriously difficult to catch for optimization algorithms and two-dimensional Branin function, that has two identical global minima. For this case, the single-step PPO-1 algorithm is benchmarked against classical ($\mu$-$\lambda$)-ES and CMA-ES evolutionary methods, all implemented in in-house production codes. To ensure a fair comparison, the initial parameters and starting points are identical for all methods. All runs are afforded the same budget, namely 500 evaluations (20 episodes with 5 parallel environments in PPO-1, 20 generations with 5 individuals per generation in evolutionary algorithms) for Rosenbrock and 50 evaluations for Branin (10 episodes/generations with 5 parallel environments/individuals per generation). A large initial standard deviation is used by default, to ensure a good exploration of the optimization domain. Finally, in order to emphasize flexibility and generalizability, all PPO runs are tackled without fine-tuning of the algorithm, i.e., all runs use the same meta-parameters as in Table 1. Performances are averaged over 10 runs, with standard deviations shown as the light shade around. As could have been expected, the search efficiency of CMA-ES yields the best overall performance, which reflects the benefit of efficiently elongating the research area with respect to the local shape of the cost function. Among isotropic exploration methods, PPO-1 achieves final cost levels similar to ($\mu$-$\lambda$)-ES, with faster convergence and better performance at intermediate stages (the final performance level ultimately saturates for the Rosenbrock function because the minimum is in a long, narrow valley, and PPO-1/($\mu$-$\lambda$)-ES use isotropically sampled approximations of the descent direction). The general picture to be drawn is that (1) PPO-1 exhibits strong performance compared to methods relying on similar isotropic search distributions, and (2) anisotropic search distributions are mandatory to outperform more advanced methods on a consistent basis, an issue that is being addressed in current research efforts by the authors⁸⁷.

Parallel data collection

In practice, actions are distributed to several environments running in parallel, each of which executes a self-contained MPI-parallel CFD simulation and feeds data to the DRL algorithm (hence, two levels of parallelism related to the environment and the computing architecture). All simulations are performed on a workstation of AMD EPYC 7502 processors. The algorithm waits for the simulations running in all parallel environments to complete, then shuffles and splits the rewards data set collected from all environments into several buffers (or mini-batches) used sequentially to compute the loss and perform a network update. This repeats for several epochs, i.e., several full passes of the training algorithm over the entire data set (which ultimately makes the algorithm slightly off-policy, since the policy network ends up being trained on samples generated by older policies). This simple parallelization technique is key to using DRL in the context of flow control applications, as estimating accurately the policy gradient requires assessing a sufficient number of actions drawn from the current policy, hence a large computational burden associated to reward computations for high-dimensional fluid dynamics problems (typically, the cost of a single call to the CFD solver times the number of evaluations required). In the same vein, it should be noted that the common practice in DRL studies to gain insight into the performances of the selected algorithm by averaging results over multiple independent training runs with different random seeds is not tractable, as it would trigger a prohibitively large CPU cost. The same random seeds have thus been deliberately used over the whole course of the study to ensure a minimal level of performance comparison between the two cases.

Table 1 PPO hyper parameters.

Full size table

Data availability

The datasets generated and analysed during the current study are not publicly available due the fact that they constitute an excerpt of research in progress but are available from the corresponding author on reasonable request.

References

Rinkel, G. J., Djibuti, M., Algra, A. & van Gijn, J. Prevalence and risk of rupture of intracranial aneurysms: A systematic review. Stroke 29, 251 (1998).
Article CAS PubMed Google Scholar
Vlak, M. H. M., Algra, A., Brandenburg, R. & Rinkel, G. J. E. Prevalence of unruptured intracranial aneurysms, with emphasis on sex, age, comorbidity, country, and time period: A systematic review and meta-analysis. Lancet Neurol. 10, 626 (2011).
Article PubMed Google Scholar
Rivero-Arias, O., Gray, A. & Wolstenholme, J. Burden of disease and costs of aneurysmal subarachnoid haemorrhage (aSAH) in the United Kingdom. Cost. Eff. Resour. Alloc. 8, 1 (2010).
Article Google Scholar
Wermer, M. J. H., van der Schaaf, I. C., Algra, A. & Rinkel, G. J. E. Risk of rupture of unruptured intracranial aneurysms in relation to patient and aneurysm characteristics: An updated meta-analysis. Stroke 38, 1404 (2007).
Article PubMed Google Scholar
Wardlaw, J. M. & White, P. M. The detection and management of unruptured intracranial aneurysms. Brain 123, 205 (2000).
Article PubMed Google Scholar
Sforza, D., Putman, C. M. & Cebral, J. R. Hemodynamics of cerebral aneurysms. Annu. Rev. Fluid Mech. 41, 91 (2009).
Article ADS PubMed PubMed Central MATH Google Scholar
Isaksen, J. G. et al. Determination of wall tension in cerebral artery aneurysms by numerical simulation. Stroke 39, 3172 (2008).
Article PubMed Google Scholar
Taylor, C. A. & Humphrey, J. D. Open problems in computational vascular biomechanics: Hemodynamics and arterial wall mechanics. Comput. Methods Appl. Mech. Eng. 198, 3514 (2009).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Shojima, M. et al. Magnitude and role of wall shear stress on cerebral aneurysm: Computational fluid dynamic study of 20 middle cerebral artery aneurysms. Stroke 35, 2500 (2004).
Article PubMed Google Scholar
Jou, L.-D., Lee, D. H., Morsi, H. & Mawad, M. E. Wall shear stress on ruptured and unruptured intracranial aneurysms at the internal carotid artery. Am. J. Neuroradiol. 29, 1761 (2008).
Article PubMed PubMed Central Google Scholar
Cebral, J. R., Mut, F., Weir, J. & Putman, C. M. Association of hemodynamic characteristics and cerebral aneurysm rupture. Am. J. Neuroradiol. 32, 264 (2011).
Article CAS PubMed PubMed Central Google Scholar
Xiang, J. et al. Hemodynamic-morphologic discriminants for intracranial aneurysm rupture. Stroke 42, 144 (2011).
Article PubMed Google Scholar
Jiang, B., Paff, M., Colby, G. P., Coon, A. L. & Lin, L.-M. Cerebral aneurysm treatment: Modern neurovascular techniques. Stroke Vasc. Neurol. 1, 93 (2016).
Article PubMed PubMed Central Google Scholar
Rajah, G., Narayanan, S. & Rangel-Castilla, L. Update on flow diverters for the endovascular management of cerebral aneurysms. Neurosurg. Focus 42, E2 (2017).
PubMed Google Scholar
Ravindran, K. et al. Mechanism of action and biology of flow diverters in the treatment of intracranial aneurysms. Neurosurgery 86, S13 (2020).
Article PubMed Google Scholar
Maragkos, G. A. et al. Overview of different flow diverters and flow dynamics. Neurosurgery 86, S21 (2020).
Article PubMed Google Scholar
McKenna, C. G. & Vaughan, T. J. A finite element investigation on design parameters of bare and polymer-covered self-expanding wire braided stents. J. Biomed. Mater. Res., Part B Appl. Biomater. 115, 104305 (2021).
CAS Google Scholar
Zaccaria, A., Pennati, G. & Petrini, L. Analytical methods for braided stents design and comparison with FEA. J. Mech. Behav. Biomed. Mater. 119, 104560 (2021).
Article PubMed Google Scholar
Meng, H., Tutino, V. M., Xiang, J. & Siddiqui, A. High WSS or low WSS? Complex interactions of hemodynamics with intracranial aneurysm initiation, growth, and rupture: Toward a unifying hypothesis. Am. J. Neuroradiol. 35, 1254 (2014).
Article CAS PubMed PubMed Central Google Scholar
Cebral, J. R. & Meng, H. Counterpoint: Realizing the clinical utility of computational fluid dynamics-closing the gap. Am. J. Neuroradiol. 33, 396 (2012).
Article CAS PubMed PubMed Central Google Scholar
Robertson, A. M. & Watton, P. Computational fluid dynamics in aneurysm research: Critical reflections, future directions. Am. J. Neuroradiol. 33, 992 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shobayashi, Y. et al. Intra-aneurysmal hemodynamic alterations by a self-expandable intracranial stent and flow diversion stent: High intra-aneurysmal pressure remains regardless of flow velocity reduction. J. Neurointerv. Surg. 5, iii38 (2013).
Article PubMed Google Scholar
Zhang, Y., Chong, W. & Qian, Y. Investigation of intracranial aneurysm hemodynamics following flow diverter stent treatment. Med. Eng. Phys. 35, 608 (2013).
Article CAS PubMed Google Scholar
Bahdanau, D. et al. An actor-critic algorithm for sequence prediction. arXiv:1607.07086 (2016).
Schulman, J., Wolski, F., Dhariwal, P., Radford, A. & Klimov, O. Proximal policy optimization algorithms. arXiv:1707.06347 (2017).
Hwangbo, J. et al. Learning agile and dynamic motor skills for legged robots. Sci. Robot. 4, eaau5872 (2019).
Article PubMed Google Scholar
Pan, X., You, Y., Wang, Z. & Lu, C. Virtual to real reinforcement learning for autonomous driving. arXiv:1704.03952 (2017).
Deng, Y., Bao, F., Kong, Y., Ren, Z. & Dai, Q. Deep direct reinforcement learning for financial signal representation and trading. IEEE Trans. Neural Netw. Learn. Syst. 28, 653 (2017).
Article PubMed Google Scholar
Fox, I., Lee, J., Pop-Busui, R. & Wiens, J. Deep reinforcement learning for closed-loop blood glucose control. In Procs. Machine Learning for Healthcare Conference 508–536 (2020).
Zhou, S. K., Le, H. N., Luu, K., Nguyen, H. V. & Ayache, N. Deep reinforcement learning in medical imaging: A literature review. arXiv:2103.05115 (2021).
Caprara, S. Towards the integration of computational methods in spinal surgical planning: A combination of deep learning, statistical, and finite element methods, Ph.D. thesis, Eidgenössische Technische Hochschule Zürich (2021).
Ren, F., Rabault, J. & Tang, H. Flow shape design for microfluidic devices using deep reinforcement learning. arXiv:1811.12444 (2018).
Yan, X., Zhu, J., Kuang, M. & Wang, X. Aerodynamic shape optimization using a novel optimizer based on machine learning techniques. Aerosp. Sci. Technol. 86, 826 (2019).
Article Google Scholar
Viquerat, J., Rabault, J., Kuhnle, A., Ghraieb, H. & Hachem, E. Direct shape optimization through deep reinforcement learning. arXiv:1908.09885 (2019).
Ma, P., Tian, Y., Pan, Z., Ren, B. & Manocha, D. Fluid directed rigid body control using deep reinforcement learning. ACM Trans. Graph. (TOG) 37, 1 (2018).
Google Scholar
Biferale, L., Bonaccorso, F., Buzicotti, M., Clark Di Leioni, P. & Gustavsson, K. Zermelo’s problem: Optimal point-to-point navigation in 2D turbulent flows using reinforcement learning. Chaos 29, 103138 (2019).
Article ADS MathSciNet CAS PubMed Google Scholar
Ren, F., Hu, H. & Tang, H. Active flow control using machine learning: A brief review. J. Hydrodyn. 32, 247 (2020).
Article ADS Google Scholar
Viquerat, J., Meliga, P., Larcher, A. & Hachem, E. A review on deep reinforcement learning for fluid mechanics: An update. Phys. Fluids 34, 111301 (2022).
Article ADS CAS Google Scholar
Belus, V. et al. Exploiting locality and translational invariance to design effective deep reinforcement learning control of the 1-dimensional unstable falling liquid film. AIP Adv. 9, 125014 (2019).
Article ADS Google Scholar
Bucci, M. A. et al. Control of chaotic systems by deep reinforcement learning. Proc. R. Soc. A 475, 20190351 (2019).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Novati, G., Mahadevan, L. & Koumoutsakos, P. Controlled gliding and perching through deep-reinforcement-learning. Phys. Rev. Fluids 4, 093902 (2019).
Article ADS Google Scholar
Novati, G. et al. Synchronisation through learning for two self-propelled swimmers. Bioinspir. Biomim. 12, 036001 (2017).
Article ADS PubMed Google Scholar
Verma, S., Novati, G. & Koumoutsakos, P. Efficient collective swimming by harnessing vortices through deep reinforcement learning. Proc. Natl. Acad. Sci. U.S.A. 115, 5849 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Rabault, J., Kuchta, M., Jensen, A., Réglade, U. & Cerardi, N. Artificial neural networks trained through deep reinforcement learning discover control strategies for active flow control. J. Fluid Mech. 865, 281 (2019).
Article ADS MathSciNet MATH Google Scholar
Tang, H., Rabault, J., Kuhnle, A., Wang, Y. & Wang, T. Robust active flow control over a range of reynolds numbers using an artificial neural network trained through deep reinforcement learning. Phys. Fluids 32, 053605 (2020).
Article ADS CAS Google Scholar
Paris, R., Beneddine, R. & Dandois, J. Robust flow control and optimal sensor placement using deep reinforcement learning. J. Fluid Mech. 913, 56 (2021).
Article MathSciNet MATH Google Scholar
Xu, H., Zhang, W., Deng, J. & Rabault, J. Active flow control with rotating cylinders by an artificial neural network trained by deep reinforcement learning. J. Hydrodyn. 32, 254 (2020).
Article ADS Google Scholar
Ghraieb, H., Viquerat, J., Larcher, A., Meliga, P. & Hachem, E. Single-step deep reinforcement learning for open-loop control of laminar and turbulent flows. Phys. Rev. Fluids 6, 053902 (2021).
Article ADS MATH Google Scholar
Hachem, E., Ghraieb, H., Viquerat, J., Larcher, A. & Meliga, P. Deep reinforcement learning for the control of conjugate heat transfer. J. Comput. Phys. 436, 110317 (2021).
Article MathSciNet MATH Google Scholar
Ghraieb, H., Viquerat, J., Larcher, A., Meliga, P. & Hachem, E. Single-step deep reinforcement learning for two- and three-dimensional optimal shape design. AIP Adv. 12, 085108 (2022).
Article MATH Google Scholar
Heller, R. S. et al. Neuro-ophthalmic effects of stenting across the ophthalmic artery origin in the treatment of intracranial aneurysms. J. Neurosurg. 121, 18 (2014).
Article PubMed Google Scholar
Malek, A. M., Alper, S. L. & Izumo, S. Hemodynamic shear stress and its role in atherosclerosis. JAMA 282, 2035 (1999).
Article CAS PubMed Google Scholar
Baek, H., Jayaraman, M. V., Richardson, P. D. & Karniadakis, G. E. Flow instability and wall shear stress variation in intracranial aneurysms. J. R. Soc. Interface 7, 967 (2010).
Article CAS PubMed Google Scholar
Appanaboyina, S., Mut, F., Löhner, R., Putman, C. M. & Cebral, J. R. Computational fluid dynamics of stented intracranial aneurysms using adaptive embedded unstructured grids. Int. J. Numer. Meth. Fl. 57, 475 (2008).
Article MathSciNet CAS MATH Google Scholar
Mut, F. et al. Image-based modeling of blood flow in cerebral aneurysms treated with intrasaccular flow diverting devices. Int. J. Numer. Method. Biomed. Eng. 35, e3202 (2019).
Article PubMed PubMed Central Google Scholar
Hachem, E., Kloczko, T., Digonnet, H. & Coupez, T. Stabilized finite element solution to handle complex heat and fluid flows in industrial furnaces using the immersed volume method. Int. J. Numer. Meth. Eng. 68, 99 (2012).
Article MathSciNet MATH Google Scholar
Berg, P., Saalfeld, S., Voß, S., Beuing, O. & Janiga, G. A review on the reliability of hemodynamic modeling in intracranial aneurysms: Why computational fluid dynamics alone cannot solve the equation. Neurosurg. Focus 47, E15 (2019).
Article PubMed Google Scholar
Saqr, K. M. et al. What does computational fluid dynamics tell us about intracranial aneurysms? A meta-analysis and critical review. J. Cereb. Blood Flow Metab. 40, 1021 (2020).
Article PubMed Google Scholar
Hsu, M.-C. & Bazilevs, Y. Blood vessel tissue prestress modeling for vascular fluid-structure interaction simulation. Finite Elem. Anal. Des. 47, 593 (2011).
Article MathSciNet Google Scholar
Bing, F. et al. Stents and flow diverters in the treatment of aneurysms: Device deformation in vivo may alter porosity and impact efficacy. Neuroradiology 55, 85 (2013).
Article PubMed Google Scholar
Makoyeva, A., Bing, F., Darsaut, T. E., Salazkin, I. & Raymond, J. The varying porosity of braided self-expanding stents and flow diverters: An experimental study. Am. J. Neuroradiol. 34, 596 (2013).
Article CAS PubMed PubMed Central Google Scholar
Raut, S. S., Jana, A., De Oliveira, V., Muluk, S. C. & Finol, E. A. The importance of patient-specific regionally varying wall thickness in abdominal aortic aneurysm biomechanics. J. Biomech. Eng. 135, 2569 (2013).
Article Google Scholar
Voß, S. et al. Fluid-structure simulations of a ruptured intracranial aneurysm: Constant versus patient-specific wall thickness. Comput. Math. Methods Med. 2016, 9854539 (2016).
Article PubMed PubMed Central Google Scholar
Antiga, L., Wasserman, B. A. & Steinman, D. A. On the overestimation of early wall thickening at the carotid bulb by black blood mri, with implications for coronary and vulnerable plaque imaging. Magn. Reson. Med. 60, 1020 (2008).
Article CAS PubMed Google Scholar
Boussel, L. et al. Aneurysm growth occurs at region of low wall shear stress: Patient-specific correlation of hemodynamics and growth in a longitudinal study. Stroke 39, 2997 (2008).
Article PubMed PubMed Central Google Scholar
Sugiyama, S.-I. et al. Hemodynamic analysis of growing intracranial aneurysms arising from a posterior inferior cerebellar artery. World Neurosurg. 78, 462 (2012).
Article PubMed Google Scholar
Mantha, A., Karmonik, C., Benndorf, G., Strother, C. & Metcalfe, R. Hemodynamics in a cerebral artery before and after the formation of an aneurysm. Am. J. Neuroradiol. 27, 1113 (2006).
CAS PubMed PubMed Central Google Scholar
Meng, H. et al. Complex hemodynamics at the apex of an arterial bifurcation induces vascular remodeling resembling cerebral aneurysm initiation. Stroke 38, 1924 (2007).
Article PubMed PubMed Central Google Scholar
Shimogonya, Y., Ishikawa, T., Imai, Y., Matsuki, N. & Yamaguchi, T. Can temporal fluctuation in spatial wall shear stress gradient initiate a cerebral aneurysm? A proposed novel hemodynamic index, the gradient oscillatory number (gon). J. Biomech. 42, 550 (2009).
Article PubMed Google Scholar
Kulcsár, Z. et al. Hemodynamics of cerebral aneurysm initiation: The role of wall shear stress and spatial wall shear stress gradient. Am. J. Neuroradiol. 32, 587 (2011).
Article PubMed PubMed Central Google Scholar
Viquerat, J., Duvigneau, R., Meliga, P., Kuhnle, A. & Hachem, E. Policy-based optimization: Single-step policy gradient method seen as an evolution strategy. Neural Comput. Appl. 35, 449 (2023).
Article Google Scholar
Bouillot, P. et al. Geometrical deployment for braided stent. Med. Image Anal. 30, 85 (2016).
Article PubMed Google Scholar
Taubin, G. A signal processing approach to fair surface design. In Procs. of the 22nd Annual Conference on Computer Graphics and Interactive Techniques 351–358 (1995).
Geuzaine, C. & Remacle, J.-F. Gmsh: A 3-d finite element mesh generator with built-in pre-and post-processing facilities. Int. J. Numer. Meth. Eng. 79, 1309 (2009).
Article MathSciNet MATH Google Scholar
Coupez, T. & Hachem, E. Solution of high-reynolds incompressible flow with stabilized finite element and adaptive anisotropic meshing. Comput. Methods Appl. Mech. Engrg. 267, 65 (2013).
Article ADS MathSciNet MATH Google Scholar
Macosko, C. W. Rheology: Principles, Measurements, and Applications (Wiley-VCH, 1994).
Google Scholar
Hughes, T. J. R., Feijóo, G. R., Mazzei, L. & Quincy, J.-B. The variational multiscale method—a paradigm for computational mechanics. Comput. Methods Appl. Mech. Eng. 166, 3 (1998).
Article ADS MathSciNet MATH Google Scholar
Codina, R. Stabilization of incompressibility and convection through orthogonal sub-scales in finite element methods. Comput. Methods Appl. Mech. Eng. 190, 1579 (2000).
Article ADS MathSciNet MATH Google Scholar
Bazilevs, Y. et al. Variational multiscale residual-based turbulence modeling for large eddy simulation of incompressible flows. Comput. Methods Appl. Mech. Eng. 197, 173 (2007).
Article ADS MathSciNet MATH Google Scholar
Hachem, E., Rivaux, B., Kloczko, T., Digonnet, H. & Coupez, T. Stabilized finite element method for incompressible flows with high Reynolds number. J. Comput. Phys. 229, 8643 (2010).
Article ADS MathSciNet CAS MATH Google Scholar
Codina, R. Stabilized finite element approximation of transient incompressible flows using orthogonal subscales. Comput. Methods Appl. Mech. Eng. 191, 4295 (2002).
Article ADS MathSciNet MATH Google Scholar
Hachem, E., Feghali, S., Codina, R. & Coupez, T. Immersed stress method for fluid-structure interaction using anisotropic mesh adaptation. Int. J. Numer. Meth. Eng. 94, 805 (2013).
Article MathSciNet MATH Google Scholar
Hachem, E., Digonnet, H., Massoni, E. & Coupez, T. Immersed volume method for solving natural convection, conduction and radiation of a hat-shaped disk inside a 3d enclosure. Int. J. Numer. Method Heat Fluid Flow 22, 718 (2012).
Article MATH Google Scholar
Pereira, A., Larcher, A., Hachem, E. & Valette, R. Capillary, viscous, and geometrical effects on the buckling of power-law fluid filaments under compression stresses. Comp. Fluids 190, 514 (2019).
Article MathSciNet MATH Google Scholar
Valette, R. et al. The effect of viscosity, yield stress, and surface tension on the deformation and breakup profiles of fluid filaments stretched at very high velocities. J. Non-Newton. Fluid. 263, 130 (2019).
Article CAS Google Scholar
Brockman, G. et al. Openai gym. arXiv:1606.01540 (2016).
Viquerat, J., Duvigneau, R., Meliga, P., Kuhnle, A. & Hachem, E. Policy-based optimization: Single-step policy gradient method seen as an evolution strategy. arXiv:2104.06175 (2021).

Download references

Acknowledgements

Funded/Co-funded by the European Union (ERC, CURE, 101045042). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Research Council. Neither the European Union nor the granting authority can be held responsible for them.

Author information

Authors and Affiliations

MINES Paris, PSL Research University, Centre de mise en forme des matériaux (CEMEF), CNRS UMR 7635, 06904, Sophia Antipolis Cedex, France
E. Hachem, P. Meliga, A. Goetz, P. Jeken Rico, J. Viquerat, A. Larcher, R. Valette, V. Lannelongue, H. Ghraieb & R. Nemer
Department of Neuroradiology, University Hospital Munich (LMU), Munich, Germany
A. F. Sanches, Y. Ozpeynirci & T. Liebig

Authors

E. Hachem
View author publications
You can also search for this author in PubMed Google Scholar
P. Meliga
View author publications
You can also search for this author in PubMed Google Scholar
A. Goetz
View author publications
You can also search for this author in PubMed Google Scholar
P. Jeken Rico
View author publications
You can also search for this author in PubMed Google Scholar
J. Viquerat
View author publications
You can also search for this author in PubMed Google Scholar
A. Larcher
View author publications
You can also search for this author in PubMed Google Scholar
R. Valette
View author publications
You can also search for this author in PubMed Google Scholar
A. F. Sanches
View author publications
You can also search for this author in PubMed Google Scholar
V. Lannelongue
View author publications
You can also search for this author in PubMed Google Scholar
H. Ghraieb
View author publications
You can also search for this author in PubMed Google Scholar
R. Nemer
View author publications
You can also search for this author in PubMed Google Scholar
Y. Ozpeynirci
View author publications
You can also search for this author in PubMed Google Scholar
T. Liebig
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.H. developed the research idea, designed the numerical experiments and contributed to data collection and analysis. E.H. and P.M. wrote the initial content. P.M., J.V. and A.L. contributed to study design, data collection and analysis. A.G., P.J.R., V.L. and H.G. performed numerical experiments. R.V., A.S. and R.N. discussed results and contributed to data analysis. Y.O. and T.L. discussed results and critically reviewed all drafts. All authors reviewed the manuscript.

Corresponding author

Correspondence to E. Hachem.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hachem, E., Meliga, P., Goetz, A. et al. Reinforcement learning for patient-specific optimal stenting of intracranial aneurysms. Sci Rep 13, 7147 (2023). https://doi.org/10.1038/s41598-023-34007-z

Download citation

Received: 17 November 2022
Accepted: 22 April 2023
Published: 02 May 2023
DOI: https://doi.org/10.1038/s41598-023-34007-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.