Engineered cytochrome P450 for direct arylalkene-to-ketone oxidation via highly reactive carbocation intermediates

Ketones are crucial intermediates in synthesis and frequent moieties in many products. The direct regioselective synthesis of ketones from internal alkenes could simplify synthetic routes and solve a long-standing challenge in catalysis. Here we report the laboratory evolution of a cytochrome P450 enzyme for the direct oxidation of internal arylalkenes to ketones with several thousand turnovers. This evolved ketone synthase benefits from 15 crucial mutations, most of them distal to the active site. Computational analysis revealed that all these mutations collaborate to generate and tame a highly reactive carbocation intermediate. This is achieved through a confined, rigid, and geometrically and electrostatically preorganized active site. The engineered enzyme exploits a metal–oxo species for ketone synthesis and enables various challenging alkene functionalization reactions. This includes the catalytic, enantioselective oxidation of internal alkenes to ketones and formal asymmetric hydrofunctionalizations of internal alkenes in combination with other biocatalysts. The direct regioselective oxidation of internal alkenes to ketones poses an important synthetic challenge. Now, directed evolution of a cytochrome P450 enzyme affords a ketone synthase that can efficiently oxidize internal arylalkenes directly to ketones with high chemo- and regioselectivity.

Ketones are crucial intermediates in synthesis and frequent moieties in many products. The direct regioselective synthesis of ketones from internal alkenes could simplify synthetic routes and solve a long-standing challenge in catalysis. Here we report the laboratory evolution of a cytochrome P450 enzyme for the direct oxidation of internal arylalkenes to ketones with several thousand turnovers. This evolved ketone synthase benefits from 15 crucial mutations, most of them distal to the active site. Computational analysis revealed that all these mutations collaborate to generate and tame a highly reactive carbocation intermediate. This is achieved through a confined, rigid, and geometrically and electrostatically preorganized active site. The engineered enzyme exploits a metal-oxo species for ketone synthesis and enables various challenging a lk ene f un ct io na li zation reactions. This includes the catalytic, enantioselective oxidation of internal alkenes to ketones and formal asymmetric h yd ro fu nctionalizations of internal alkenes in combination with other biocatalysts.
Ketones are essential structural motifs in pharmaceuticals, agrochemicals, materials and natural products. They are also versatile reactants in a wide range of reactions, for example, in the synthesis of chiral alcohols and amines 1 . Thus, efficient catalytic methods to produce ketones have long been sought. Selective ketone synthesis via the simple aerobic oxidation of internal alkenes would represent a powerful addition to synthetic organic chemistry. This is because internal alkenes are easily accessible from petroleum and renewable resources as well as from well-established reactions such as olefin metathesis 2 and carbonyl olefination 3 . Due to the lack of efficient and selective catalysts, the hydroboration-oxidation method using toxic and expensive boranes is still widely used 4,5 . In addition to methods using stoichiometric reagents, various catalytic strategies to access ketones from internal alkenes have been developed. In particular, attempts have been made to expand the well-studied Wacker-Tsuji oxidation from terminal to internal alkenes [6][7][8][9] . However, internal alkenes are less reactive in such palladium-catalysed Wacker-type oxidations, resulting in high catalyst loadings and consequently low total turnover numbers (<20). In addition to activity, selectivity is also a major challenge in catalyst development (Fig. 1a). High regioselectivities in Wacker-type oxidation reactions depend on specific internal alkenes bearing directing groups that favour the formation of one ketone over the other. Many of these protocols are not aerobic oxidations or depend on stoichiometric reagents such as peroxides and phenylsilanes 10,11 . A recently published dual catalytic approach used water as the terminal oxidant and produced hydrogen gas as a by-product 12 . In general, efficient and regioselective aerobic oxidation of internal alkenes to ketones is not only a considerable challenge in catalyst development, but is also highly desired as it has the potential to streamline the synthesis of many important molecules.

Directed enzyme evolution
To explore whether the enzyme-controlled metal-oxo-mediated mechanism can be harnessed for efficient and selective ketone synthesis, we set out to evolve a promiscuous enzyme. We first aimed to identify a starting point for directed evolution by determining the activity and selectivity of various cytochrome P450 monooxygenases in the oxidation of internal alkenes. In particular, we focused on an in-house mutant library of a cytochrome P450 monooxygenase from Labrenzia agreggata (P450 LA1 ), which has been recently evolved to oxidize terminal alkenes to aldehydes 13 . trans-β-Alkylstyrenes have been chosen as model substrates because such internal arylalkenes are particularly challenging substrates in Wacker-type oxidation, yielding mixtures of ketone products 7,26 . Indeed, we recently confirmed the initial activity with this substrate type 13 , demonstrating that the coupled electronhydride transfer process (Fig. 1b) is accessible for ketone synthesis.
The variant P7E (P450 LA1 -T121A-V123I-N201K-H206W-N2 09S-I326V-Y385H-E418G) was selected from the initial screening as it showed reasonable selectivity and the highest level of product formation under the screening conditions ( Supplementary Fig. 4). Using trans-β-methylstyrene as the substrate, phenylacetone was produced with a total turnover number (TTN) of 225; however, the performance of the catalyst was limited by the low selectivity of the reaction (Fig. 2a). In addition to phenylacetone (31%), epoxidation (67%) and allylic oxidation (2%) were also observed. It is worth noting that variant P7E did not produce the regioisomeric ketone propiophenone ( Supplementary Fig. 5).
We next aimed to optimize the activity and selectivity of P7E by directed evolution. Eleven rounds of evolution uncovered twelve beneficial mutations to convert P7E into an efficient ketone synthase (Fig. 2a,b). The directed evolution experiment involved mainly site-saturation mutagenesis using the 22c approach 27 as well as combinatorial mutagenesis libraries (Supplementary Tables 1 and 2). In each round of directed evolution, usually six to eight amino acid positions of the haem domain were randomized individually, and high-throughput ultrahigh-performance liquid chromatography (UHPLC) analysis was used to identify the beneficial mutations ( Supplementary Fig. 6). Interestingly, the hit rate was rather low. Typically, we could identify only we applied directed evolution to engineer an enzyme that oxidizes terminal alkenes to the corresponding aldehydes, attaining opposite selectivity to the widely used Wacker-Tsuji oxidation reaction. The evolved iron haem-based biocatalyst achieved aldehyde formation by controlling an oxo transfer reaction using a high-valent metal-oxo complex as the catalytic oxidant (Fig. 1b). Many catalysts comprising such metal-oxo complexes, including cytochrome P450s 15,16 , peroxygenases 17 and Jacobsen's catalyst 18,19 , are used to efficiently epoxidize alkenes via highly concerted reaction pathways. The key to harnessing oxo transfer for carbonyl-selective alkene oxidation is to create a catalyst that is able to intercept the epoxidation and couple oxo transfer with an electron-hydride transfer process (Fig. 1b). This can be achieved by accessing a highly reactive carbocation intermediate and making it available for organic synthesis. While this is currently out of reach for small-molecule catalysts, our recent study 13 showed that reactive centres in enzymes can be optimized with high precisioWWn to fully access and exploit such reactive carbocation intermediates. Recent mechanistic studies on this oxo transfer reaction revealed that alkene epoxidation is a dynamically favoured process and that carbonyl formation can be accomplished by controlling the accessible conformations of the radical and carbocation intermediates 14 . In addition, electrostatic preorganization of the enzyme active site favours the formation of this key carbocation intermediate 14 .
Here we report a generalizable approach to develop enzymes for the aerobic oxidation of alkenes to ketones. In particular, we have expanded the enzymatic alkene-to-carbonyl oxidation from terminal to internal arylalkenes. This generates ketones with high activity and regioselectivity in a direct aerobic oxidation reaction. Key aspects of this study are the directed enzyme evolution of a ketone synthase as well as computational studies to rationalize and propose how more than a dozen beneficial mutations collaborate to access this catalytic cycle. Furthermore, we applied the ketone synthase in synthesis as a stand-alone catalyst to generate various benzyl ketones as well as in combination with established biocatalysts in cascade reactions. The latter application enabled the formal asymmetric hydration and hydroamination (Fig. 1c)   acids in the second and third shells. We stopped the directed evolution campaign after 12 rounds of evolution, having addressed a total of 78 amino acid positions at least once, which corresponds to 18% of the amino acid sequence of the haem domain. The final variant was named ketone synthase (KS) and produced phenylacetone with high activity (2,630 TTN) and chemoselectivity (75%) as well as with complete regiocontrol. The allylic oxidation of trans-β-methylstyrene was a further competing reaction pathway in the course of directed evolution, however, the epoxide was the only by-product observed using the final KS variant (Supplementary Figs. 7 and 8). Control experiments confirmed that epoxides were not isomerized by KS ( Fig. 2c and Supplementary  Fig. 9), highlighting that epoxides are not intermediates in the reaction and supporting that catalysis proceeds via the proposed coupled electron-hydride transfer mechanism. The evolved KS is 12-fold more active than the parent P7E variant and two orders of magnitude more efficient than previously reported catalysts ( Supplementary Fig. 1). The KS variant carries 18 mutations compared with the wild-type enzyme and 12 compared with P7E. Careful analysis revealed that 15 out of these 18 mutations are crucial to favour the ketone-forming pathway ( Fig. 2a and Supplementary Fig. 4). Interestingly, the beneficial mutations are not limited to the active site. Many selectivity-determining mutations were introduced in the second and third shells, and their individual contributions add up to facilitate access to the ketone product (Fig. 2d). In addition to additive mutations, we observed cooperative effects that can be well visualized by following position 210, which changed during the evolution from threonine to valine to isoleucine and back to valine, always contributing to increases in selectivity and activity (Fig. 2b) 28 . Overall, there was no single mutation that changed the chemoselectivity in the reaction by more than 8%.

Mechanistic insights from computational modelling
To unravel the molecular basis of the evolved KS, we performed computational modelling. In particular, we aimed to describe the enzymatic reaction mechanism and explain how the multiple distal mutations contribute to fully enable the carbonyl-selective reaction pathway.
We first used density functional theory (DFT) to analyse the intrinsic reaction mechanism and geometric requirements using a computational truncated model ( Fig. 3a; see Supplementary Methods and Supplementary Computational Part I for complete mechanistic descriptions). The calculations indicated that, as previously found for styrene and the formation of the corresponding aldehyde 13,14 , the epoxidation and ketone formation from substrate 1 share a common first step in which the first C-O bond is formed (TS1; Fig. 3a and Supplementary Fig. 19). In the doublet electronic state (TS1 d , Gibbs free energy difference between reactant and transition state (ΔG ‡ ) is 8.3 kcal mol −1 ), epoxide 3 is formed directly from TS1 via the formation of a second C-O bond; in the quartet electronic state (TS1 q , ΔG ‡ = 9.2 kcal mol −1 , relative Gibbs free energy (ΔΔG) is 0.9 kcal mol −1 higher than TS1 d ), a covalent radical intermediate is formed (Int1 q , Gibbs free energy difference (ΔG) is −13.7 kcal mol −1 ). This radical intermediate Int1 q (Fig. 3a) can follow the epoxidation pathway, forming the second C-O bond through a low-energy TS (TS2 q , ΔG ‡ = 1.6 kcal mol −1 ), or form a covalent carbocation intermediate via intramolecular electron transfer from the substrate moiety to the porphyrin Fe (Int2 q and Int2 d , ΔG = 1.7 and −2.2 kcal mol −1 , respectively). Int2 d can be formed via a minimum-energy crossing point (MECP) from Int2 q ( Fig. 3a and Supplementary Fig. 23). In this truncated model, the formation of carbocation Int2 is triggered by a conformational change, namely the rotation of the benzyl group of the substrate ( Fig. 3b and Supplementary Fig. 21). This allows optimal stabilization of the benzylic carbocation by maximizing hyperconjugation with the neighbouring σ(C-H) and σ(C-C) bonds as well as resonance with the aromatic ring, and by n→p interactions between the oxygen lone pairs and the empty carbocation p orbital ( Fig. 3b and Supplementary Fig. 22) 14 . In this geometry, an intramolecular hydrogen bond between the ortho hydrogen of the phenyl ring and the oxygen atom is also established. Once formed, the carbocation within this conformation has its empty p orbital well aligned with the H atom, which can undergo a barrierless 1,2-hydride migration to form the final ketone product 2 in both the doublet and quartet electronic states. DFT calculations indicated that the carbonyl pathway can energetically compete with the epoxidation pathway as both are very similar in energy (TS1 d versus TS1 q ), and that catalytic access to the ketone product depends on stabilizing the reactive intermediates in a specific conformation.
We previously found that intrinsic dynamic effects have an important influence on the reaction pathway after transition state TS1 and the formation of the first C-O bond 14 . The covalent radical intermediate  Int1 q formed from TS1 is kinetically activated, possessing a momentum vector that matches that of the epoxide-forming transition state TS2 ( Supplementary Fig. 24). The intrinsic strong preference for epoxide formation is proposed to be a consequence of the dynamic behaviour of the radical intermediate, which is formed with an excess of energy that is not statistically distributed before further reacting 14 . This strong dynamic match favours alkene epoxidation over carbonyl formation, which requires a specific carbocation geometry to promote the 1,2-hydride migration (Fig. 3b). Consequently, we hypothesized that the carbonyl selectivity achieved by the evolved KS derives from conformational control over the intermediates due to geometric restrictions in the enzyme active site 14 .
To study the enzymatic reaction, homology models for the parent P7E and evolved KS variants were first constructed (see Supplementary    Methods and Supplementary Computational Part II for modelling details). Extensive molecular dynamics (MD) simulations (five independent replicas of 1,000 ns each, 5,000 ns in total for each system) revealed important differences between the overall structures of KS and the parent enzyme P7E that affect the shape of the active-site cavity ( Supplementary Fig. 29). Visual inspection of the active site in the evolved KS reveals a more confined substrate binding pocket compared with the parent P7E ( Fig. 4a and Supplementary Fig. 30). This reshaped active site is expected to improve the binding of substrate 1 in a reactive conformation and might also be important to control the conformations of the subsequently formed reactive intermediates. Next, we aimed to characterize the reactive binding conformations of the substrate in the P7E and KS active sites. We used substrate 1 in docking calculations, which were refined by extensive restrained MD simulations (five replicas of 500 ns, 2,500 ns in total) to characterize reactive near-attack conformations (NACs) leading to the key TS1 (see Supplementary Methods and Supplementary Computational Part II for modelling details). The simulations revealed that the KS shows improved substrate binding in relevant reactive NAC conformations due to a more packed active site compared with P7E ( Fig. 4b and Supplementary Figs. 31 and 32). This increased packing is reflected in two distinct observations. First, a more pronounced rigidity was observed for the KS active-site residues upon substrate binding in the NAC conformation (Fig. 4c, ΔRMSF). This is especially highlighted by residues 121-124 of the B′C loop region and residues 273-284 in the I helix ( Fig. 4c and Supplementary Fig. 33). Second, stronger interactions occur in the KS active site between the substrate and surrounding residues ( Fig. 4c and Supplementary Fig. 34). The substrate bound in a reactive conformation shows stronger interactions with active-site residues L97, A121V, W211, A275 and I276 for KS compared with P7E. All these residues are located around the phenyl ring of the substrate and consequently restrict its accessible conformations (Fig. 4b).
The enhanced active-site packing also plays a crucial role in controlling the radical intermediate formed from TS1, as supported by quantum mechanics/molecular mechanics (QM/MM) calculations carried out on the evolved KS ( Fig. 4d and Supplementary Fig. 41). First, tight interactions with surrounding residues help the intermediate to dissipate its excess vibrational energy and achieve thermal equilibration once formed 14 . The QM/MM-optimized radical intermediate formed in the KS active site (KS-wat-Int1 q ) has the phenyl group tightly packed between the same amino acids of the B′C loop region and I helix as identified by NAC analysis of the substrate (Fig. 4e). Second, active-site tightening and steric interactions between the radical intermediate and the active-site residues facilitate access to the geometry required for effective carbocation formation (KS-wat-Int2 q ) Fig. 4e and Supplementary Fig. 42 for an analysis of the required conformational change). This geometry allows stabilization of the carbocation intermediate due to stereoelectronic effects and aligns the cis-β-hydrogen atom with the empty carbon p orbital for a fast and selective 1,2-hydride migration. These steric interactions also increase the barrier for the second C-O bond formation (KS-wat-TS2), disfavouring the epoxidation pathway ( Fig. 4e and Supplementary Fig. 41). These interactions mainly involve hydrophobic contacts between the substrate and active-site residues A275 and V278 (I helix), A121 and I123 (B′C loop), L97 (BB′ loop), and W329 (β1 region), which contribute to maintaining a tight packing of the substrate and intermediates along the reaction coordinate.
In addition to the geometric and steric effects exerted by the enzyme active site, the role of electronic and electrostatic effects was also analysed. Our calculations indicated that carbocation formation is also favoured by electronic effects due to polar interactions in the active site and electrostatic preorganization of the catalytic pocket. Computational modelling revealed the presence of an ordered water molecule in the active site, which, in the presence of the substrate, interacts via hydrogen bonds with T283 and the oxygen atom of the Fe-oxo complex (Supplementary Fig. 36). This ordered water molecule is reminiscent of the activation of molecular oxygen to generate the iron-oxo active species 29,30 . Computations showed that the presence of this hydrogen-bonded water molecule can promote the formation of these radical and carbocation intermediates ( Supplementary  Figs. 27 and 28  In summary, computational modelling suggests that the electrostatic preorganization of the active site, the polar interaction with an ordered water molecule and the strong conformational control imposed by the KS active site are responsible for ketone formation. This precise catalytic control promotes carbocation formation and destabilizes the epoxide-forming transition state compared with the truncated, enzyme-free, model DFT calculations.

Role of distal mutations on active site preorganization
To rationalize how numerous distal mutations (Fig. 2d) optimized the active site, we studied the enzyme conformational dynamics in variants P7E and KS using dynamic correlation networks. Dynamic networks describe connections between distal protein residues based on their correlated dynamic behaviour mediated by neighbouring residues (in sequence or three-dimensional space). It has been proposed that dynamic networks can modulate the transition between active and inactive conformations of enzymes 31 and impact active-site preorganization by controlling enzyme conformational flexibility and reducing non-productive conformations 32 .
Analysis of the holo enzymes revealed the emergence of an expanded dynamic network in KS compared with P7E. Shortest path maps (SPMs) 33 , which describe communication between neighbouring residues, show that the new enlarged dynamic network in KS is centred on the substrate binding pocket and completely surrounds it ( Fig. 5 and Supplementary Fig. 37). The expanded dynamic network includes residues in the I helix and the 116-124 B′C loop region that were not part of the original network in P7E, but are important for the conformational control of the substrate and intermediates (Fig. 4c), as described earlier. The enhancement and tuning of the protein dynamic network can be attributed to the newly introduced mutations, which is in line with previous studies [31][32][33][34] , including important epistatic effects, as in related cytochrome P450s that directly impact catalysis 28 . Although only four mutations appear as nodes in the SPMs in the KS, practically all of the introduced mutations, including the distal ones, appear at adjacent positions in three-dimensional space (Supplementary Figs. 37 and 38), thus contributing to the protein dynamic behaviour [31][32][33][34] . The enhanced dynamic network in the KS is not only maintained, but strengthened in the reactive substrate-bound state ( Fig. 5 and Supplementary Fig. 38), similarly to what has been observed in other laboratory-evolved enzymes 32,35 . Taking all this into account, the observed reshaping and rigidification of the active site in the KS can be attributed to the changes in the dynamic network of the evolved enzyme due to distal mutations.

Substrate scope and application in synthesis
To explore the potential applications of such a KS, we studied the substrate scope of this enzyme and performed reactions on a preparative scale. The KS with its confined active site accepted various structurally related trans-β-methylstyrene derivatives and produced the corresponding ketones with efficiencies of up to several thousand TTNs and selectivities of up to 79% (Fig. 6a). Various substitutions are tolerated, including substitutions in the ortho (11), meta (10) and para (4-8) positions of the aromatic ring as well as in the trans-β-alkenic position (9). We also tested various structurally more unrelated internal alkenes, but these were not accepted as substrates by the highly optimized active site (Supplementary Fig. 10). As an example, cis-β-methylstyrene was  converted by the KS into epoxides but not phenylacetone (Supplementary Fig. 11). This agrees with recent mechanistic findings 14 on the evolved cytochrome P450 that oxidized terminal alkenes to aldehydes 13 , highlighting that hydride migration during the catalytic cycle of these evolved enzymes based on P450 LA1 proceeds selectively from the cis position. This also supports our conclusions that catalysts optimized for metal-oxo-mediated alkene-to-ketone oxidation require a highly confined and preorganized reactive centre.
Next, we explored substitutions at the α-position to study the enantioselectivity of the enzymatic oxidation reaction. Even though trans-α,β-dimethylstyrene (17) was not used as substrate in the screening, and the enzyme therefore was not selected for enantioselectivity during the directed evolution, the KS enabled the synthesis of the chiral ketone 12 with good enantioselectivity towards the (S) enantiomer (enantiomeric ratio, 87:13, S/R; Fig. 6a and Supplementary Fig. 13). Stereoselectivity is achieved by enantiofacial discrimination during the 1,2-hydride migration after the first C-O bond formation, in line with previous observations for a related engineered enzyme 14 . The enantioselectivity detected is also consistent with the selectivity model proposed on the basis of the computational modelling carried out on trans-β-methylstyrene ( Supplementary Fig. 13). It is worth highlighting that catalytic asymmetric oxidation of internal alkenes to chiral ketones is currently unknown. Further engineering of this cytochrome P450 enzyme or other iron-dependent monooxygenases will reveal whether KSs can be generated with even higher enantioselectivity.
The activity of the KS could be further optimized by changing the reaction conditions. Lengthening the reaction time and application of a cofactor recycling system enabled the conversion of alkene 1 to ketone 2 with a TTN of up to 4,750 ( Supplementary Fig. 14). To demonstrate that these reactions can be performed on a preparative scale (1.0 mmol), phenylacetone (2) was synthesized using a catalyst loading of 0.025 mol% KS (Fig. 6b). The product was isolated in 61% yield, with atmospheric oxygen and glucose as the only stoichiometric reagents.

Formal asymmetric hydrofunctionalization of internal alkenes
Laboratory evolution of non-natural enzyme function can provide access to new synthetic pathways and solve long-standing challenges in synthesis [36][37][38] . In this regard, we performed asymmetric redox hydrations and hydroaminations on internal arylalkenes by combining the KS with ketoreductases 39,40 and imine reductases 41,42 in cascade reactions on a preparative scale (1.0 mmol). Using this set-up, the unactivated internal arylalkene 1 was converted into chiral phenylethanols and phenylethylamine, which are important structural motifs in top-selling pharmaceuticals ( Supplementary Fig. 15). The chiral alcohols 13 and 14 as well as amine 16 were produced with excellent enantioselectivity (enantiomeric ratio of up to >99:1) in isolated yields of 66, 69 and 39%, respectively (Fig. 6b). An important feature of these reactions is that (1) they only depend on simple stoichiometric reagents such as atmospheric oxygen and isopropanol (asymmetric hydration) or atmospheric oxygen and glucose (asymmetric hydroamination) and (2) simple unprotected amines (such as 15) can be used as amine donors. This approach can in principle be expanded to other internal arylalkenes (Fig. 6a) or other amines as amine donors 42,43 .
In addition, we aimed to generate two stereocentres using the trisubstituted alkene 17 as the starting material (Fig. 7). Alcohol 18 was obtained via ketone 12 with low activity but excellent selectivity, yielding a single stereoisomer. This approach combines the KS-catalysed enantioselective alkene-to-ketone oxidation via an asymmetric hydride migration with diastereofacial differentiation of the ketoreductase.
As internal unactivated arylalkenes are easily accessible and ketones are substrates for multiple (bio)catalysts 40,42,46-51 , many new synthetic routes can be envisioned using evolved KSs.

Conclusions
Our findings demonstrate that cytochrome P450 can be evolved to oxidize internal arylalkenes to ketones with high activity and selectivity. We used the evolved KS to directly convert internal arylalkenes into ketones, including a catalytic enantioselective example. In addition, we combined the KS with established biocatalysts to convert internal arylalkenes into chiral phenylethanols and phenylethylamines on a preparative scale with high activity, regio-and enantioselectivity. What stands out is that the KS enables many important, highly selective functionalization reactions of internal arylalkenes that have so far largely eluded efficient and selective catalysis. Similar to many other engineered enzymes that enable new catalytic reactions 52,53 , the substrate scope is initially limited. However, we envision that further protein engineering and laboratory evolution of the enzyme (or other cytochrome P450s) will expand the substrate scope to various alkenes, as has been demonstrated with many other enzymes that catalyse natural and non-natural chemical reactions.
Engineered enzyme active sites can exert an exceptional degree of control over reactive intermediates 53 . This is achieved in the KS evolution by generating a confined, rigid and preorganized active site through multiple mutations that enhance a pre-existing dynamic network. The evolved KS provides access to a highly reactive carbocation species and thus enables direct ketone production from high-valent metal-oxo species. We foresee that the high degree of confinement 54 in enzyme active sites could be generally leveraged to enable other challenging catalytic cycles by precisely controlling the accessible conformations of reactive intermediates. Furthermore, we also envision that this coupled oxo-electron-hydride transfer process could be fully exploited with less reactive alkyl-substituted alkenes as substrates.

Cloning and library creation
pET22b(+) was used as a cloning and expression vector for all P450 LA1 -derived enzyme variants described in this study. Site-saturation mutagenesis libraries were generated using the '22c-trick' method 27 . Primer sequences are available in Supplementary Data Table 1. The obtained PCR products were digested with DpnI, purified on agarose gel and ligated using Gibson Assembly Master Mix 55 . The DNA was further purified before transformation into electrocompetent Escherichia coli BL21(DE3) cells (E. cloni EXPRESS BL21(DE3), Lucigen). Typically, 90 transformants were analysed per 22c-trick library, resulting in significant oversampling. In addition, the quality of each variant library was examined by sequencing (quick quality control) 27 and only libraries with nucleic acid distributions that reflect the 22 codons used were screened. For combinatorial mutagenesis libraries, primers were designed following the multichange isothermal mutagenesis protocol and the libraries were constructed as described above 56 .

Expression of cytochrome P450 variant libraries in 96-well format
An overnight culture plate (96-well format) was filled with Terrific Broth (TB) medium (with double the amount of glycerol, 100 µg ml −1 ampicillin, 500 µl well −1 ) and each well was inoculated with a single colony.

Small-scale expression of cytochrome P450 variants
The P450 LA1 -derived enzyme variants with elevated selectivity found in each round of evolution were streaked on LB-based agar medium and incubated overnight at 37 °C. An individual colony was inoculated into the overnight culture in 5 ml LB medium (100 µg ml −1 ampicillin, final concentration) at 37 °C and 180 rpm. Expression cultures were inoculated with 500 µl of preculture into 50 ml TB medium (100 µg ml −1 ampicillin, 2× glycerol, final concentration) in a 250-ml flask without baffles and incubated for 2-3 h at 37 °C and 180 rpm until an optical density at 600 nm (OD 600 ) of 0.6-0.8 was reached. The flask was cooled on ice for 10 min before expression was induced (0.2 mM IPTG, 0.5 mM 5-aminolevulinic acid, final concentration). The induced cells were shaken for 20 h at 25 °C and 180 rpm. The cells were collected (3,220g, 10 min, 4 °C) and stored at −20 °C for at least 1 d.

Biotransformations
The pelleted and frozen cells were lysed in lysis buffer (

MD simulations
MD simulations in explicit water were performed using the AMBER18 package 64,65 . Parameters for the trans-β-methylstyrene (1) substrate were generated within the Antechamber 66 module in the AMBER18 package using the general AMBER force field (gaff2) with partial charges set to fit the electrostatic potential generated at the HF/6-31G(d) level by the restrained electrostatic potential model. Parameters for the haem compound I and the axial Cys were taken from ref. 67. The enzyme variants were solvated in a pre-equilibrated cubic box with a 10-Å buffer of transferable intermolecular potential water molecules using the AMBER18 leap module, resulting in the addition of ∼16,000 solvent molecules. Explicit counter ions (Na + or Cl − ) were introduced to neutralize the system. All subsequent calculations were performed using the Stony Brook modification of the Amber14 force field (ff14SB) 68 . A two-stage geometry optimization approach was used. In the first stage, the positions of solvent molecules and ions were minimized by imposing positional restraints on the solute by a harmonic potential with a force constant of 500 kcal mol −1 Å −2 , and the second stage involved an unrestrained minimization of all the atoms in the simulation cell. The system was gently heated in six 50 ps steps, increasing the temperature by 50 K in each step (0-300 K) under constant volume and periodic boundary conditions. Water molecules were treated using the SHAKE algorithm, keeping the angle between the hydrogen atoms fixed. Long-range electrostatic effects were modelled using the particle-mesh Ewald method. An 8-Å cut-off was applied to Lennard-Jones and electrostatic interactions. Harmonic restraints of 30 kcal mol -1 were applied to the solute, and the Langevin scheme was used to control and equalize the temperature. The time step was kept at 1 fs during the heating stages. Each system was then equilibrated for 2 ns with a 2 fs time step at a constant pressure of 1 atm and temperature of 300 K without restraints. Once the systems had been equilibrated in the constant-temperature and constant-pressure ensemble, production trajectories were run under the constant-temperature and constant-volume ensemble and periodic boundary conditions. In particular, a total of 5,000 ns of simulations in the absence of substrate were accumulated for variants P7E and KS from five independent replicas of 1,000 ns for each system. The Cpptraj module from Ambertools utilities was used to process and analyse the trajectories, including cluster analyses. POVME 3.0 was used to analyse active-site volumes 69 .

Docking and substrate-bound MD simulations
The most representative structures from the previous holo state simulations were characterized by clustering of the accumulated simulation time, considering the protein backbone root mean square deviation. These structures were used for docking calculations with substrate 1, which were performed using AutoDock Vina 70 . The docking results were used as starting points for the restrained MD simulations, in which the distance between the centre of mass of the alkene (defined by atoms C1 and C2) of substrate 1 and the oxygen atom of haem compound I was kept restrained during the MD simulation (3.0-3.5 Å, using a 100 kcal mol −1 Å −2 force constant). This allowed the catalytically relevant binding poses of the substrate to be explored when it is in a NAC conformation to make the oxidation reaction happen, largely refining the docking predictions and preventing undesired unbinding events during the simulations. The same protocol previously described for the MD simulations was applied. A total of five independent replicas of 500 ns of production trajectory each were accumulated for each system, accumulating a total of 2.5 µs of restrained MD simulation time for each system. Substrate-residue interactions were calculated using the pairwise per-residue free-energy decomposition and molecular mechanics with generalized Born and surface area solvation (MM-GBSA) approach as implemented in the MMPBSA.py module 71 from AmberTools18. The MM-GBSA energies of 500 structures (1 ns each) were calculated for each MD trajectory. The final reported MM-GBSA energy was the average of the 500 structures. SPMs for dynamic correlation network analysis were estimated using the module DynaComm.py (ref. 33).

QM/MM calculations
Initial structures for QM/MM modelling were selected from the substrate-bound restrained MD simulations of the KS. Representative snapshots were selected on the basis of the different binding poses explored by the substrate. All water molecules and counter ions beyond 3 Å from any residue of the protein, cofactors or substrates were removed. The QM region included the haem porphyrin pyrrole core, the Cys390 side chain, the iron centre and the whole substrate (1). The resulting QM region had a neutral charge, and both doublet and quartet low-lying electronic states were considered. All residues and water molecules outside a 12-Å shell from the QM region were kept frozen. QM/MM calculations were carried out using the ONIOM 72 approach as implemented in the Gaussian 09 package 58 . Geometry optimizations were performed with the hybrid (U)B3LYP functional [59][60][61] using an ultrafine integration grid and the 6-31G(d) basis set on all atoms except for iron, for which an SDD basis set and related SDD pseudopotential were used. The MM parameters and charges were identical to those used in the MD simulations. A two-step sequential optimization protocol using the QuadMacro optimization algorithm was used: a first optimization using a mechanical embedding scheme was initially performed, and once optimized, MM water molecules were kept frozen and a second optimization was performed within the electrostatic embedding scheme. Stationary points were verified as minima or saddle-point (transition-state) geometries after vibrational frequency analysis, and thermal corrections were obtained at 1 atm and 298.15 K. Single-point energy calculations on the optimized structures were performed at the (U)B3LYP/Def2TZVP level of theory within the electrostatic embedding scheme, and included empirical Grimme D3 dispersion corrections with Becke-Johnson (D3BJ) damping 63 .

Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability
The original materials and data that support the findings of this study are available within the paper and its Supplementary Information or can be obtained from the corresponding authors upon reasonable request.