Generative deep learning enables the discovery of a potent and selective RIPK1 inhibitor

Li, Yueshan; Zhang, Liting; Wang, Yifei; Zou, Jun; Yang, Ruicheng; Luo, Xinling; Wu, Chengyong; Yang, Wei; Tian, Chenyu; Xu, Haixing; Wang, Falu; Yang, Xin; Li, Linli; Yang, Shengyong

doi:10.1038/s41467-022-34692-w


Article
Open access
Published: 12 November 2022


Nature Communications volume 13, Article number: 6891 (2022)


Abstract

The retrieval of hit/lead compounds with novel scaffolds during early drug development is an important but challenging task. Various generative models have been proposed to create drug-like molecules. However, the capacity of these generative models to design wet-lab-validated and target-specific molecules with novel scaffolds has hardly been verified. We herein propose a generative deep learning (GDL) model, a distribution-learning conditional recurrent neural network (cRNN), to generate tailor-made virtual compound libraries for given biological targets. The GDL model is then applied to RIPK1. Virtual screening against the generated tailor-made compound library and subsequent bioactivity evaluation lead to the discovery of a potent and selective RIPK1 inhibitor with a previously unreported scaffold, RI-962. This compound displays potent in vitro activity in protecting cells from necroptosis, and good in vivo efficacy in two inflammatory models. Collectively, the findings prove the capacity of our GDL model in generating hit/lead compounds with unreported scaffolds, highlighting a great potential of deep learning in drug discovery.


Introduction

Identifying new starting active compounds that are substantially different in chemical structure from those already on the market or in development is a crucial step in the early stage of drug development. This task is mainly accomplished by high-throughput screening, either physically or virtually, against sets of known chemical libraries. However, due to the limited structural diversity in existing chemical libraries as well as repeated screening by various companies and institutes, it is becoming more and more difficult to retrieve active compounds with new scaffolds and establish intellectual property. De novo molecular design that computationally generates new molecules with desired properties has been proposed as a solution to this problem^1,2,3. Traditional de novo molecular design methods, which include structure-based^4,5,6, ligand-based^7,8, and pharmacophore-based methods^9,10, involve a relatively manual process that requires an experienced designer and explicit design rules. These methods are also predominately fragment based, and the quality and diversity of the generated molecules strongly depend on the fragment library and the algorithm used for fragment assembly¹.

Recently, generative deep learning (GDL) has emerged as a promising approach for de novo molecular design^3,11, where deep neural networks are employed as generative models. This approach is a completely data-driven de novo molecular design strategy without the need for explicit design rules, which can also avoid the fragment issue mentioned above. It has attracted much attention with several GDL models having been established to generate molecules, including recurrent neural network (RNN)-based^12,13, variational autoencoder (VAE)-based¹⁴, generative adversarial network (GAN)-based¹⁵, graph convolution network (GCN)-based¹⁶, and transformer-based models¹⁷. Detailed description and/or comparison of these models can be found in several recent reviews^11,18,19. Among these models, the RNN-based models are the most widely used ones, whose architectures are borrowed from the natural language processing (NLP) field with molecules being represented by a sequence of tokens, such as the simplified molecular input line entry systems (SMILES)²⁰. Owing to the mature theory system of RNN, several RNN-based GDL models proposed recently produced impressive results in generating new molecules. For example, Segler et al.¹² iteratively fine-tuned a stacked RNN to generate target-focused libraries and successfully reproduced active compounds from a hold-out test set. Moret et al.¹³ utilized RNN to develop a chemical language model (CLM) that enabled the discovery of new molecular entities in a low data regime. Gómez-Bombarelli et al.¹⁴ implemented a VAE model with RNN as a decoder, which could learn to generate novel compounds with high fidelity. Kotsias et al.²¹ proposed a conditional RNN (cRNN) model, in which additional molecular descriptors or fingerprints were incorporated into the RNN initial memory state to guide the subsequent generative process.

Although many GDL models, including RNN-based ones, perform well in generating molecules, a majority of them are designed to generate the best possible molecules to satisfy a predefined goal (goal-directed)²². These goal-directed models depend strongly on the goal functions, which may lead to the generation of molecules that are numerically superior but not practically useful^18,23. In addition, although most GDL models have been shown to be effective in theory, very few have been validated by wet-lab experiments¹¹. Furthermore, the chemical structures of molecules generated by these models are more or less similar to those of known active compounds against the same target. To address these issues, we here propose a GDL model based on a distribution-learning cRNN^21,22, which avoids the specification of a goal function and can generate new molecules following the same chemical distribution as the training set molecules. Our model incorporates transfer learning^13,24, regularization enhancement^25,26, and sampling enhancement^14,27 to enable the generation of molecules with previously unreported and diverse chemical scaffolds. This model was then applied to the discovery of receptor-interacting protein kinase 1 (RIPK1) inhibitors, followed by comprehensive in vitro and in vivo validations.

RIPK1 is a serine/threonine protein kinase that participates in a variety of signaling pathways involved in cell survival²⁸. In particular, RIPK1 is a key regulator of programmed cell necrosis (necroptosis)^28,29, which is closely related to the occurrence and development of various inflammatory and immune diseases³⁰. Mechanistically, when necroptosis is triggered by stimuli such as the tumor necrosis factor family of cytokines, RIPK1 is activated first. The activated RIPK1 then associates with and phosphorylates its downstream protein RIPK3, which subsequently recruits and phosphorylates the pseudokinase mixed-lineage kinase domain like (MLKL)^31,32. The phosphorylated MLKLs form oligomers and translocate to the cytomembrane to execute necroptosis³³. Owing to the central role of RIPK1 in necroptosis, it is considered a promising target for treating necroptosis-related diseases^30,34. A number of RIPK1 inhibitors have been reported and five are currently in clinical trials (phase I or II) for the treatment of nervous system diseases and/or inflammatory diseases, including DNL788 (Denali; NCT05237284), DNL758 (Denali; NCT04781816), GFH312 (Genfleet; NCT04676711), SIR1–365 (Sironax; trial registered on ANZCTR: ACTRN12621000745842p) and R-552 (Rigel and Lilly; NCT05222399). Among them, only the chemical structures of DNL758 and SIR1–365 have been disclosed so far. In this study, we collected compounds with activity against RIPK1 from various publications and patents and obtained a total of 1030 compounds (Supplementary Table 1). Figure 1 shows representative compounds with different scaffold types. However, most of these reported RIPK1 inhibitors are not suitable for clinical studies due to low potency and/or poor kinase selectivity. Therefore, more RIPK1 inhibitors with previously unreported scaffolds and better potential as drug candidates should be discovered.

In this work, we present a GDL model based on a distribution-learning cRNN and then apply this model to the discovery of RIPK1 inhibitors. The rest of this article is organized as follows. We first introduce the proposed GDL model, followed by applying this model to generate a tailor-made virtual compound library targeting RIPK1 and virtual screening against this library. We next describe the retrieval of a potent and selective RIPK1 inhibitor (RI-962). The X-ray crystal structure of RIPK1 in complex with RI-962 is then illustrated. Subsequently we present the in vitro and in vivo effects of RI-962 as well as its pharmacokinetic characteristics and safety evaluation. Last are the discussion and a detailed description of the methods used.

Results

Establishment of the GDL model

The proposed GDL model is based on a distribution-learning cRNN architecture that uses long short-term memory (LSTM) units³⁵; LSTM is an RNN variant designed to mitigate the vanishing-gradient problem. Unlike traditional generative RNN models, the cRNN architecture provides an explicit initial state vector to guide molecular generation toward a focused chemical domain, which balances the output specificity between an unbiased RNN and an autoencoder²¹. The architecture of the proposed GDL model is shown schematically in Fig. 2a. Molecules are represented by SMILES strings²⁰, which are encoded by the “one-hot” representation for inputs and outputs. Combined with the state vectors given by the feature extractor as the conditional input, the cRNN model is trained to generate molecules following the same chemical distribution as the given training data in an unsupervised-learning manner. In the training process, the cRNN is trained to reconstruct the input SMILES with a regularized state vector as the conditional input; in the generation process, the inference cRNN generates molecules triggered by the start token <SOS> with a sampled state vector as the conditional input (Fig. 2a). We applied three strategies to enhance the ability to generate molecules against a specific target (RIPK1): transfer learning, regularization enhancement, and sampling enhancement.
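The one-hot SMILES representation mentioned above can be illustrated with a minimal sketch. The character-level tokenization, the `<EOS>` end token, and all function names here are assumptions made for illustration; the tokenizer actually used by the authors is not described in this section and may, for example, treat two-letter elements such as Cl or Br as single tokens.

```python
# Minimal sketch: character-level one-hot encoding of SMILES strings.
# The "<EOS>" token and the character-level vocabulary are assumptions,
# not details taken from the paper.

SOS, EOS = "<SOS>", "<EOS>"

def build_vocab(smiles_list):
    """Map every character seen in the data (plus start/end tokens) to an index."""
    chars = sorted({ch for smi in smiles_list for ch in smi})
    return {tok: i for i, tok in enumerate([SOS, EOS] + chars)}

def one_hot_encode(smiles, vocab):
    """Return one one-hot row per token, framed by <SOS> and <EOS>."""
    tokens = [SOS] + list(smiles) + [EOS]
    rows = []
    for tok in tokens:
        row = [0] * len(vocab)
        row[vocab[tok]] = 1
        rows.append(row)
    return rows

def one_hot_decode(rows, vocab):
    """Invert one_hot_encode, dropping the start and end tokens."""
    inverse = {i: tok for tok, i in vocab.items()}
    tokens = [inverse[row.index(1)] for row in rows]
    return "".join(t for t in tokens if t not in (SOS, EOS))

vocab = build_vocab(["CCO", "c1ccccc1"])
encoded = one_hot_encode("CCO", vocab)  # 5 rows: <SOS>, C, C, O, <EOS>
```

Each row contains exactly one 1, so decoding is a simple index lookup, and the round trip `one_hot_decode(encoded, vocab)` recovers the input string.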

**Fig. 2: Establishment and performance of the cRNN-based generative model.**

Transfer learning. To build a well-performing model from limited known active compounds (target data, such as RIPK1 inhibitors here), we applied transfer learning^13,24 during the training process (Fig. 2b). For general optimization, we pre-trained the generative model using a large-scale dataset containing ~16 million molecules derived from the ZINC12 database³⁶ (source data). We then fine-tuned the model using the target data (here the target data is comprised of 1030 known RIPK1 inhibitors, Supplementary Table 1). To verify the effect of transfer learning, we evaluated the reconstruction and generation ability using dynamic validation datasets; for these dynamic validation datasets, 100,000 molecules from the source data or 1000 molecules from the target data were randomly selected for reconstruction evaluations, and 100 molecules from either the source data or the target data were randomly selected for generation evaluations. The results showed two remarkable improvements. First, the generalization ability, as assessed by the balanced reconstruction (Fig. 2c) and generation performance (Fig. 2d) on both the source data and the target data, was markedly improved. The models trained only on the target data performed worse on the reconstruction task (Fig. 2c), illustrating the importance of transfer learning. Second, when compared with the models without transfer learning, the convergence time was shortened considerably without affecting the reconstruction accuracy (Supplementary Fig. 1). Therefore, we adopted transfer learning in the subsequent model implementation.

Regularization enhancement. To improve the generation ability of the GDL model, we implemented regularization enhancement^25,26 by randomly adding Gaussian noise to the state vector during model training (Fig. 2e). As a proof of principle of the regularization enhancement, the dynamic validation dataset was evaluated. The results indicated that the GDL model benefitted from regularization enhancement: the enhanced model outperformed the other baseline methods in terms of generation capability while maintaining similar reconstruction performance (Fig. 2c, d).
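The idea of perturbing the state vector with Gaussian noise during training can be sketched as follows. This is only an illustration: the noise scale `sigma`, the function name, and the example vector are hypothetical, and the section does not state how or how often the authors applied the noise.

```python
import random

def add_gaussian_noise(state_vector, sigma=0.1, rng=None):
    """Return a copy of state_vector with zero-mean Gaussian noise added element-wise.

    sigma is a hypothetical noise scale; the paper's value is not given here.
    """
    rng = rng or random.Random()
    return [x + rng.gauss(0.0, sigma) for x in state_vector]

rng = random.Random(0)           # seeded so the example is reproducible
state = [0.5, -1.2, 0.0, 2.3]    # made-up conditional state vector
noisy = add_gaussian_noise(state, sigma=0.1, rng=rng)
clean = add_gaussian_noise(state, sigma=0.0, rng=rng)  # sigma = 0 leaves values unchanged
```

In a training loop, the noisy vector would be supplied to the decoder in place of the clean one, so that the decoder learns to reconstruct the same SMILES from a small neighborhood of each state vector rather than from a single point.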

Sampling enhancement. During the inverse design process of generative models, new molecules are generated by sampling a random state vector in the learned latent space. We adopted sampling enhancement^14,27 to generate new molecules from given state vectors. The performance of three sampling enhancement methods, i.e., single-point sampling, linear-interpolation sampling, and spherical-interpolation sampling, was evaluated on dynamic validation datasets containing 100 randomly selected molecules from the target data. Linear-interpolation sampling (Fig. 2f) outperformed the other two methods (Supplementary Fig. 2) and was therefore implemented in our framework for molecule generation.
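The difference between linear and spherical interpolation of state vectors can be made concrete with a short sketch. The function names and the two-dimensional example vectors are illustrative assumptions; real state vectors are high-dimensional, and the section does not specify how many interpolation points the authors sampled.

```python
import math

def lerp(a, b, t):
    """Linear interpolation between vectors a and b at fraction t in [0, 1]."""
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]

def slerp(a, b, t):
    """Spherical interpolation between non-zero vectors a and b.

    Falls back to lerp when the vectors are (nearly) parallel.
    """
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    cos_omega = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    cos_omega = max(-1.0, min(1.0, cos_omega))  # guard against rounding error
    omega = math.acos(cos_omega)
    if math.sin(omega) < 1e-8:
        return lerp(a, b, t)
    wa = math.sin((1.0 - t) * omega) / math.sin(omega)
    wb = math.sin(t * omega) / math.sin(omega)
    return [wa * x + wb * y for x, y in zip(a, b)]

def interpolation_path(a, b, n_points, method=lerp):
    """n_points (>= 2) evenly spaced vectors from a to b, endpoints included."""
    return [method(a, b, i / (n_points - 1)) for i in range(n_points)]

start, end = [1.0, 0.0], [0.0, 1.0]  # toy state vectors
linear_path = interpolation_path(start, end, 5)
spherical_path = interpolation_path(start, end, 5, method=slerp)
```

For these toy vectors the linear midpoint is `[0.5, 0.5]`, which lies inside the unit circle, whereas the spherical midpoint keeps unit length. Each intermediate vector would then be passed to the inference cRNN as the conditional input.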

Generation of a tailor-made virtual compound library for RIPK1 and virtual screening

The GDL model described above was applied to build a tailor-made virtual compound library for RIPK1. By running this model, we obtained a total of 79,323 molecules, in which duplicated molecules in the training sets and molecules bearing structural alerts or reactive groups had already been removed. To visualize the similarity between the source data, the target data, and the generated data in chemical space, uniform manifold approximation and projection (UMAP)³⁷ plots were generated. As shown in Fig. 3a, the molecules sampled from the generated data (blue) were shifted from the source data (red) toward the target data (purple) after transfer learning, indicating the effectiveness of transfer learning for navigating through chemical space from the source to the target. Moreover, the generated molecules were essentially similar to active compounds (target data) in terms of their physicochemical properties (Fig. 3b and Supplementary Fig. 3). Based on the analysis of relative scaffold diversity (i.e., unique scaffolds/total number of scaffolds)³⁸, the generated data (26.4%) outperformed the source data (1.2%) and the target data (14.1%) despite the fact that the number of Murcko scaffolds³⁹ in the source data (193,982) was much larger than that in the generated data (20,924) (Fig. 3c). Notably, 99.8% and 99.7% of the scaffolds in the generated data were different from the scaffolds in the source data and the target data, respectively, demonstrating the powerful ability of our model to generate additional scaffolds (Fig. 3c). Further, in terms of scaffold diversity⁴⁰, the generated data were obviously better than the target data and close to the source data (Fig. 3d and Supplementary Fig. 4a). Regarding fingerprint diversity⁴⁰, the generated data were also better than the target data for various types of fingerprints (Supplementary Fig. 4b–g).
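The relative scaffold diversity figures can be checked with simple arithmetic. The sketch below assumes that the denominator is the number of molecules in each set; this reading is an assumption, but it reproduces the reported 26.4% and 1.2% from the counts given in the text (with the source set taken as roughly 16 million molecules).

```python
def relative_scaffold_diversity(n_unique_scaffolds, n_molecules):
    """Unique Murcko scaffolds per molecule, expressed as a percentage."""
    return 100.0 * n_unique_scaffolds / n_molecules

# Counts quoted in the text; 16,000,000 is the approximate source-set size.
generated = relative_scaffold_diversity(20_924, 79_323)
source = relative_scaffold_diversity(193_982, 16_000_000)

print(round(generated, 1))  # 26.4
print(round(source, 1))     # 1.2
```

The comparison shows why the ratio matters: the source data contain far more scaffolds in absolute terms, yet each scaffold is shared by many more molecules.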

**Fig. 3: Generation of a virtual compound library against RIPK1 using the cRNN-based generative model.**

We then carried out virtual screening against the generated molecular library to obtain drug-like hit compounds targeting RIPK1. First, to ensure the uniqueness of the scaffolds, we removed molecules that contain the same generic Murcko scaffolds³⁹ or the same sub-structures as those in the known RIPK1 inhibitors (target data). Second, drug-like molecule screening was performed according to several important properties associated with drug-like molecules (see the Methods section). Third, pharmacophore-based virtual screening was carried out. To this end, we established a full-feature pharmacophore map^41,42 of RIPK1 inhibitors based on the reported co-crystal structures of RIPK1-ligand complexes, which includes all the important features and interactions between the RIPK1 receptor and ligands. This pharmacophore map consists of 11 features: two hydrogen bond acceptors (A1–A2), three hydrogen bond donors (D1–D3), and six hydrophobic features (H1–H6) (Fig. 3e). Molecules that matched at least four features of the pharmacophore map were kept. After the above screening, 23,925 molecules remained, and these filtered molecules retained scaffold and fingerprint diversity comparable to that of the generated data (Fig. 3d and Supplementary Fig. 4). Finally, molecular docking was used to prioritize the filtered molecules. To visualize the diversity, we generated tree maps (TMAPs)⁴³ according to RECAP⁴⁴-based structural similarity and molecular properties or docking scores; TMAP is a technique for unsupervised visualization of high-dimensional data that creates a 2D layout of a minimum spanning tree constructed in the original space. The TMAPs (Fig. 4 and Supplementary Fig. 5) show the diversity and distribution of the molecules in chemical space.
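The pharmacophore filtering rule (keep molecules that match at least four of the 11 features) can be sketched as a set intersection. The feature labels are those named in the text; the candidate names and their matched features are invented for illustration, and the actual matching would be produced by pharmacophore-mapping software rather than supplied by hand.

```python
# Feature labels from the text: two acceptors, three donors, six hydrophobic features.
PHARMACOPHORE = {"A1", "A2", "D1", "D2", "D3", "H1", "H2", "H3", "H4", "H5", "H6"}
MIN_MATCHED = 4  # threshold stated in the text

def passes_pharmacophore(matched_features, min_matched=MIN_MATCHED):
    """True if the molecule matches at least min_matched features of the map."""
    return len(set(matched_features) & PHARMACOPHORE) >= min_matched

# Hypothetical molecules with made-up feature matches.
candidates = {
    "mol_a": {"A1", "D1", "H1", "H2", "H5"},  # 5 matches, kept
    "mol_b": {"A2", "H3", "H4"},              # 3 matches, dropped
    "mol_c": {"D2", "D3", "H1", "H6"},        # 4 matches, kept
}
kept = sorted(name for name, feats in candidates.items() if passes_pharmacophore(feats))
print(kept)  # ['mol_a', 'mol_c']
```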

**Fig. 4: Location of selected molecules for further experimental validation in the TMAP of the filtered molecules.**

From the top-ranked 50 molecules (Supplementary Fig. 6), eight molecules (RI-056, RI-413, RI-470, RI-539, RI-753, RI-962, RI-985, RI-1155) (Supplementary Table 2) with relatively good synthetic accessibility were chosen for chemical synthesis and bioactivity evaluation; the synthetic accessibility of the compounds was judged by our chemical synthesis team. Although the eight molecules were selected according to their synthetic accessibility, they are still widely distributed in the TMAP (Fig. 4).

Retrieval of a potent and selective RIPK1 inhibitor

The selected compounds (Fig. 4 and Supplementary Table 2) were chemically synthesized. Given the space limitations, here we only describe the chemical synthesis of RI-962 (Fig. 5a); the chemical syntheses of the other compounds are presented in the Supplementary Information. Commercially available methyl 5-bromo-1-methyl-1H-indole-3-carboxylate (1) was methylated to give intermediate 2, which was hydrolyzed and reacted with α-methylbenzylamine to afford intermediate 4. Intermediate 4 reacted with bis(pinacolato)diboron to give intermediate 5. The nucleophilic acyl substitution of 7-bromo-[1,2,4]triazolo[1,5-a]pyridin-2-amine (6) generated intermediate 7, which then reacted with intermediate 5 by Suzuki–Miyaura reaction to produce compound RI-962.

**Fig. 5: The synthetic route and enzymatic activity of RI-962.**

The obtained compounds were then tested for their kinase inhibitory activity against RIPK1. Four compounds showed activity with half maximal inhibitory concentration (IC₅₀) < 10 μM (Supplementary Table 2). Among them, RI-962 was the most potent one with an IC₅₀ value of 35.0 nM against RIPK1 (Fig. 5b). The bioactivity of RI-962 was further validated by ADP-Glo assay, which gave an IC₅₀ value of 5.9 nM (Fig. 5c).

To investigate the kinase selectivity of RI-962, we performed KINOMEscan profiling at a concentration of 10 μM against a panel of 408 human kinases (Supplementary Table 3). For kinases inhibited by more than 50%, IC₅₀ values were then measured. In these assays, RI-962 showed very weak or no activity against all these kinases (IC₅₀ > 10 μM) except MLK3, for which the IC₅₀ value was 3.75 μM, making RI-962 107-fold less potent against MLK3 than against RIPK1 (Supplementary Table 4).
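The 107-fold figure follows directly from the two IC₅₀ values once they are expressed in the same unit. A minimal check, using the 35.0 nM RIPK1 value reported above (the function name is illustrative):

```python
def fold_selectivity(ic50_off_target_nm, ic50_target_nm):
    """Ratio of off-target to on-target IC50; larger means more selective."""
    return ic50_off_target_nm / ic50_target_nm

mlk3_nm = 3.75 * 1000  # 3.75 uM converted to nM
ripk1_nm = 35.0        # RIPK1 IC50 reported in the text
fold = fold_selectivity(mlk3_nm, ripk1_nm)
print(round(fold))  # 107
```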

X-ray crystal structure of RIPK1 in complex with RI-962

To understand the potency and selectivity of RI-962, we determined the co-crystal structure of the RIPK1 kinase domain in complex with RI-962 at a resolution of 2.64 Å (Supplementary Table 5). As shown in Fig. 6a, RIPK1 adopts its inactive conformation, which is characterized by the unique orientation of the conserved Asp-Leu-Gly (DLG) motif [Asp-Phe-Gly (DFG) in most other kinases] at the base of the activation loop (Fig. 6b). In the inactive conformation (DLG-out), the aspartate side chain of the DLG motif faces into a hydrophobic pocket adjacent to the ATP-binding pocket (called the allosteric site), while its neighboring phenylalanine residue occupies the ATP-binding pocket. In contrast, in the active state (DLG-in), the aspartate faces into the ATP-binding pocket to facilitate catalysis and the phenylalanine side chain occupies the allosteric site. RI-962 occupies both the ATP-binding pocket and the allosteric site simultaneously, indicating that it is a type II kinase inhibitor; kinase inhibitors that occupy the ATP-binding pocket, the allosteric site, or both sites concurrently belong to type I, III or II, respectively. The triazolo[1,5-a]pyridine and indole moieties reside in the ATP-binding pocket and the terminal benzene ring is located in the allosteric site (Fig. 6b). Four hydrogen bonds are formed: the aminotriazole moiety forms two hydrogen bonds with the backbone N and C=O groups of the residue M95; the amide group forms one hydrogen bond with the gatekeeper residue D156, and another hydrogen bond with a water molecule (Fig. 6c).
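The site-occupancy rule for inhibitor types stated above can be written as a small lookup. This is only a restatement of the classification given in the text; the function and argument names are illustrative, and real classification schemes include further categories that are not covered here.

```python
def inhibitor_type(occupies_atp_pocket, occupies_allosteric_site):
    """Classify a kinase inhibitor by the sites it occupies, as defined in the text."""
    if occupies_atp_pocket and occupies_allosteric_site:
        return "II"
    if occupies_atp_pocket:
        return "I"
    if occupies_allosteric_site:
        return "III"
    return None  # binds neither site; outside this scheme

print(inhibitor_type(True, True))   # II  (the binding mode described for RI-962)
print(inhibitor_type(False, True))  # III (allosteric site only)
```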

**Fig. 6: Co-crystal structures of RIPK1 complexed with RI-962 (PDB ID: 7YDX).**

Compared with the crystal structure of RIPK1 in complex with Cpd8, which is a known type II RIPK1 inhibitor with poor kinase selectivity, RI-962 induces an evident rotation of the αC-helix in RIPK1 by ~40° (Fig. 6d). Consequently, one of the catalytic triad residues⁴⁵, E63, is far away from K45, which breaks the salt bridge interaction between E63 and K45. This rotation also results in a larger empty space in the allosteric site. RI-962 fits snugly into the re-shaped allosteric site and makes tight hydrophobic interactions with residues M67, F162, V134, L129, L70, V75, and I154 (Fig. 6c). Overall, although RI-962 and Cpd8 adopt similar binding poses, RI-962 induces a conformational change around the allosteric site, which creates a more suitable space to accommodate RI-962 and allows better interactions between RI-962 and residues in the allosteric site. This, together with the non-conservation of residues in the allosteric site, could explain the high kinase selectivity of RI-962⁴⁶.

Nec-1a (Fig. 1) is a highly selective RIPK1 inhibitor, which has often been used as a positive control in necroptosis-related studies^45,46. The crystal structure of the Nec-1a-RIPK1 complex shows that Nec-1a induces a conformational change very similar to that induced by RI-962 but occupies only the allosteric site (type III), which accounts for its kinase selectivity (Fig. 6e). In contrast, RI-962 occupies both the ATP-binding pocket and the allosteric site (Fig. 6e), suggesting that it makes more interactions with RIPK1 and hence shows higher potency (RI-962, 35 nM vs. Nec-1a, 317 nM⁴⁵).

Cellular and molecular effects of RI-962

The TSZ (TNFα, Smac mimetic, and Z-VAD-FMK)-induced cell necroptosis models⁴⁷ were adopted to examine the cellular effect of RI-962. Four cell lines (HT29, L929, J774A.1, and U937) were used in this assay. As shown in Fig. 7a–d, RI-962 exerted a dose-dependent protective effect against necroptotic death, with EC₅₀ values of 10.0, 4.2, 11.4, and 17.8 nM for HT29, L929, J774A.1, and U937 cells, respectively, which also indicated that its activity is independent of cell line. In addition, dual staining of HT29 cells with CytoCalcein Violet 450 (for living cells) and 7-AAD (for necrotic cells) showed that RI-962 inhibited TSZ-induced necroptosis and improved cell survival in a concentration-dependent manner (Fig. 7e, f). The positive control GSK3145095⁴⁸ also displayed activity in these assays, but it was less potent than RI-962. Then, we knocked out RIPK1 in HT29 cells using the CRISPR/Cas9 approach and found that RIPK1 knockout HT29 cells were insensitive to TSZ-induced necroptosis (Fig. 7g, h), implying that RI-962 exerts its protective effect against TSZ-induced cell necroptosis by targeting RIPK1.

**Fig. 7: RI-962 protected cells from TSZ-induced necroptosis.**

We next examined the effect of RI-962 on the necroptotic signaling proteins in intact cells. As shown in Fig. 7i, RI-962 markedly inhibited the phosphorylation of RIPK1 and its downstream signaling proteins RIPK3 and MLKL in a dose-dependent manner, whereas it had no effect on the expression of the RIPK1, RIPK3, and MLKL proteins. Again, knockout of RIPK1 had the same effect (Fig. 7j). All these results suggest that RI-962 protects cells from necroptosis by inhibiting the kinase activity of RIPK1.

Pharmacokinetic characteristics and safety evaluation of RI-962

To further explore the druggability of RI-962, pharmacokinetic (PK) experiments were conducted in Sprague-Dawley (SD) rats. RI-962 given intravenously (i.v.) (5 mg/kg), intraperitoneally (i.p.) (20 mg/kg) and orally (p.o.) (20 mg/kg) showed area under the curve (AUC_0–t) values of 4526.1 h*ng/mL, 6459.7 h*ng/mL, and 1594.9 h*ng/mL, respectively, indicating adequate drug exposure. It displayed a half-life (T_1/2) of 8.5 h and a bioavailability of 35.7% following i.p. administration. The metabolic stability of RI-962 in rats was good, with a clearance rate (CL) of 18.5 mL/min/kg (Table 1 and Supplementary Fig. 7). We further evaluated the maximum tolerated dose of RI-962 in mice; the compound was well tolerated at doses up to 250 mg/kg, with no observed weight loss and no other side effects (Supplementary Fig. 8).
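The reported 35.7% bioavailability is consistent with the standard dose-normalized AUC ratio, F = (AUC_route / dose_route) / (AUC_i.v. / dose_i.v.). The sketch below applies that formula to the AUC_0–t values quoted above. It assumes AUC_0–t is the appropriate input (the authors may have used a different AUC variant), and the oral value it prints is derived here from the quoted numbers rather than stated in this section.

```python
def bioavailability(auc_route, dose_route, auc_iv, dose_iv):
    """Absolute bioavailability (%) from dose-normalized AUC values."""
    return 100.0 * (auc_route / dose_route) / (auc_iv / dose_iv)

# AUC_0-t in h*ng/mL and doses in mg/kg, as quoted in the text.
f_ip = bioavailability(6459.7, 20, 4526.1, 5)
f_po = bioavailability(1594.9, 20, 4526.1, 5)

print(round(f_ip, 1))  # 35.7, matching the reported i.p. value
print(round(f_po, 1))  # 8.8, derived here; not a value reported in this section
```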

Table 1 Key pharmacokinetic parameters of RI-962 obtained in a preliminary pharmacokinetic assessment experiment^a


In vivo effects of RI-962 in animal models of inflammatory disease

Necroptosis is associated with a variety of inflammatory disorders, and RIPK1 is considered as a promising intervention target for these diseases^28,30,49,50. Thus, we evaluated the in vivo effects of RI-962 in two animal models of inflammatory diseases: TNFα-induced systemic inflammatory response syndrome (SIRS) and dextran sulfate sodium (DSS)-induced inflammatory bowel disease (IBD).

We first examined the in vivo effects of RI-962 in the TNFα-induced SIRS model. SIRS is a life-threatening inflammatory state that results from the complex pathophysiologic response to infection, trauma, burns, pancreatitis, or a variety of other injuries⁵¹. As shown in Fig. 8a, a majority of the vehicle-treated mice died within 24 h (survival rate = 10%) after tail vein injection of TNFα. In comparison, the survival rate was increased to 90% in the RI-962-treated group. GSK3145095 also increased the survival rate (50%), but less than RI-962. Treatment with RI-962 or GSK3145095 remarkably reduced the TNFα-induced temperature loss (Fig. 8b) and the concentrations of proinflammatory cytokines (IL-1β and IL-6) in mice (Fig. 8c, d). Hematoxylin and eosin (H&E) staining of heart, liver, spleen, lung, and kidney showed that TNFα injection evidently damaged the liver (as indicated by inflammatory cell infiltration in the portal area) and kidney (as indicated by glomerular hemorrhage and swelling with neutrophil infiltration) (Fig. 8e), but had very weak or no obvious impact on heart, spleen and lung (Supplementary Fig. 9). Treatment with RI-962 attenuated damage to the liver and kidney (Fig. 8e). We further explored the mechanism of action by western blot. As shown in Fig. 8f, RI-962 treatment substantially reduced the level of phosphorylated RIPK1 (pRIPK1) but had no impact on the RIPK1 protein, indicating the inhibition of RIPK1 kinase activity. The activation of downstream proteins, RIPK3 and MLKL, was also markedly suppressed (Fig. 8f). Taken together, these results indicate that RI-962 ameliorated TNFα-induced SIRS by inhibiting RIPK1 activity.

**Fig. 8: RI-962 ameliorates TNFα-induced SIRS.**

We then evaluated the in vivo effects of RI-962 on the DSS-induced IBD model. IBD is a chronic, debilitating intestinal disease with a variety of clinical manifestations. The main forms of IBD are ulcerative colitis and Crohn’s disease⁵². Necroptosis is a major type of cell death involved in the regulation of intestinal homeostasis in the intestinal epithelium^53,54,55. RIPK1 is thus regarded as a potential target for IBD treatment⁵⁶. In this study, we examined the effect of RI-962 in a DSS-induced IBD mouse model. As shown in Fig. 9a, DSS treatment led to a rapid loss in mouse body weight from day 5 to day 11, and treatment with RI-962 or GSK3145095 strongly ameliorated this loss of body weight. Further, treatment with RI-962 or GSK3145095 markedly reduced the DSS-induced shortening of colon length (Fig. 9b, c). Histopathological analysis showed that RI-962 substantially decreased tissue damage in the colons of DSS-treated mice (Fig. 9d). In DSS-induced colitis, numerous S100a9-positive cells (a marker of inflammation) infiltrated into the mucosa and epithelial layer of the damaged colon (Fig. 9e), while no infiltration by S100a9-positive cells was observed in the colons of mice treated with RI-962 (Fig. 9e). More importantly, treatment with RI-962 or GSK3145095 dramatically increased the survival rate of DSS-treated mice (Fig. 9f; 40 mg/kg RI-962 or GSK3145095 survival rate, 100% vs vehicle: 16.7%). In addition, RI-962 treatment during DSS challenge substantially reduced the content of proinflammatory cytokines (TNFα, IL-1β, and IL-6) in cultured colonic tissue supernatants compared with the DSS control mice (Fig. 9g–i). Finally, the western blot assay was used to investigate the effect of RI-962 on the RIPK1 signaling pathway. The results showed that RI-962 reduced the levels of pRIPK1, pRIPK3, and pMLKL proteins in the colon during DSS challenge, but did not impact the expression of RIPK1, RIPK3, and MLKL proteins (Fig. 9j), suggesting that RI-962 suppressed the RIPK1 signaling in the mouse model of DSS-induced colitis.

**Fig. 9: RI-962 reduces inflammation in acute DSS-induced colitis.**

Discussion

Developing a new drug is an expensive and time-consuming process that can cost over 1 billion dollars and take over 10 years. Identification of hit/lead compounds with novel structures is the first and also a critical step. The most common approach to retrieving new hit/lead compounds is to screen existing chemical libraries by using high-throughput screening methods. With this approach, one may not be able to find additional active compounds with different scaffolds because the chemical space of the existing compound libraries is limited and has already been screened over and over again. To address this problem, we proposed a GDL model to generate a tailor-made compound library with previously unreported scaffolds, which allows us to retrieve hit/lead compounds from the huge unexplored chemical space.

The proposed GDL model is a cRNN-based model²¹. The generative process of cRNN is conditioned by explicitly setting its internal state according to desired properties. Current implementations of cRNN usually employ goal-directed strategies. However, the effectiveness of the goal-directed model strongly depends on the accuracy of the goal function. Ill-defined goal functions can result in invalid molecular structures^18,23. As an alternative to the goal-directed approach, the distribution-learning strategy aims to generate molecules that resemble the given dataset, which could achieve data-driven molecule generation through unsupervised learning²². We therefore established a distribution-learning cRNN model, in which three strategies, namely transfer learning, regularization enhancement, and sampling enhancement, were incorporated. Transfer learning shifted the data distribution of the latent space from the large collection of the source data (ZINC12 database) toward the target data (known RIPK1 inhibitors), enabling the generation of drug-like and bioactive molecules. Regularization enhancement by adding random input noise, which is considered equivalent to introducing penalty terms in the objective function^25,26, helps to improve the generalization performance of the GDL model. Sampling enhancement is implemented by interpolating between points in the latent space during generation, which improves the likelihood of successfully generating target-specific molecules with diverse chemical scaffolds.

Our GDL model has been successfully applied to establish a virtual compound library against RIPK1. Compared with the known RIPK1 inhibitors, the generated library was enriched with many more molecules bearing new scaffolds. Through a standard drug screening process against the established compound library, we retrieved a potent and selective RIPK1 inhibitor with a previously unreported scaffold. On the one hand, this application example verified the effectiveness of our GDL model. Although RIPK1 is a kinase, our GDL model could be applied to other kinds of biological targets. The only requirement is that the biological target must have a sufficient number of known active compounds (target data). The larger the number of known active compounds, the better the GDL model is expected to perform. On the other hand, this application example led to the identification of a potent RIPK1 inhibitor (RI-962) with a previously unreported scaffold. Of note, RI-962 displayed high selectivity over 407 other kinases. It also showed potent activity both in vitro and in vivo. Even so, this compound still has some unfavorable properties that need further optimization in the future, for example, low oral bioavailability (Table 1). This is understandable because the GDL model is not a panacea, and one should not expect it to directly generate a drug candidate. Overall, we discovered a lead compound with a previously unreported scaffold against RIPK1 by using our proposed GDL model, demonstrating a successful application of deep neural networks in early drug discovery.

Methods

Data preparation

Compounds from ZINC12 database³⁶ were used to construct the source data for transfer learning (downloaded on August 20, 2020). Known RIPK1 inhibitors (bioactive compounds) were retrieved from ChEMBL⁵⁷ and patents (<10 μM) to form the target data. All these molecules were encoded as SMILES strings, and then canonicalized and standardized by removing stereochemical information, salts, and duplicates using the RDKit package (v2019.09.2.0, www.rdkit.org). We finally obtained a set of ~16 million molecules as the source data and 1030 bioactive molecules as the target data (Supplementary Table 1).

Implementation of the GDL model

The generative model reads the input SMILES string²⁰ of a molecule in “one-hot” representation together with a state vector encoded by the feature extractor, and then converts them back to a SMILES string following chemical rules. The generative model is a one-layer LSTM (256 dimensions) followed by a dense layer with a SoftMax activation function, which generates a probability distribution over all possible grammar production rules at each time step. The feature extractor is a one-layer bi-directional LSTM (512 dimensions) that converts the input molecule to an initial state vector. In short, a $\mathrm{cRNN}$ takes a sequence of input vectors $x_{1:n} = (x_{1}, \ldots, x_{n})$ and an initial state vector $h_{0}$, and returns a sequence of state vectors $h_{1:n} = (h_{1}, \ldots, h_{n})$ and a sequence of output logit vectors $o_{1:n} = (o_{1}, \ldots, o_{n})$ (Eq. (1)). The $\mathrm{cRNN}$ model consists of a recursively defined function $R$ (Eq. (2)), which takes a state vector $h_{i-1}$ and an input vector $x_{i}$ and returns a new state vector $h_{i}$; another function $O$ maps a state vector $h_{i}$ to an output logit vector $o_{i}$ (Eq. (3)):

$$\mathrm{cRNN}\left(h_{0}, x_{1:n}\right) = h_{1:n},\, o_{1:n}$$

(1)

$$h_{i} = R\left(h_{i-1}, x_{i}\right),\quad i \ge 1$$

(2)

$$o_{i} = O\left(h_{i}\right),\quad i \ge 1.$$

(3)
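The recursion in Eqs. (1)–(3) can be illustrated with a minimal sketch. The functions `toy_R` and `toy_O` below are hypothetical stand-ins for the trained LSTM cell and the dense output layer; they are assumptions for illustration only and are not the functions used in the GDL model.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def run_crnn(
    h0: Sequence[float],
    xs: Sequence[Sequence[float]],
    R: Callable[[Sequence[float], Sequence[float]], Vector],
    O: Callable[[Sequence[float]], Vector],
) -> Tuple[List[Vector], List[Vector]]:
    """Unroll Eq. (1): return the state vectors h_1..h_n and logits o_1..o_n."""
    states: List[Vector] = []
    logits: List[Vector] = []
    h: Vector = list(h0)
    for x in xs:
        h = R(h, x)           # Eq. (2): h_i = R(h_{i-1}, x_i)
        states.append(h)
        logits.append(O(h))   # Eq. (3): o_i = O(h_i)
    return states, logits


def toy_R(h_prev: Sequence[float], x: Sequence[float]) -> Vector:
    # Hypothetical state update; a real model would use an LSTM cell here.
    return [math.tanh(0.5 * hp + xi) for hp, xi in zip(h_prev, x)]


def toy_O(h: Sequence[float]) -> Vector:
    # Hypothetical output map; a real model would use a dense layer here.
    return [2.0 * v for v in h]


h0 = [0.0, 0.0]                             # initial state from the feature extractor
xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # toy input vectors x_1..x_3
states, logits = run_crnn(h0, xs, toy_R, toy_O)
```

The point of the sketch is that the initial state $h_{0}$ is the only place where conditioning enters: changing `h0` changes every subsequent state and logit vector without altering `R` or `O`.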

During training, the generative model was trained to reconstruct the training data by minimizing the training loss $\mathcal{L}$ (Eq. (4)), which measures the discrepancy between the original and reconstructed vectors of molecular representations. The training loss $\mathcal{L}$ was computed from the cross-entropy loss function with SoftMax activation (Eqs. (4) and (5)):

$$\mathcal{L} = \frac{1}{n}\sum_{i=1}^{n}\left(-\sum_{j=1}^{J} y_{i,j}\log p_{i,j}\right)$$

(4)

$$p_{i,j} = \frac{\mathrm{e}^{o_{i,j}}}{\sum_{k=1}^{K}\mathrm{e}^{o_{i,k}}},$$

(5)

where $n$ is the batch size, $J$ is the dimension of each molecular representation, $i$ is the $i$th vector, $j$ is the $j$th dimension of a vector, $k$ is the $k$th dimension of a vector, $K$ is the size of the set of all tokens, $y$ is the vector of the original molecular representation (label), and $o$ is the vector of the reconstructed molecular representation. The parameters of the generative model were then updated using AdamOptimizer with a learning rate of 0.0001 until convergence. Training loss was monitored and visualized using TensorBoard. Transfer learning^13,24 was implemented by updating the parameters using the target data, starting from the parameters of the converged model pre-trained on the source data. Regularization enhancement^25,26 was performed by adding a Gaussian noise vector $\xi$ to the hidden vector $h_{0}$ (Eq. (6)):

$$h_{0}^{\mathrm{noise}} = h_{0} + \xi,\quad \xi_{m} \sim \mathcal{N}\left(\mu, \sigma^{2}\right),$$

(6)

where $h_{0}^{\mathrm{noise}}$ is the regularized $h_{0}$, $\xi$ is the noise vector with the same dimension as $h_{0}$, $\xi_{m}$ is the $m$th dimension of the noise vector $\xi$, and $\mathcal{N}(\mu, \sigma^{2})$ is a Gaussian distribution with mean $\mu$ and variance $\sigma^{2}$. The mean $\mu$ of the noise distribution was chosen to be zero.
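The training-time quantities in Eqs. (4)–(6) can be sketched in plain Python as follows. This is a minimal illustration and not the TensorFlow implementation used in this work; the function names, the example value of `sigma`, and the fixed random seed are assumptions.

```python
import math
import random
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Eq. (5): convert one logit vector o_i into probabilities p_i."""
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(o - m) for o in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy_loss(
    logit_rows: Sequence[Sequence[float]],
    label_rows: Sequence[Sequence[float]],
) -> float:
    """Eq. (4): mean over the n rows of -sum_j y_ij * log(p_ij)."""
    n = len(logit_rows)
    total = 0.0
    for logits, labels in zip(logit_rows, label_rows):
        probs = softmax(logits)
        # Terms with y_ij == 0 contribute nothing, so they are skipped.
        total += -sum(y * math.log(p) for y, p in zip(labels, probs) if y > 0)
    return total / n


def add_gaussian_noise(
    h0: Sequence[float], sigma: float, rng: random.Random, mu: float = 0.0
) -> List[float]:
    """Eq. (6): h0_noise = h0 + xi, with each xi_m drawn from N(mu, sigma^2)."""
    return [h + rng.gauss(mu, sigma) for h in h0]


# One-hot label on the first of two tokens, uniform logits: loss = log(2).
loss = cross_entropy_loss([[0.0, 0.0]], [[1.0, 0.0]])

# Perturb a toy initial state; sigma = 0.1 is an arbitrary illustrative value.
noisy_h0 = add_gaussian_noise([1.0, 2.0, 3.0], sigma=0.1, rng=random.Random(0))
```

With uniform logits over two tokens the loss equals $\log 2$, which gives a quick sanity check on any implementation of Eqs. (4) and (5).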

During generation, molecular representations were generated by the start token <SOS> and ${h}_{{{{{{{\mathrm{new}}}}}}}}$ with sampling enhancement^14,27. Three types of sampling enhancement were implemented, namely, linear-interpolation sampling [Linear, Eq. (7)], spherical-interpolation sampling [Slerp, Eq. (8)], and single-point sampling [Sample, Eq. (9)]:

$$h_{0,\mathrm{new}}^{ij,\alpha} = \mathrm{Linear}\left(h_{0}^{i}, h_{0}^{j}; \alpha\right) = (1-\alpha)\,h_{0}^{i} + \alpha\,h_{0}^{j},\quad \alpha \in (0,1)$$

(7)

$$h_{0,\mathrm{new}}^{ij,\beta} = \mathrm{Slerp}\left(h_{0}^{i}, h_{0}^{j}; \beta\right) = \frac{\sin\left[(1-\beta)\theta\right]}{\sin\theta}\,h_{0}^{i} + \frac{\sin(\beta\theta)}{\sin\theta}\,h_{0}^{j},\quad \beta \in (0,1)$$

(8)

$$h_{0,\mathrm{new}}^{i} = \mathrm{Sample}\left(h_{0}^{i}\right) = h_{0}^{i} + \xi_{s},\quad \xi_{s} \sim \mathcal{N}\left(\mu_{s}, \sigma_{s}^{2}\right),$$

(9)

where $\alpha$ is the linear-interpolation factor, $\beta$ is the spherical-interpolation factor, $\theta$ is the central angle between $h_{0}^{i}$ and $h_{0}^{j}$, and $\xi_{s}$ is a random vector that has the same dimension as $h_{0}^{i}$ and is drawn from a Gaussian distribution with mean $\mu_{s}$ and variance $\sigma_{s}^{2}$.
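The three sampling schemes in Eqs. (7)–(9) can be sketched as below. This is an illustrative sketch rather than the released code: computing $\theta$ from the normalized dot product of the two state vectors and falling back to linear interpolation when $\sin\theta$ is close to zero are assumptions made here to keep the example well defined.

```python
import math
import random
from typing import List, Sequence


def linear(h_i: Sequence[float], h_j: Sequence[float], alpha: float) -> List[float]:
    """Eq. (7): linear interpolation between two initial state vectors."""
    return [(1.0 - alpha) * a + alpha * b for a, b in zip(h_i, h_j)]


def slerp(h_i: Sequence[float], h_j: Sequence[float], beta: float) -> List[float]:
    """Eq. (8): spherical interpolation with theta the angle between h_i and h_j."""
    norm_i = math.sqrt(sum(a * a for a in h_i))
    norm_j = math.sqrt(sum(b * b for b in h_j))
    if norm_i == 0.0 or norm_j == 0.0:
        raise ValueError("slerp is undefined for a zero vector")
    dot = sum(a * b for a, b in zip(h_i, h_j))
    # Clamp to [-1, 1] to guard against rounding error before acos.
    cos_theta = max(-1.0, min(1.0, dot / (norm_i * norm_j)))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if sin_theta < 1e-8:
        # Assumption: for (anti)parallel vectors fall back to Eq. (7).
        return linear(h_i, h_j, beta)
    w_i = math.sin((1.0 - beta) * theta) / sin_theta
    w_j = math.sin(beta * theta) / sin_theta
    return [w_i * a + w_j * b for a, b in zip(h_i, h_j)]


def sample(
    h_i: Sequence[float], sigma_s: float, rng: random.Random, mu_s: float = 0.0
) -> List[float]:
    """Eq. (9): single-point sampling, h_i plus Gaussian noise N(mu_s, sigma_s^2)."""
    return [a + rng.gauss(mu_s, sigma_s) for a in h_i]


midpoint_linear = linear([0.0, 0.0], [2.0, 4.0], 0.5)
midpoint_slerp = slerp([1.0, 0.0], [0.0, 1.0], 0.5)
perturbed = sample([0.0, 0.0], sigma_s=0.1, rng=random.Random(1))
```

For two orthogonal unit vectors, the spherical midpoint keeps unit length, whereas the linear midpoint has a shorter norm. This is the practical difference between Eqs. (7) and (8).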

All the parameters used in the GDL model are presented in Supplementary Table 6. All software programs were implemented in Python (v3.6.9) with the TensorFlow GPU backend (www.tensorflow.org, v1.10.0). Additional details are provided in Supplementary Information, including conversion between SMILES and the word embedding matrix (Supplementary Note 1), regularization enhancement (Supplementary Note 2), and sampling enhancement (Supplementary Note 3).

Evaluation of the GDL model

The performance of the GDL model was evaluated on subsets randomly selected from the source data or the target data. To evaluate the reconstruction capability of the GDL model, we used 100,000 molecules from the source data and 1000 molecules from the target data as subsets, and the criterion was the reconstructed rate ($R\%$, Eq. (10)):

$$R\% = \frac{M_{\mathrm{recon}}}{N_{\mathrm{recon}}} \times 100\%,$$

(10)

where $N_{\mathrm{recon}}$ is the number of subset molecules used for evaluation of reconstruction capability, and $M_{\mathrm{recon}}$ is the number of molecules that are reconstructed correctly by the GDL model. We evaluated the performance of models trained using six training methods: (1) training on the source data; (2) training on the target data; (3) training with transfer learning on the source and the target data; (4) training with regularization enhancement on the source data; (5) training with regularization enhancement on the target data; and (6) training with transfer learning and regularization enhancement on the source and the target data. To further evaluate the generation capability, we used 100 molecules from the source data and 100 molecules from the target data. The models were trained using the four training methods that showed adequate reconstruction capability: (1) training on the source data; (2) training with transfer learning on the source and the target data; (3) training with regularization enhancement on the source data; and (4) training with transfer learning and regularization enhancement on the source and the target data. The generation capabilities of these trained models were then evaluated using the generative rate ($G\%$, Eq. (11)) as a criterion:

$$G\% = \frac{M_{\mathrm{gen}}}{N_{\mathrm{gen}}} \times 100\%,$$

(11)

where $N_{\mathrm{gen}}$ is the number of subset molecules used for the evaluation of generation capability, and $M_{\mathrm{gen}}$ is the number of molecules generated by the GDL model.
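The two criteria in Eqs. (10) and (11) are simple ratios, as the following sketch shows. Treating a molecule as "reconstructed correctly" when the output string is identical to the input canonical SMILES is an assumption made for this illustration, and the function names and example inputs are hypothetical.

```python
from typing import Sequence


def reconstruction_rate(
    originals: Sequence[str], reconstructions: Sequence[str]
) -> float:
    """Eq. (10): R% = M_recon / N_recon * 100."""
    if len(originals) != len(reconstructions):
        raise ValueError("each input molecule needs exactly one reconstruction")
    n_recon = len(originals)
    # Assumption: exact string identity of canonical SMILES counts as correct.
    m_recon = sum(1 for o, r in zip(originals, reconstructions) if o == r)
    return 100.0 * m_recon / n_recon


def generative_rate(m_gen: int, n_gen: int) -> float:
    """Eq. (11): G% = M_gen / N_gen * 100."""
    return 100.0 * m_gen / n_gen


# Toy inputs: two of three strings are reproduced exactly.
r_percent = reconstruction_rate(
    ["CCO", "c1ccccc1", "CC"],
    ["CCO", "c1ccccc1", "CCC"],
)
g_percent = generative_rate(m_gen=80, n_gen=100)
```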

Calculations of molecular properties

To compare the similarity between different molecules in terms of physicochemical properties, some important physicochemical parameters associated with drug-like properties were calculated, including molecular weight (MW), the octanol–water partition coefficient (LogP)⁵⁸, the quantitative estimate of drug-likeness (QED)⁵⁹, Bertz C_T⁶⁰, the topological polar surface area (TPSA)⁶¹, water solubility (LogS)⁶², the number of rotatable bonds (rot), the number of H-bond donors (HBD), and the number of H-bond acceptors (HBA). To visualize the comparison results, histograms and kernel density estimation (KDE) maps were drawn using Seaborn (https://seaborn.pydata.org/, v0.11.1). For the drug-like compound screening, we used the following criteria: 200 ≤ MW ≤ 700, −2 ≤ LogP ≤ 6, and 0.15 ≤ QED. To avoid molecules that are very difficult to synthesize, we calculated the synthetic accessibility (SA) score⁶³ and filtered out molecules with SA score > 5; the SA score indicates the complexity of synthesis and ranges from low values (easy to synthesize) to high values (difficult to synthesize). All calculations were carried out using RDKit (https://rdkit.org/, v2019.09.2.0).
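The drug-likeness and synthetic-accessibility thresholds above can be applied as a simple filter once the properties have been computed. The sketch below assumes that MW, LogP, QED, and SA score are already available for each molecule (for example from RDKit); the record type, the molecule names, and all property values are hypothetical placeholders.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class MoleculeProps:
    name: str      # placeholder identifier, not a real compound
    mw: float      # molecular weight
    logp: float    # octanol-water partition coefficient
    qed: float     # quantitative estimate of drug-likeness
    sa: float      # synthetic accessibility score


def passes_filters(m: MoleculeProps) -> bool:
    """Apply the criteria from the text: MW, LogP, QED, and SA score <= 5."""
    return (
        200.0 <= m.mw <= 700.0
        and -2.0 <= m.logp <= 6.0
        and m.qed >= 0.15
        and m.sa <= 5.0
    )


# Hypothetical property values chosen to exercise each threshold.
library: List[MoleculeProps] = [
    MoleculeProps("mol_A", mw=350.0, logp=3.2, qed=0.60, sa=2.8),  # passes
    MoleculeProps("mol_B", mw=150.0, logp=1.0, qed=0.50, sa=2.0),  # MW too low
    MoleculeProps("mol_C", mw=450.0, logp=6.5, qed=0.40, sa=3.0),  # LogP too high
    MoleculeProps("mol_D", mw=500.0, logp=4.0, qed=0.30, sa=5.6),  # SA score > 5
]

kept = [m.name for m in library if passes_filters(m)]
```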

Uniform manifold approximation and projection (UMAP)

To visualize the similarity relations between the source data, the target data, and the generated data, we constructed UMAP plots³⁷ (umap-learn 0.4.6), which are two-dimensional representations of high-dimensional data distributions, from 3000, 1000, and 2000 randomly selected molecules from the source data, the target data, and the generated data, respectively.

Scaffold and fingerprint diversity

Scaffold and fingerprint diversity were analyzed and visualized using the Platform for Unified Molecular Analysis (PUMA)⁴⁰ (https://www.difacquim.com/d-tools/) with 1000 molecules randomly selected from the source data, the target data, the generated data, and the filtered molecules, respectively.

Full-feature pharmacophore map

The Discovery Studio (version 3.1) program package was used to generate a full-feature pharmacophore map for RIPK1 inhibitors. Thirteen crystallographic structures of RIPK1-inhibitor complexes were collected from the Protein Data Bank (PDB)⁶⁴ (Supplementary Note 4). Taking 4NEU as the reference structure, MODELER was used for structural alignment with default parameter settings. We then performed the Receptor-Ligand Pharmacophore Generation protocol for the automatic construction of three-dimensional pharmacophores based on the previously aligned structures. All the identified pharmacophore features, including hydrogen bond donor (HBD/D), hydrogen bond acceptor (HBA/A), hydrophobic (HYD/H), positive ionizable (PI), negative ionizable (NI), ring aromatic (RA), and excluded volume features, were clustered according to their interaction pattern with the receptor. Finally, 11 clustered features including two hydrogen bond acceptors (A1–A2), three hydrogen bond donors (D1–D3), and six hydrophobic features (H1–H6) were selected to form the full-feature pharmacophore map^41,42. After generating multiple molecular conformations, the screening procedure was carried out, resulting in a set of molecules with at least four matched pharmacophore features, ranked by their fit values. More details are provided in Supplementary Note 4.

Molecular docking

The GOLD program was adopted for molecular docking, with GoldScore used as the scoring function^65,66. To improve the screening, flexible docking was performed. The receptor structure was taken from the Protein Data Bank (PDB)⁶⁴ (PDB entry: 4ITH). To accelerate flexible docking, only a limited set of residues was treated as flexible. By comparing X-ray crystal structures of RIPK1 in complex with different ligands, we found that, among all the residues forming the active pocket (including the ATP-binding pocket and the allosteric site), nine residues often display a large displacement: V31, I43, K45, M67, L70, M92, L157, L159, and F162. Therefore, the sidechains of these nine residues were defined as flexible sidechains in the program setting. The binding site was defined as the area within 10 Å around the 4ITH ligand, and other parameters were set to default values. The entire process of molecular docking was implemented in Discovery Studio 3.1.

Tree maps (TMAPs)

For the unsupervised visualization of high-dimensional data, a TMAP⁴³ (tmap 1.0.4; faerun 0.3.20) creates a two-dimensional layout of a minimum spanning tree constructed in the original space. In this study, TMAPs were used to visualize RECAP⁴⁴-based (rdkit 2019.09.2.0) structural similarity among the filtered molecules. Each TMAP shows the molecules as dots with up to three concentric circles: the first circle depicts the molecule properties (including MW, LogP, SA score, and QED) or docking scores (colored from red to yellow to green, moving from the maximum value to the minimum value); the second circle depicts the RECAP fragment number of a molecule (colored by the number of RECAP fragments).

Chemical synthesis

The primary synthetic data are available in the Supplementary Methods.

Cell lines and cell culture conditions

The cell lines used in this investigation were purchased from the American Type Culture Collection (ATCC). HT29, L929, HEK 293T and J774A.1 cells were cultured in DMEM (Gibco) supplemented with 10% fetal bovine serum, 100 U/mL penicillin, and 100 U/mL streptomycin. U937 cells were cultured in RPMI-1640 (Gibco) culture medium supplemented with 10% fetal bovine serum, 100 U/mL penicillin, and 100 U/mL streptomycin. Sf9 cells were cultured in SIM SF (Sino Biological Inc.) supplemented with 50 U/mL penicillin and 50 U/mL streptomycin. HT29, L929, HEK 293T, U937 and J774A.1 cells were incubated at 37 °C under 5% CO₂. Sf9 cells were incubated at 27 °C. All cells tested negative for mycoplasma, and none of these cell lines is among those commonly misidentified according to the International Cell Line Authentication Committee (ICLAC).

Cell necroptosis protection assay

Cell necroptosis protection assays were performed in 96-well cell culture plates. Cells were plated in each well and cultured at 37 °C overnight. HT29, U937, and J774A.1 cells were treated with 10 ng/mL TNFα, 100 nM Smac mimetic, and 40 μM z-VAD-FMK for 24 h. L929 cells were treated with 10 ng/mL TNFα and 40 μM z-VAD-FMK for 24 h. The cell survival rate was determined using a CCK8 cell viability assay kit and CLARIOstar (v5.61). The concentration–response curve was fitted using GraphPad Prism 8.0 (GraphPad Software) to calculate the 50% effective concentration (EC₅₀). All experiments were performed at least twice, and each EC₅₀ value was expressed as mean ± standard deviation (SD).

Dual staining with CytoCalcein Violet 450 and 7-AAD

The dual-staining assay with CytoCalcein Violet 450 and 7-AAD was performed in a 24-well cell culture plate. Cells were plated in each well and cultured at 37 °C overnight. HT29 cells were treated with 10 ng/mL TNFα, 100 nM Smac mimetic, and 40 μM z-VAD-FMK for 24 h. The HT29 cells were then double-stained with CytoCalcein Violet 450 and 7-AAD and observed under a microscope. All images were acquired with an Eclipse Ci-L microscope (Nikon, Japan). Dead cells were quantified with ImageJ software, and the data were analyzed with GraphPad software. The experiment was repeated three times.

CRISPR/Cas9-mediated RIPK1 knockout in HT29 cells

The lentiCRISPRv2 vector targeting RIPK1 (sgRNA, 5’-CTCGGGCGCCATGTAGTAGA-3’) was constructed by the Azenta company. HEK 293T cells were transfected with lentiCRISPRv2 targeting RIPK1 or with empty vector using Hieff Trans™ Liposomal Transfection Reagent (Yeasen). The viruses were collected at 24 h and 48 h, filtered with a 0.45 μm filter head, and then added to the virus concentrate and treated at 4 °C overnight. The concentrated viruses were added to HT29 cells along with 8 μg/mL of polybrene (Yeasen) to enhance transfection efficiency. The infection was repeated the next day under the same conditions. Finally, HT29 cells were screened with 3 μg/mL puromycin. Western blot analysis was used to confirm the RIPK1 deletion.

Western blot analysis

Cell pellets were collected and resuspended in RIPA lysis buffer (Beyotime), to which phenylmethylsulfonyl fluoride, a proteasome inhibitor, and a phosphatase inhibitor cocktail (Sigma) had been added. Whole-cell protein lysates were incubated on ice for 15 min and centrifuged at 13,800 × g and 4 °C for 15 min. The supernatants were collected and subjected to western blot analysis.

The liver in the SIRS model and the colon in the IBD model were harvested, homogenized, and sonicated in RIPA lysis buffer. The supernatants were collected after centrifugation at 13,800 × g and 4 °C for 15 min.

The cell proteins or tissue proteins were separated in a polyacrylamide gel and transferred to a methanol-activated polyvinylidene fluoride membrane. The membrane was blocked for 2 h in Tris-buffered saline plus Tween-20 containing 5% milk and then immunoblotted sequentially with primary and secondary antibodies. Detection was performed with an ECL chemiluminescence kit (Abbkine). The antibodies used were human RIPK1 antibody (R&D, 334640, 1:1000), mouse RIPK1 antibody (Affinity, DF2642, 1:1000), human phospho-RIP (Ser166) rabbit mAb (Cell Signaling Technologies, 65746, 1:1000), mouse phospho-RIP (Ser321) rabbit mAb (Cell Signaling Technologies, 38662, 1:1000), human RIPK3 (B-2) antibody (Santa Cruz, sc-374639, 1:250), mouse RIPK3 antibody (Abcam, ab62344, 1:1000), human anti-RIP3 (phospho S227) antibody (Abcam, ab209384, 1:2000), mouse anti-RIP3 (phospho T231 + S232) antibody (Abcam, ab205421, 1:500), anti-MLKL (58–70) antibody (Sigma, M6697, 1:250), human anti-MLKL (phospho S358) antibody (Abcam, ab187091, 1:1000), mouse anti-MLKL (phospho S345) antibody (Abcam, ab196436, 1:1000), and β-actin (Proteintech, 66009-1-Ig, 1:1000).

Protein preparation and crystallization

RIPK1 protein expression and purification were carried out following protocols similar to those in the literature⁴⁵. The human RIPK1 kinase domain containing residues 1–294 with four cysteine-to-alanine mutations (C34A, C127A, C233A, and C240A) was cloned into the vector pFastbacHAT (completed by the Azenta company). The recombinant virus containing RIPK1 was generated using the Bac-to-Bac baculovirus expression system and used to infect Sf9 cells. After infection with baculovirus for 48 h, the cells were harvested in a buffer containing 25 mM Tris (pH 7.6), 1 M NaCl, 0.5 mM TCEP, and 20 mM imidazole. The RIPK1 kinase domain was purified to homogeneity using a nickel resin column. The protein was eluted in buffer containing 250 mM imidazole. The N-terminal tag was cleaved by TEV protease, and the protein was further purified using a Superdex 200 gel filtration column (GE Healthcare) and finally using a MonoQ column (GE Healthcare). The purified RIPK1 was concentrated to 10.693 mg/mL in a buffer containing 25 mM Tris-HCl pH 7.9, 150 mM NaCl, and 0.5 mM TCEP.

Crystals of the RIPK1 in complex with RI-962 (final concentration of 1 mM added to the protein) were obtained by co-crystallization via hanging drop vapor diffusion. Crystals were obtained from solution (0.25 M NH₄I, 23% polyethylene glycol 3350, and 0.03 M glycyl-glycyl-glycine) and grew to full size in ~1 week. The crystals were harvested after cryo-protection in 10% ethylene glycol and flash-frozen in liquid nitrogen for data collection.

Data collection and refinement of RIPK1

All diffraction datasets were collected on beamline BL19U1 of the Shanghai Synchrotron Radiation Facility and processed using HKL2000⁶⁷. Further data processing was carried out using programs from the CCP4 suite⁶⁸. Structures were determined by molecular replacement using a previously published structure (PDB ID: 4ITJ)⁴⁵ as the starting model. Manual model rebuilding and refinement were iteratively performed with Coot⁶⁹ and Phenix⁷⁰, respectively. The crystal of RI-962-bound RIPK1 belongs to space group P2₁2₁2₁. Each asymmetric unit contains two molecules of RIPK1. The statistics and refinement values of the crystal structure are shown in Supplementary Table 5.

In vitro kinase activity assays

In vitro kinase activity assays were conducted through the Kinase Profiling Services provided by Eurofins (Eurofins, France). The protocol for the RIPK1 assay is briefly described as follows (protocols for other kinases are very similar and can be found at http://www.eurofins.com/pharmadiscovery). RIPK1 kinase was incubated with the test compound in assay buffer containing 8 mM MOPS (pH 7.0), 0.2 mM EDTA, 250 μM KKKSPGEYVNIEFG, 10 mM magnesium acetate, and 10 μM [γ-³³P]-ATP for 15 min at room temperature. The reaction was initiated by the addition of the Mg/ATP mixture. After incubation for 40 min at room temperature, the reaction was stopped by the addition of 3% phosphoric acid. A 10 μL portion of the reaction mixture was then spotted onto a P30 filter mat and washed four times for 4 min in 0.425% phosphoric acid and once in methanol prior to drying and scintillation counting.

Source of animals

C57BL/6 mice were purchased from GemPharmatech Co., Ltd. All mice were bred under standard conditions and used at the age of 6–8 weeks when the body weight was ~20 g. All procedures related to animal handling, care and treatment in in vivo efficacy studies were performed according to the guidelines approved by the Institutional Animal Care and Use Committee (IACUC) of West China Hospital, Sichuan University (20211062A). All procedures related to animal handling, care and treatment in pharmacokinetic (PK) studies were performed according to the guidelines approved by the Institutional Animal Care and Use Committee (IACUC) of Shanghai Medicilon Inc.

The TNFα-induced SIRS experiment

C57BL/6 female mice were first fasted for 12 h (with access to water), then pretreated with vehicle, RI-962 (40 mg/kg), or GSK3145095 (40 mg/kg; GSK3145095 was purchased from NewCompoundMarket Pharmatech Co. Ltd.) via intraperitoneal injection for around 15 min, and then challenged with mouse TNFα (300 μg/kg) via tail intravenous injection. The body temperatures of the mice were continuously monitored until 6 h after TNFα administration. At 6 h after TNFα injection, four mice in each group were killed at random, and the sera and heart, liver, spleen, lung, and kidney tissues were collected for analysis. Mouse mortality was continuously monitored until 72 h after TNFα administration.

The DSS-induced IBD experiment

DSS (3% w/v) was administered in drinking water ad libitum for 7 d (from day 0 to day 7). The DSS solution was replaced three times, on day 2, day 4, and day 6. C57BL/6 female mice were injected intraperitoneally with vehicle, RI-962 (40 mg/kg), or GSK3145095 (40 mg/kg) for 10 d (from day 0 to day 9). Three mice in each group were killed at random on day 7, and distal colon tissues were collected for analysis. Mouse body weight and survival rate were recorded daily.

Assessment of pharmacokinetic (PK) properties

The PK properties of compounds were examined in male Sprague-Dawley rats (n = 3 per group, weight: 180–220 g). Compounds were dissolved in saline with 5% (v/v) DMSO plus 40% (v/v) PEG400. The animals received a single dose of 5 mg/kg by intravenous injection (i.v.), or 20 mg/kg by intraperitoneal injection (i.p.) or oral gavage (p.o.). Blood samples were collected at 0.083, 0.25, 0.5, 1, 2, 4, 6, 8, 10 and 24 h, and centrifuged to isolate plasma. Subsequently, the plasma compound concentrations were determined by LC-MS/MS-13 (TQ5500, SCIEX), and the PK parameters were calculated using Phoenix WinNonlin 7.0.

Enzyme-linked immunosorbent assay (ELISA)

In the TNFα-induced SIRS model, at 6 h after TNFα injection, four mice in each group were killed at random and the sera were collected. The serum concentrations of IL-1β and IL-6 were measured using ELISA kits (Neobioscience Technology) according to the manufacturer’s instructions. On day 7 of experimental DSS-induced colitis, the distal colon tissues were harvested, washed with PBS, sliced into small pieces of ~1 mm³, and cultured in serum-free RPMI-1640 medium (1 mL/100 mg colon tissue) for 12 h. The supernatant was collected by sequential centrifugation at 500 × g for 10 min and 3000 × g for 10 min. The concentrations of the cytokines TNFα, IL-6, and IL-1β were measured using ELISA kits (Neobioscience Technology).

Histological analysis and immunohistochemistry staining

The heart, liver, spleen, lung, kidney and colon tissues were fixed directly in 4% paraformaldehyde (24 h), embedded in paraffin, and stained with H&E following standard procedures. All images were acquired using a Pannoramic MIDI scanner.

The colon tissues were fixed in 4% paraformaldehyde for 24 h. The tissues were sliced to a thickness of 5 µm, deparaffinized with xylene, and rehydrated with graded ethanol. The tissue sections were then placed in a repair box filled with citric acid (pH 6.0) antigen retrieval buffer for antigen retrieval in a microwave oven, followed by quenching of endogenous peroxidase activity in 3% hydrogen peroxide. The sections were incubated overnight at 4 °C with primary antibody (S100a9, 73425, CST), which was prepared in PBS (pH 7.4) according to the manufacturer’s instructions. The sections were then washed three times with PBS, incubated for 1 h with the appropriate secondary antibodies, and stained with freshly prepared DAB color developing solution. Subsequently, the sections were counterstained with hematoxylin and mounted in non-aqueous mounting medium. All images were acquired using a Pannoramic MIDI scanner.

Statistical analysis

Data in figures represent mean ± standard deviation (SD). Unless otherwise noted, the differences between two groups were analyzed by unpaired Student’s t-test, and differences with p-value < 0.05 were considered significant.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The SDF file of the generated data has been deposited in the Zenodo repository under https://doi.org/10.5281/zenodo.6451205. The crystal structure of the RIPK1–RI-962 complex has been deposited in the Protein Data Bank (PDB) under accession code 7YDX. The crystal structures of RIPK1 used in this study are available in the Protein Data Bank (PDB) under accession codes 4ITJ, 4ITI, 4ITH, 4NEU, 5HX6, 5TX5, 6C4D, 6HHO, 6NW2, 6R5F, 6OCQ, 6NYH, and 6RLN. All other data that support the conclusions are available from the corresponding authors on reasonable request. Source data are provided with this paper.

Code availability

Computer codes of our GDL model are provided as Supplementary Software and have been deposited in the Zenodo repository under https://doi.org/10.5281/zenodo.7074218.

References

Schneider, G. De Novo Molecular Design. (Wiley-VCH, 2013).
Schneider, G. & Fechner, U. Computer-based de novo design of druglike molecules. Nat. Rev. Drug Discov. 4, 649–663 (2005).
Yang, X., Wang, Y., Byrne, R., Schneider, G. & Yang, S. Concepts of artificial intelligence for computer-assisted drug discovery. Chem. Rev. 119, 10520–10594 (2019).
Schneider, G. & Fechner, U. Computer-based de novo design of drug-like molecules. Nat. Rev. Drug Discov. 4, 649–663 (2005).
Schneider, G. Future de novo drug design. Mol. Inf. 33, 397–402 (2014).
Segall, M. Advances in multiparameter optimization methods for de novo drug design. Expert Opin. Drug Discov. 9, 803–817 (2014).
Ruddigkeit, L., Blum, L. C. & Reymond, J.-L. Visualization and virtual screening of the chemical universe database GDB-17. J. Chem. Inf. Model. 53, 56–65 (2013).
Hartenfeller, M. et al. DOGS: Reaction-driven de novo design of bioactive compounds. PLoS Comput. Biol. 8, e1002380 (2012).
Huang, Q., Li, L. & Yang, S. PhDD: a new pharmacophore-based de novo design method of drug-like molecules combined with assessment of synthetic accessibility. J. Mol. Graph. Model. 28, 775–787 (2010).
Wang, W., Huang, Q. & Yang, S. In De Novo Molecular Design (Wiley-VCH, 2010).
Sousa, T. et al. Generative deep learning for targeted compound design. J. Chem. Inf. Model. 61, 5343–5361 (2021).
Segler, M. H. S., Kogej, T., Tyrchan, C. & Waller, M. P. Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Cent. Sci. 4, 120–131 (2018).
Moret, M., Friedrich, L., Grisoni, F., Merk, D. & Schneider, G. Generative molecular design in low data regimes. Nat. Mach. Intell. 2, 171–180 (2020).
Gómez-Bombarelli, R. et al. Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent. Sci. 4, 268–276 (2018).
Zhavoronkov, A. et al. Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat. Biotechnol. 37, 1038–1040 (2019).
Li, Y., Pei, J. & Lai, L. Structure-based de novo drug design using 3D deep generative models. Chem. Sci. 12, 13664–13675 (2021).
Wang, J. et al. Multi-constraint molecular generation based on conditional transformer, knowledge distillation and reinforcement learning. Nat. Mach. Intell. 3, 914–922 (2021).
Meyers, J., Fabian, B. & Brown, N. De novo molecular design and generative models. Drug Discov. Today 26, 2707–2715 (2021).
Tong, X. et al. Generative models for de novo drug design. J. Med. Chem. 64, 14011–14027 (2021).
Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016).
Kotsias, P. C. et al. Direct steering of de novo molecular generation with descriptor conditional recurrent neural networks. Nat. Mach. Intell. 2, 254–265 (2020).
Brown, N., Fiscato, M., Segler, M. H. S. & Vaucher, A. C. GuacaMol: Benchmarking models for de novo molecular design. J. Chem. Inf. Model. 59, 1096–1108 (2019).
Article CAS PubMed Google Scholar
Renz, P., Rompaey, D. V., Wegner, J. K., Hochreiter, S. & Klambauer, G. On failure modes in molecule generation and optimization. Drug Discov. Today Technol. 32, 55–63 (2019).
Article PubMed Google Scholar
Cireşan, D. C., Meier, U. & Schmidhuber, J. Transfer learning for Latin and Chinese characters with deep neural networks. In: The 2012 International Joint Conference on Neural Networks (IJCNN). 1−16 (IJCNN, 2012).
Bishop, C. M. Training with noise is equivalent to Tikhonov regularization. Neural Comput 7, 108–116 (1995).
Article Google Scholar
Reed, R. & Marks II, R. J. Neural Smithing: Supervised Learning in Feedforward Artificial Neural Networks. (MIT Press, 1999).
Harel, S. & Radinsky, K. Prototype-based compound discovery using deep generative models. Mol. Pharm. 15, 4406–4416 (2018).
Article CAS PubMed Google Scholar
Silke, J., Rickard, J. A. & Gerlic, M. The diverse role of RIP kinases in necroptosis and inflammation. Nat. Immunol. 16, 689–697 (2015).
Article CAS PubMed Google Scholar
Humphries, F., Yang, S., Wang, B. & Moynagh, P. N. RIP kinases: Key decision makers in cell death and innate immunity. Cell Death Differ. 22, 225–236 (2015).
Article CAS PubMed Google Scholar
He, S. & Wang, X. RIP kinases as modulators of inflammation and immunity. Nat. Immunol. 19, 912–922 (2018).
Article CAS PubMed Google Scholar
Li, J. et al. The RIP1/RIP3 necrosome forms a functional amyloid signaling complex required for programmed necrosis. Cell 150, 339–350 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wang, H. et al. Mixed lineage kinase domain-like protein MLKL causes necrotic membrane disruption upon phosphorylation by RIP3. Mol. Cell 54, 133–146 (2014).
Article CAS PubMed Google Scholar
Chen, X. et al. Translocation of mixed lineage kinase domain-like protein to plasma membrane leads to necrotic cell death. Cell Res. 24, 105–121 (2014).
Article CAS PubMed Google Scholar
Degterev, A., Ofengeim, D. & Yuan, J. Targeting RIPK1 for the treatment of human diseases. Proc. Natl Acad. Sci. USA 116, 9714–9722 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
Irwin, J. J. et al. ZINC: A free tool to discover chemistry for biology. J. Chem. Inf. Model. 52, 1757–1768 (2012).
Article CAS PubMed PubMed Central Google Scholar
McInnes, L., Healy, J., Saul, N. & Großberger, L. UMAP: uniform manifold approximation and projection. J. Open Source Softw. 3, 861 (2018).
Grisoni, F. et al. Combining generative artificial intelligence and on-chip synthesis for de novo drug design. Sci. Adv. 7, eabg3338 (2021).
Article CAS PubMed PubMed Central ADS Google Scholar
Bemis, G. W. & Murcko, M. A. The properties of known drugs. 1. Mol. Framew. J. Med. Chem. 39, 2887–2893 (1996).
Article CAS Google Scholar
González-Medina, M. & Medina-Franco, J. L. Platform for unified molecular analysis: PUMA. J. Chem. Inf. Model 57, 1735–1740 (2017).
Article PubMed Google Scholar
Yang, S. Pharmacophore modeling and applications in drug discovery: Challenges and recent advances. Drug Discov. Today 15, 444–450 (2010).
Article CAS PubMed Google Scholar
Zou, J. et al. Towards more accurate pharmacophore modeling: Multicomplex-based comprehensive pharmacophore map and most-frequent-feature pharmacophore model of CDK2. J. Mol. Graph. Model 27, 430–438 (2008).
Article CAS PubMed Google Scholar
Probst, D. & Reymond, J.-L. Visualization of very large high-dimensional data sets as minimum spanning trees. J. Cheminform. 12, 12 (2020).
Article PubMed PubMed Central Google Scholar
Lewell, X. Q. et al. RECAP–retrosynthetic combinatorial analysis procedure: a powerful new technique for identifying privileged molecular fragments with useful applications in combinatorial chemistry. J. Chem. Inf. Comput. Sci. 38, 511–522 (1998).
Article CAS PubMed Google Scholar
Xie, T. et al. Structural basis of RIP1 inhibition by necrostatins. Structure 21, 493–499 (2013).
Article CAS PubMed Google Scholar
Najjar, M. et al. Structure guided design of potent and selective ponatinib-based hybrid inhibitors for RIPK1. Cell Rep. 10, 1850–1860 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, Y. et al. Identification of 5-(2,3-dihydro-1H-indol-5-yl)-7H-pyrrolo[2,3-d] pyrimidin-4-amine derivatives as a new class of receptor-interacting protein kinase 1 (RIPK1) inhibitors, which showed potent activity in a tumor metastasis model. J. Med. Chem. 61, 11398–11414 (2018).
Article CAS PubMed Google Scholar
Harris, P. et al. Identification of a RIP1 kinase inhibitor clinical candidate (GSK3145095) for the treatment of pancreatic cancer. ACS Med. Chem. Lett. 10, 857–862 (2019).
Article CAS PubMed PubMed Central Google Scholar
Duprez, L. et al. RIP kinase-dependent necrosis drives lethal systemic inflammatory response syndrome. Immunity 35, 908–918 (2011).
Article CAS PubMed Google Scholar
Pasparakis, M. & Vandenabeele, P. Necroptosis and its role in inflammation. Nature 517, 311–320 (2015).
Article CAS PubMed ADS Google Scholar
Robertson, C. M. & Coopersmith, C. M. The systemic inflammatory response syndrome. Microbes Infect. 8, 1382–1389 (2006).
Article CAS PubMed Google Scholar
Negroni, A., Colantoni, E., Cucchiara, S. & Stronati, L. Necroptosis in intestinal inflammation and cancer: new concepts and therapeutic perspectives. Biomolecules 10, 1431 (2020).
Article CAS PubMed Central Google Scholar
Günther, C. et al. Caspase-8 regulates TNF-α-induced epithelial necroptosis and terminal ileitis. Nature 477, 335–339 (2011).
Article PubMed PubMed Central ADS Google Scholar
Welz, P.-S. et al. FADD prevents RIP3-mediated epithelial cell necrosis and chronic intestinal inflammation. Nature 477, 330–334 (2011).
Article CAS PubMed ADS Google Scholar
Weinlich, R., Oberst, A., Beere, H. M. & Green, D. R. Necroptosis in development, inflammation and disease. Nat. Rev. Mol. Cell Biol. 18, 127–136 (2017).
Article CAS PubMed Google Scholar
Liu, Z.-Y. et al. Necrostatin-1 reduces intestinal inflammation and colitis-associated tumorigenesis in mice. Am. J. Cancer Res. 5, 3174–3185 (2015).
CAS PubMed PubMed Central Google Scholar
Gaulton, A. et al. The ChEMBL database in 2017. Nucleic Acids Res. 45, D945–D954 (2017).
Article CAS PubMed Google Scholar
Wildman, S. A. & Crippen, G. M. Prediction of physicochemical parameters by atomic contributions. J. Chem. Inf. Comput. Sci. 39, 868–873 (1999).
Article CAS Google Scholar
Bickerton, G. R., Paolini, G. V., Besnard, J., Muresan, S. & Hopkins, A. L. Quantifying the chemical beauty of drugs. Nat. Chem. 4, 90–98 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bertz, S. H. The first general index of molecular complexity. J. Am. Chem. Soc. 103, 3599–3601 (1981).
Article CAS Google Scholar
Ertl, P., Rohde, B. & Selzer, P. Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties. J. Med. Chem. 43, 3714–3717 (2000).
Article CAS PubMed Google Scholar
Delaney, J. S. ESOL: estimating aqueous solubility directly from molecular structure. J. Chem. Inf. Comput. Sci. 44, 1000–1005 (2004).
Article CAS PubMed Google Scholar
Ertl, P. & Schuffenhauer, A. Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J. Cheminformatics 1, 8 (2009).
Article Google Scholar
Berman, H. M. et al. The protein data bank. Nucl. Acids Res. 28, 235–242 (2000).
Article CAS PubMed PubMed Central ADS Google Scholar
Jones, G., Willett, P., Glen, R. C., Leach, A. R. & Taylor, R. Development and validation of a genetic algorithm for flexible docking. J. Mol. Biol. 267, 727–748 (1997).
Article CAS PubMed Google Scholar
Jones, G., Willett, P. & Glen, R. C. Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. J. Mol. Biol. 245, 43–53 (1995).
Article CAS PubMed Google Scholar
Otwinowski, Z. & Minor, W. Processing of X-ray diffraction data collected in oscillation mode. Meth. Enzymol. 276, 307–326 (1997).
Article CAS Google Scholar
Collaborative Computational Project. The CCP4 suite: programs for protein crystallography. Acta Cryst. D. 50, 760–763 (1994).
Article Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D. 66, 486–501 (2010).
Article CAS PubMed PubMed Central Google Scholar
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D. 68, 352–367 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (T2221004, S.Y.; 81930125, S.Y.; 81773633, S.Y.; 61876034, J.Z.; 22207080, Y.L.), the National Postdoctoral Program for Innovative Talents of China (BX2021204, Y.L.), the China Postdoctoral Science Foundation (2021M702374, Y.L.), the 1.3.5 project for disciplines of excellence, West China Hospital, Sichuan University (ZYXY21001, S.Y.; ZYGD18001, S.Y.) and the Sichuan University postdoctoral interdisciplinary Innovation Fund. We also thank the staff of the Shanghai Synchrotron Radiation Facility (SSRF) beamlines (Shanghai, China) for their support.

Author information

These authors contributed equally: Yueshan Li, Liting Zhang, Yifei Wang, Jun Zou.

Authors and Affiliations

State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, 610041, Chengdu, Sichuan, China
Yueshan Li, Liting Zhang, Yifei Wang, Jun Zou, Ruicheng Yang, Chengyong Wu, Wei Yang, Chenyu Tian, Haixing Xu, Falu Wang, Xin Yang & Shengyong Yang
Key Laboratory of Drug Targeting and Drug Delivery System of Ministry of Education, West China School of Pharmacy, Sichuan University, 610041, Chengdu, Sichuan, China
Xinling Luo & Linli Li

Contributions

S.Y. conceived and supervised the research and designed the experiments. S.Y., Y.W., and J.Z. established the GDL model and performed virtual screening. Y.L., R.Y., H.X., and F.W. performed chemical syntheses, separation, purification, and structural characterizations. L.Z., X.L., and C.W. performed gene expression and protein purification, crystallization, diffraction data collection, and crystal structure determination. L.Z., X.L., W.Y., and C.T. performed cellular assays and in vivo studies. S.Y., Y.L., Y.W., L.Z., L.L., and X.Y. analyzed the data. S.Y., Y.L., Y.W., and L.Z. wrote the manuscript.

Corresponding author

Correspondence to Shengyong Yang.

Ethics declarations

Competing interests

The authors declare the following competing interests: Sichuan University has applied for Chinese patents on this work, covering the GDL model (application number: 202210883279.7; S.Y., Y.W., J.Z., and X.Y.) and RI-962 and related compounds (application number: 202110426935.6; S.Y.). The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Karim Abbasi and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Li, Y., Zhang, L., Wang, Y. et al. Generative deep learning enables the discovery of a potent and selective RIPK1 inhibitor. Nat Commun 13, 6891 (2022). https://doi.org/10.1038/s41467-022-34692-w

Received: 12 April 2022
Accepted: 03 November 2022
Published: 12 November 2022
DOI: https://doi.org/10.1038/s41467-022-34692-w
