Introduction

G protein-coupled receptors (GPCRs) are involved in a wide spectrum of physiological functions and they are the most attractive target for modern drug discovery. With the advances in both static crystal structures and molecular dynamics (MD) simulations, some common features in the activation mechanism of GPCRs have been identified, including the side-chain switches, the movement of transmembrane (TM) helices, and the formation of internal water channel1,2,3,4. However, an important question why certain molecules act as agonists whereas others, even with nearly identical structure, act as antagonists or invert agonists, is not well understood.

To answer this question and facilitate GPCR targeted drug discovery, we focus on the molecular fingerprint of the β2-adrenergic receptor (β2AR). β-adrenoceptors belong to rhodopsin-like GPCRs and are subdivided into three distinct groups: β1, β2, and β3 which are identified in cardiac, airway smooth muscle, and adipose tissue, respectively5. β2AR plays an important role in many physiological processes including inhibiting labor, delaying need of micturition, facilitating respiration and providing glucose fuel6,7. Similar to other receptors, there are three different ligand types of β2AR: agonists, antagonists and inverse agonists. The agonists activate the signaling pathways and increase the receptor’s basal activities. The antagonists block the signaling transductions without affecting the receptor’s basal activities. However, the inverse agonists block the receptor’s pocket and reduce the receptor’s basal activities4,8,9. All these three types of compounds have been extensively characterized even before their structures were determined10,11,12.

With the available crystal structures9,13,14,15, we are now able to envisage both active and inactive states of the receptor. Here, we further investigated the detailed binding modes of twelve different molecules (four agonists, four antagonists, and four inverse agonists) in the corresponding states of β2AR through 12 × 100 ns molecular dynamics (MD) simulations (a total of 1200 ns trajectories). The propensities of residue-ligand interactions were presented as interaction fingerprints. Moreover, the fluctuation of the protein backbone, the volumes in the intracellular pockets, the water densities in the receptors, the domain interaction networks as well as the movements of transmembrane helices are characterized for each class of compounds.

Results and Discussion

Molecular scaffolds of the agonists, antagonists and inverse agonists

Ligands are functionally categorized into three classes, namely agonists, antagonists and inverse agonists, based on their pharmacological functions. Four typical molecules were selected from each class of compounds and their protein-bound complexes were obtained from the reported crystal structures, or by protein-ligand docking if the experimental data were unavailable (Table 1). Specifically, for the agonist-bound systems, three of them were based on the crystal structures of BI167107 (Agon-1), HBI (Agon-2) and adrenaline (Agon-3) (PDB codes: 4LDE, 4LDL and 4LDO respectively)16, whilst the fourth one was built by docking salbutamol (Agon-4), a potent β2 selective agonist17,18, into the orthosteric site of an activated receptor (pdb: 4LDE). The five top ranked docking poses are identical to the agonist in the crystal structures. (Figure Suppl. 1) For the antagonist-bound systems, only the crystal structure of alprenolol (Anta-1) bound complex is available (PDB code: 3NYA)15. The most active enantiomer of bupranolol (Anta-2)19, nadolol (Anta-3)20 and propranolol (Anta-4)19 were selected for their specificity on β2 over β1 receptors21 and docked into the orthosteric site of an inactive receptor (pdb: 3NYA). For all four inverse agonists, their protein-bound structures are available (PDB codes: 2RH113, 3D4S22, 3NY815 and 3NY915 respectively).

Table 1 The details of the ligands in the twelve studied systems.

There are common features between each class of compounds (Fig. 1). In particular, all agonist molecules consist of an aromatic ring (ring I) connected to an ethanolamine backbone (Fig. 1a). An extra N-substituted isobutylphenyl ring (ring II) is found in both Agon-1 and Agon-2. This feature is not observed in any other ligands. Interestingly, both antagonists (Fig. 1b) and inverse agonists (Fig. 1c) contain an additional oxymethylene bridge between ring I and the ethanolamine backbone. These rings I are in general less polar and contain less Hbond donors.

Protein ligand interaction fingerprint

It is of great interest to investigate how the differences in the ligand scaffold and the receptor state affect the protein-ligand interactions. To illustrate how each ligand interacts with β2AR, we generated the interaction fingerprints based on the final 100 frames of our MD simulations. The root-mean-square deviations (RMSDs) of the receptor helices to the starting frame confirm the convergence of the simulations (Figure Suppl. 2). The ligands collectively interact with W1093.28, D1133.32, V1143.33, V1173.36, F193ECL2, Y1995.38, S2035.42, S2045.43, S2075.46, W2866.48, F2896.51, F2906.52, N2936.55, Y3087.35, I3097.36 and N3127.39 (Fig. 2). Noticeably, an ionic interaction is always observed between the ethanolamine of each ligand and D1133.32. The Hbonds between ethanolamine and N3127.39 are also frequently observed (frequency > 90%) in all ligands, except for Agon-4 (Fig. 3, left panel). These findings suggest that D1133.32 and N3127.39 are the key residues for the binding of all ligands. Moreover, agonists frequently form Hbonds with multiple residues in TM5, particularly with S2035.42 and S2075.46 (Fig. 3a). Interestingly, the catechol ring I of Agon-2 interacts less frequently with Y1995.38 and S2035.42 (frequency < 30%). However, its N-substituted p-isobutylphenol in ring II forms a prominent Hbond with the amide backbone of F193ECL2 (Fig. 3a, left panel). In contrast, antagonists and inverse agonists tend to have less polar interactions with TM5. Only Anta-3 can form rare Hbonds (frequency < 20%) with both S2035.42 and S2045.43 (Fig. 3b, left panel). For the inverse agonists, only iAgo-1 and iAgo-4 form isolated Hbonds with S2035.42 and S2045.43 respectively (Fig. 3c, left panel).

Figure 2
figure 2

The 16 important residues that form polar and/or hydrophobic interactions with a ligand.

(a) BI167107 (Agon-1). (b) Alprenolol (Anta-1). (c) Carazolol (iAgo-1).

Figure 3
figure 3

The interaction fingerprints of the twelve ligands with β2AR.

(a) The four agonists. (b) The four antagonists. (c) The four inverse agonists. (left panel) The normalized frequency of Hbonds/ionic bonds in the final 100 frames. (right panel) The normalized frequency of hydrophobic interactions in the final 100 frames.

Apart from polar interactions, hydrophobic interactions are essential for stabilizing or activating the receptor. Rings I of all ligands are enveloped between the hydrophobic side chains of V1173.36, F193ECL2 and F2896.51, except that Agon-4 interacts with F193ECL2 and N2936.55 instead (Fig. 3, right panel). Moreover, we noticed that the antagonists and the inverse agonists frequently form hydrophobic interactions with Y1995.38, S2035.42, S2075.46, W2866.48 and F2906.52, whereas the agonists rarely form such interactions with these residues (Fig. 3b,c, right panel). The analysis of interaction fingerprint has so far indicated that polar interactions between the ligand and the residues on TM5 are necessary for maintaining the active state of the receptor, whereas a ligand can stabilize the receptor by exerting hydrophobic interactions on this helix and also TM6. Hence, the distinct interaction patterns with different regions of the receptor can help predicting the nature of such a ligand. Historically, in contrast to isoprenaline (Fig. 1d) which is a potent agonist on β-adrenergic receptor, dichloroisoprenaline (DCI) (Fig. 1e) unexpectedly demonstrated antagonistic effects on the receptor23,24. Medicinal chemists were then intrigued by the possibility of developing an antagonist via appropriate chemical modifications of these molecules. Eventually, pronethalol (Fig. 1f), the naphthyl analog of isoprenaline, was the first beta blocker as a clinical candidate. The subsequent introduction of an oxymethylene bridge between the naphthalene and ethanolamine led to the discovery of propranolol (Anta-4) (Fig. 1b), the first clinically useful beta blocker23,24. Our results are consistent with the discovery process of beta blockers that the replacement of catechol ring with nathphalene reduces the Hbond interactions on TM5 and promotes the hydrophobic interactions. Meanwhile, the additional oxymethylene facilitates the hydrophobic interactions between the N-substituted group of the ligand and the residues on TM6. Therefore, one would expect the fingerprint of isoprenaline demonstrates frequent hydrogen bonds with TM5 and rare hydrophobic interactions with TM6. On the other hand, the fingerprints of DCI and pronathalol should reveal dominant hydrophobic interactions with both TM5 and TM6. We notice that the potent agonist Agon-4 does not share the common polar (N3127.39) and hydrophobic interactions (V1173.36 and F2896.51) as observed in other systems (Fig. 3a, right panel). Moreover, our MD trajectory shows that Agon-4 adopts a fairly different docking pose (Figure Suppl. 3). Unlike other ligands, the ring I of Agon-4 does not penetrate deeply into the pocket and therefore no hydrophobic interactions with V1173.36 and F2896.51 are formed. Instead, the 2-methylhydroxyl group (ring I) forms a Hbond with S2035.42 and consequently ring I is firmly stabilized next to the extracellular loop 2 (ECL2), in the vicinity of F193ECL2 and N2936.51. Interestingly, the phenol (ring I) frequently forms an intramolecular Hbond with the 2-methylhydroxyl group (ring I). On the other hand, the N-substituted isobutyl group of Agon-4 discourages N3127.39 from approaching the ethanolamine backbone and thus disrupts the ligand-N3127.39 Hbond.

Flexibility of each TM region

After identifying the 16 relevant residues for ligand binding, the effect of a ligand can be reflected from the overall root-mean-square fluctuations (RMSFs) of the backbone atoms. In the presence of agonists, the average RMSF (0.69 Å) is larger than that of antagonist-bound or inverse agonist-bound systems (both 0.56 Å) and such differences are found to be statistically significant (p < 0.01) (Table Suppl. 1). This phenomenon is related to the fact that several Hbonds between an agonist and TM5 probably disrupt the interactions between the helices and consequently increase the protein flexibility in the transmembrane region. To investigate the global effect of an incoming ligand, the RMSF of each TM helix (defined as W321.31-K601.59, V672.38-M962.67, N1033.22-T1363.55, A1504.42-M1714.63, Q1975.36-K2275.66, H2696.31-I2986.60 and K3057.32-R3287.55 respectively) is also calculated for all ligand-bound systems. In each corresponding region, the average RMSF of agonist-bound systems (0.91, 0.69, 0.75, 0.96, 0.86, 0.94 and 0.90 Å) is always greater than that of antagonist-bound (0.88, 0.60, 0.61, 0.80, 0.77, 0.71 and 0.67 Å) and inverse agonist-bound systems (0.85, 0.59, 0.58, 0.76, 0.73, 0.71 and 0.68 Å). The RMSF differences of all corresponding helices except for TM1 are also statistically significant (p < 0.01) (Fig. 4). It appears that agonist molecules not only increase the flexibility near the binding site, but also the major part of the receptor.

Figure 4
figure 4

The average root-mean-square-fluctuations (RMSFs) of the protein backbone in each helix, in the presence of agonists (Agon), antagonists (Anta) or inverse agonists (iAgo).

(red) TM1: W321.31 to K601.59. (orange) TM2: V672.38 to M962.67. (yellow) TM3: N1033.22 to T1363.55. (green) TM4: A1504.42 to M1714.63. (cyan) TM5: Q1975.36 to K2275.66. (blue) TM6: H2696.31 to I2986.60. (magenta) TM7: K3057.32 to R3287.55. An asterisk indicates a significant difference between the two values at p < 0.01, using a two-tailed t-test with equal variances.

The intracellular pockets and water influx

Another important feature of receptor activation is the expansion of the intracellular pocket for the G protein coupling process. In this study, the last 100 frames in each trajectory were selected for calculating the average size of the intracellular pocket. For agonist-bound systems, the receptors are in their activated states and the average pocket volumes vary between ~450 and ~950 Å3. In contrast, the receptor remains in an inactive state in the presence of an antagonist or an inverse agonist and therefore the average pocket volumes only fluctuate between ~100 and ~200 Å3 (Fig. 5). Moreover, the rotamer of the highly conserved Y3267.53 correlates with various states of the receptor and regulates the formation of internal water channel1. Hence, we calculated the number of water molecules within 4 Å around Y3267.53 alongside the pocket volume. In Agon-1 bound β2AR, water influx from the cytoplasmic region into the receptor is observed and the average number of water molecules around Y3267.53 is 23 (Fig. 6a). In Anta-1 bound β2AR and iAgo-1 bound β2AR, there are isolated water patches that are disconnected with the bulk water in the cytoplasm. The average number of water molecules for both cases is around 14, less than two-thirds of that in the activated state (Fig. 6b,c). The details of other systems are provided in the Supplementary Information (Figure Suppl. 4).

Figure 5
figure 5

The volumes of the intracellular pockets in the twelve studied systems and the cross sections of Agon-1 bound β2AR, Anta-1 bound β2AR and iAgo-1 bound β2AR.

(Grey) Cross section plane. (Colored circle) Intracellular pocket.

Figure 6
figure 6

The number of water molecules next to Y3267.53.

(a) Top: water density of Agon-1 bound β2AR in the final 100 frames. Bottom: The number of water molecules within 4 Å of Y3267.53 in Agon-1 bound β2AR. (b) Top: water density of Anta-1 bound β2AR in the final 100 frames. Bottom: The number of water molecules within 4 Å of Y3267.53 in Anta-1 bound β2AR. (a) Top: water density of iAgo-1 bound β2AR in the final 100 frames. Bottom: The number of water molecules within 4 Å of Y3267.53 in iAgo-1 bound β2AR.

Domain interaction network

The activation of receptors can lead to a rearrangement of local interactions in the intracellular region25. In order to unravel couplings among residues within the receptor, we constructed the domain interaction networks to facilitate the analysis of state-specific couplings. Each node presents a cluster of residues in close interaction, while the thickness of the line connecting the nodes is weighted by the correlation values between the two clusters. We noticed that agonist-bound systems form less domains than the antagonist- or inverse agonist-bound systems. For example, Agon-1 has only 8 nodes whereas Anta-1 and iAgo-1 have 11 and 9 nodes respectively (Fig. 7, bottom panel). This finding suggests that there are more discrete local interactions in the inactive state of the receptor. Noticeably the position of TM6 in the intracellular region varies with the state of the receptor, leading to a different interaction network. Among all agonist-bound systems, TM6 of the activated receptor moves outwards and eventually shares the same interaction domain with TM5 (Fig. 7a, top panel). In the antagonist- and inverse agonist-bound systems, TM6 of the inactive receptor closes up the intracellular pocket and therefore the residues in TM5 cluster with those in TM3 instead (Fig. 7b,c, top panel). These findings are in agreement with our recent work on P2Y1 receptor26.

Figure 7
figure 7

Simultaneous view of community residue interaction network and 3D structure of β2AR.

(a) Agon-1 bound β2AR (b) of Anta-1 bound β2AR (c) iAgo-1 bound β2AR.

Principal component analysis on the vibrational modes

The helix movements of agonist-, antagonist- and inverse agonist-bound receptors are very different in the intracellular region, but do not demonstrate consistent patterns in the extracellular region. Figure 8 shows the first and the lowest frequency mode of the alpha carbons. The proportion of variance of atom fluctuations versus eigenvalue rank is given in Figure Suppl. 5. In the extracellular region of Agon-1 bound β2AR, the extracellular loop 2 moves towards TM2, whereas TM5, TM6, together with extracellular loop 3 move away from TM2 and open up the ligand binding pocket (Fig. 8a). However, in Agon-2 bound β2AR, TM5, TM6 and the extracellular loops 2 and 3 close up the binding pocket (Fig. 8b). Similarly, in Anta-1 bound β2AR, TM6, TM7 and the extracellular loop 2 move towards TM2 (Fig. 8c), whereas, in Anta-2 bound β2AR, these groups move away from TM2 (Fig. 8d). For the iAgo-1 bound β2AR, the extracellular loop 2 move aside and the binding site is closed up by TM6, TM7 and the extracellular loop 3 (Fig. 8e). In contrast, the binding pocket of iAgo-2 bound β2AR is closed by the extracellular loop 2 (Fig. 8f). The helix movements in the intracellular region are more consistent. In agonist bound systems, TM6 and TM7 move away from each other and thus a large void space is created (Fig. 8a,b). In antagonist and inverse agonist bound systems, the movement of the alpha carbons in TM6 are more diverse and consequently the helix keeps the intracellular pocket closed (Fig. 8c–f).

Figure 8
figure 8

The helix movements of a ligand-bound β2AR.

(a) Agon-1 bound β2AR. (b) Agon-2 bound β2AR. (c) Anta-1 bound β2AR. (d) Anta-2 bound β2AR. (e) iAgo-1 bound β2AR. (f) iAgo-2 bound β2AR.

Conclusion

The crystal structures of β2AR provide new information of this classical GPCR drug target. Protein-ligand interaction fingerprints generated from MD trajectories help identify the important residues and the type of interactions required for designing ligands with desired property. In this study, using interaction fingerprints, we analyzed dynamic behavior of 16 important residues in the binding pockets, among which D1133.32 and N3127.39 are essential for ligand binding. The polar interactions with residues in TM5, particularly S2035.42 and S2075.46, are related to the agonistic properties, whereas hydrophobic interactions with residues in TM5 and TM6 help stabilize the receptor. We demonstrate that the molecular fingerprints can be a powerful tool in capturing the specific profile of protein-ligand interactions, and can be employed together with MD simulations in predicting the nature of a ligand. Agonists predominantly form Hbonds with TM5, disrupt the interactions between helices in the extracellular region and then in the rest of TM region leading to increase the flexibility of the protein. As a result, TM5 as well as TM6 form frequent non-polar interactions in the intracellular region and move away from TM7, causing the expansion of intracellular pocket and a water influx (Fig. 9a). This also explains why the residues of TM5 and TM6 in this region share the same interaction domain. In contrast, antagonists form prominently non-polar interactions with both TM5 and TM6 (Fig. 9b), whereas inverse agonists mainly form non-polar interactions with TM6 only (Fig. 9c). These modes of interactions stabilize the extracellular region of the receptor, resulting in a smaller intracellular pocket and a lower amount of water. Residues of TM5 and TM6 in the intracellular region form discrete domains and their vibrational modes become more diverse. All these findings explain the fundamental mechanism of receptor activation and use as good guidance for GPCR targeted drug discoveries.

Figure 9
figure 9

The relationship of protein-ligand interactions and the state of β2AR.

Capped lines: (thick) dominant interactions; (thin) rare interactions; (orange) hydrogen bond interactions; (black) non-polar interactions. Yellow spheres: water molecules. (a) green sphere: agonist. (b) blue sphere: antagonist. (c) pink sphere: inverse agonist.

Methods

Loop filling and refinements

All fusion proteins, lipids, counterions and stabilizing agents were removed from the experimental structures. The missing intracellular loop ICL2 in each receptor was rebuilt and refined using the loop refinement protocol in Modeller27 V9.10. A total of 2,000 loops were generated for each receptor and the conformation with the lowest DOPE (Discrete Optimized Protein Energy) score was chosen for receptor construction. Repaired models were submitted to Rosetta V3.4 for loop refinement with kinematic loop modeling methods28. Kinematic closure (KIC) is an analytic calculation inspired by robotics techniques for rapidly determining the possible conformations of linked objects subject to constraints. In the Rosetta KIC implementation, 2N - 6 backbone torsions of an N-residue peptide segment (called non-pivot torsions) are set to values drawn randomly from the Ramachandran space of each residue type, and the remaining 6 phi/psi torsions (called pivot torsions) are solved analytically by KIC.

Protein structure preparations

All protein models were prepared in Schrodinger suite software under OPLS_2005 force field29. Hydrogen atoms were added to repaired crystal structures according to the physiological pH (7.0) with the PROPKA30 tool in Protein Preparation tool in Maestro31 (2015, Schrödinger 2015) to optimize the hydrogen bond network. Constrained energy minimizations were conducted on the full-atomic models, with heavy atom coverage to 0.4 Å.

Ligand structure preparations and docking

All ligand structures were obtained from the PubChem32 online database. The LigPrep module in Schrodinger 2015 suite software was introduced for geometric optimization by using OPLS_2005 force field. The ionization state of ligands were calculated with Epik33 tool employing Hammett and Taft methods in conjunction with ionization and tautomerization tools33. For salbutamol (Agon-1), bupranolol (Anta-2), nadolol (Anta-3) and propranolol (Anta-4), flexible ligand docking calculations were performed using Glide module34. These molecules were docked into the ligand binding site of 4LDE for the agonist and that of 3NYA for the antagonists. The best pose with the lowest GScore was selected for MD simulations.

Molecular dynamics simulations

The membrane system was built in Desmond35, with the receptor crystal structure pre-aligned in the OPM (Orientations of Proteins in Membranes) database36,37. Pre-equilibrated 76-92 POPC lipids coupled with 6900-9200 TIP3P water molecules in a box ~ 65 Å x 63 Å x 93 Å were used for building the protein/membrane system. We modeled the protein, lipids, water and ions by using CHARMM36 force field38. The ligand was assigned with CHARMM CGenFF force field39. Next, an additional 20 ns constrained equilibration was performed at constant pressure and temperature (NPT ensemble; 310 K, 1 bar), and the force constant was trapped off gradually from 10 kcal/mol to 0 kcal/mol. All bond lengths to hydrogen atoms were constrained with M-SHAKE. Van der Waals and short-range electrostatic interactions were cut off at 10 Å. Long-range electrostatic interactions were computed by the Particle Mesh Ewald (PME) summation scheme. All MD simulations were done in Desmond35 with 40 ps step size. The root-mean-square fluctuations (RMSFs) of the protein backbone were calculated in Gromacs40. The root-mean-square deviations (RMSDs) of the protein helices and the principal component analysis (PCA) of the alpha carbons in the final 50 frames were calculated in VMD41.

Average water density calculation

Water density was calculated in Volmap plugin in VMD41. Volmap creates a map of the weighted atomic density at each grid point. This is done by replacing each atom in the selection with a normalized gaussian distribution of width (standard deviation) equal to its atomic radius. The gaussian distribution for each atom is then weighted using an optional weight read from one of the atoms’ numerical properties, and defaults to a weight of one. The various gaussians are then additively distributed on a grid. The meaning of final map will depend of the weights of mass. The average water density was calculated based on the final 100 frames of each long time scale MD simulations. Final output results were visualized in VMD.

Interaction fingerprint calculations

The interaction fingerprint between protein and ligand was done with IChem42. We first extracted the final 100 snapshots of the MD simulation, and shortlisted the residues with atoms less than 5 Å away from any ligand atoms. We then used IChem to convert protein−ligand coordinates into a bit-string fingerprint (TIFP) registering the corresponding molecular interaction pattern. TIFP fingerprints have been calculated for ca. 1000 protein−ligand complexes, enabling a broad comparison of relationships between interaction pattern similarities and ligand or binding site pairwise similarities42. In this work we kept the default parameters of IChem and focused on two types of interactions: polar interaction and hydrophobic contact. The former comprises ionic bond and Hbond, whilst the latter incorporates hydrophobic contacts, the face-to-face and edge-to-face between aromatic rings. 16 residues formed interactions with one of the ligands for at least 30 times.

Residue communication network calculation

Correlated atomic fluctuations of a particular receptor state were characterized as reported elsewhere43,44,45 using Bio3D. The network nodes represent residues, which are connected through edges weighted by their constituent atomic correlation values. Community analysis and node centrality with Bio3D and suboptimal path calculation with the WISP software46 were performed on each network to characterize network properties and to identify residues involved in the dynamic coupling of distal sites. The parameters for the suboptimal path analysis included input source and sink nodes, as well as the total number of paths to be calculated. The latter parameter was set to 500 paths, which was found to yield converged results in all cases45.

Additional Information

How to cite this article: Chan, H. C. S. et al. The Principles of Ligand Specificity on beta-2-adrenergic receptor. Sci. Rep. 6, 34736; doi: 10.1038/srep34736 (2016).