Structure and drug binding of the SARS-CoV-2 envelope protein transmembrane domain in lipid bilayers

Mandala, Venkata S.; McKay, Matthew J.; Shcherbakov, Alexander A.; Dregni, Aurelio J.; Kolocouris, Antonios; Hong, Mei

doi:10.1038/s41594-020-00536-8

Article
Published: 11 November 2020

Nature Structural & Molecular Biology volume 27, pages 1202–1208 (2020)

Abstract

An essential protein of the SARS-CoV-2 virus, the envelope protein E, forms a homopentameric cation channel that is important for virus pathogenicity. Here we report a 2.1-Å structure and the drug-binding site of E’s transmembrane domain (ETM), determined using solid-state NMR spectroscopy. In lipid bilayers that mimic the endoplasmic reticulum–Golgi intermediate compartment (ERGIC) membrane, ETM forms a five-helix bundle surrounding a narrow pore. The protein deviates from the ideal α-helical geometry due to three phenylalanine residues, which stack within each helix and between helices. Together with valine and leucine interdigitation, these cause a dehydrated pore compared with the viroporins of influenza viruses and HIV. Hexamethylene amiloride binds the polar amino-terminal lumen, whereas acidic pH affects the carboxy-terminal conformation. Thus, the N- and C-terminal halves of this bipartite channel may interact with other viral and host proteins semi-independently. The structure sets the stage for designing E inhibitors as antiviral drugs.

Main

Eight months into the COVID-19 pandemic, no vaccines or antiviral drugs are available against the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of the pandemic, owing to a lack of knowledge about the detailed structures and functions of the essential virus proteins. The RNA genome of SARS-CoV-2 encodes three membrane proteins (Fig. 1a): the spike protein, which binds the cell-surface receptor to mediate virus entry; the membrane protein, which contributes to virus assembly and budding¹; and the envelope protein E. E is a 75-residue viroporin (Fig. 1b) that forms a cation-selective channel across the ERGIC membrane^2,3. In SARS-CoV-1, E mediates the budding and release of progeny viruses⁴ and activates the host inflammasome⁵. E’s channel activity is blocked by hexamethylene amiloride (HMA)⁶ and amantadine (AMT)⁷; the latter also inhibits the viroporins of influenza A virus and HIV-1 (refs. ^8,9). E deletion gives rise to attenuated viruses in some coronaviruses^10,11,12, whereas E mutations that abolish channel activity cause reduced virus pathogenicity.¹² Thus E is a potential antiviral drug target and vaccine candidate against SARS-CoV-2.

**Fig. 1: Function, amino acid sequence and fingerprint NMR spectra of the SARS-CoV-2 E protein.**

Despite its importance to SARS-CoV-2 pathogenesis, E’s high-resolution structure, particularly for the ion-conducting transmembrane (TM) domain (residues 8–38) (Fig. 1b)^2,3, has been elusive. Sedimentation equilibrium and gel-electrophoresis data for the homologous SARS-CoV-1 E indicate that the TM domain assembles into a homopentamer in detergents such as sodium dodecyl sulfate (SDS) and perfluorooctanoic acid^6,13,14. Although early X-ray scattering data have suggested a helical hairpin model for E¹⁵, subsequent solution NMR studies of E bound to several detergent micelles, including dodecylphosphocholine (DPC)¹⁰, SDS⁶ and lyso-myristoylphosphatidylglycerol (LMPG)¹⁶, consistently indicate a single-span TM helix. However, the pore-facing residues and the pentameric assembly are not well-established. Fourier-transform infrared dichroic data suggest that the ETM helix orientation in lipid bilayers may be sensitive to the presence or absence of charged residues at the two termini of the TM domain, and by inference, the membrane surface charge^17,18.

Here, we use solid-state NMR to determine the structure of SARS-CoV-2 ETM in phospholipid bilayers, thereby avoiding potential structural distortion caused by detergents. The structure sets the stage for the design of E inhibitors as antiviral drugs.

Results

Backbone conformation of ETM in lipid bilayers

We reconstituted ETM into an ERGIC-mimetic lipid bilayer containing phosphatidylcholine, phosphatidylethanolamine, phosphatidylinositol, phosphatidylserine and cholesterol. For comparison, we also incorporated the protein into a dimyristoylphosphocholine (DMPC): dimyristoylphosphoglycerol (DMPG) model membrane, abbreviated as DMPX below. ETM was expressed in Escherichia coli using a hexahistidine (His₆)–small ubiquitin-like modifier (SUMO) fusion tag and purified first by nickel affinity column chromatography and then by reverse-phase HPLC after cleavage of the solubility tag (Extended Data Fig. 1).

One-dimensional (1D) ¹³C and ¹⁵N NMR spectra of the protein in ERGIC and DMPX membranes show temperature-insensitive high intensities (Extended Data Fig. 2a,b), indicating that the protein is immobilized in lipid bilayers at ambient temperature. Two-dimensional (2D) ¹⁵N-¹³C and ¹³C-¹³C correlation spectra show well-resolved peaks (Fig. 1c,d), with ¹³C and ¹⁵N linewidths of 0.5 ppm and 0.9 ppm, respectively, indicating that the protein conformation is highly homogeneous. We assigned the chemical shifts using three-dimensional (3D) correlation NMR experiments (Extended Data Fig. 3a). These chemical shifts indicate that residues 14–34 form the α-helical core of the TM domain (Extended Data Fig. 3b,c and Supplementary Table 1). Comparison of spectra between the two membranes and at different temperatures (Extended Data Fig. 2d–f) indicates that the N-terminal segment (residues Glu8–Ile13) is dynamic at high temperature but is mostly α-helical, whereas the C-terminal segment (residues Thr35–Arg38) is more rigid but displays temperature-dependent conformations. Acidic pH perturbed the chemical shifts of C-terminal residues Leu34 to Arg38 (Extended Data Fig. 4), supporting the conclusion that the C terminus is conformationally plastic.

Oligomeric structure and hydration of ETM

The overall temperature insensitivity of the protein spectra suggests that ETM is oligomerized in lipid bilayers. To determine the oligomeric structure, we prepared two mixed labeled samples to measure intermolecular contacts. An equimolar mixture of ¹³C-labeled protein and 4-¹⁹F-Phe-labeled protein (Extended Data Fig. 1e) was used to measure intermolecular ¹³C-¹⁹F distances using the rotational-echo double-resonance (REDOR) technique¹⁹ (Fig. 2a). ETM contains three regularly spaced Phe residues, Phe20, Phe23 and Phe26, at the center of the TM segment. 1D and 2D ¹³C NMR spectra were measured without and with ¹⁹F pulses. The resulting difference spectra show the signals of carbons that are in close proximity to a fluorinated Phe on a neighboring helix (Fig. 2b and Extended Data Fig. 5a–c). As expected, residues Val17 to Leu31 are affected by 4-¹⁹F-Phe, while residues Ile13 to Ser16 and Ala36 to Arg38 show no REDOR dephasing. Moreover, the three Phe residues display 2 resolved ¹⁹F chemical shifts with a roughly 2:1 intensity ratio, indicating that one of the residues has a distinct side chain conformation. A 2D ¹³C-¹⁹F correlation spectrum (Fig. 2c) shows a cross-peak between the −118 ppm ¹⁹F signal and Ala22 Cβ, indicating that this −118 ppm peak is due to either Phe20 or Phe23. The −113 ppm ¹⁹F peak shows strong cross-peaks with aromatic and numerous aliphatic ¹³C chemical shifts. Since Phe20 and Phe26 are too far away from each other to form intermolecular contacts, the −118 ppm ¹⁹F peak must be assigned to Phe20, while the −113 ppm peak must be assigned to Phe23 and Phe26. To constrain the interhelical packing at the two termini of the TM domain, we prepared a sample with mixed ¹³C and ¹⁵N labels and measured 2D NHHC correlation spectra to identify exclusively intermolecular ¹⁵N-¹³C correlations (Fig. 2d). 
These experiments together yielded 35 interhelical ¹³C-¹⁹F distance restraints and 52 interhelical ¹⁵N-¹³C correlations, which are crucial for determining the oligomeric structure of ETM.
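For orientation, the steep distance dependence that REDOR exploits can be sketched numerically. The snippet below is not taken from the study's analysis (the REDOR simulations and fitting are described in the Supplementary Notes); it only evaluates the standard dipolar coupling constant for an isolated ¹³C-¹⁹F spin pair from textbook physical constants. The function name `dipolar_coupling_hz` and the example distances are illustrative assumptions.

```python
import math

# Physical constants in SI units (textbook values, not taken from the article).
MU0_OVER_4PI = 1.0e-7      # vacuum permeability / (4*pi), in H/m
HBAR = 1.054571817e-34     # reduced Planck constant, in J s
GAMMA_13C = 6.728284e7     # 13C gyromagnetic ratio, in rad s^-1 T^-1
GAMMA_19F = 2.518148e8     # 19F gyromagnetic ratio, in rad s^-1 T^-1


def dipolar_coupling_hz(r_angstrom):
    """Magnitude of the 13C-19F dipolar coupling constant (Hz) at distance r (angstrom)."""
    r_m = r_angstrom * 1.0e-10
    return MU0_OVER_4PI * GAMMA_13C * GAMMA_19F * HBAR / (2.0 * math.pi * r_m ** 3)


# Hypothetical example distances; the coupling falls off as 1/r^3.
for r in (4.0, 6.0, 8.0, 10.0):
    print(f"r = {r:4.1f} A -> coupling = {dipolar_coupling_hz(r):6.1f} Hz")
```

Because the coupling scales as 1/r³, doubling the distance weakens it eightfold, which is why dephasing is observed only for carbons close to a fluorinated Phe on a neighboring helix.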

**Fig. 2: Measurement of interhelical distances and water accessibility of membrane-bound ETM.**

To further constrain the architecture of ETM self-assembly, we measured residue-specific water accessibilities using water-edited 2D ¹⁵N-¹³C correlation experiments (Fig. 2e and Extended Data Fig. 5d)^20,21. Water ¹H magnetization transfer is the highest to the N-terminal residues, is the least to the central residues Val17 to Ala32 and is moderate to the C terminus (Fig. 2f). Thus, the hydration gradient of the protein is primarily along the bilayer normal. The preferential hydration of the N terminus is especially manifested by the high water-transferred intensity of Leu19 compared with that of Thr30, despite favorable chemical exchange to the Thr side chain^22,23,24. For the dehydrated center of the TM domain, Leu28 and Val25 show higher hydration than do their neighboring residues, suggesting that these two residues face the pore. A complementary lipid-edited experiment (Fig. 2g) showed much higher intensities for the Phe side chain carbons than their corresponding water-transferred intensities, indicating that the Phe residues are largely lipid-facing. The ERGIC-bound ETM shows twofold lower water accessibility than that of the closed state of the influenza BM2 at the same pH²⁵ (Fig. 2f).

Structure calculation of ETM in ERGIC membranes

We calculated the structure of ETM using the above 56 (ϕ, ψ) torsion angles, 87 interhelical distance restraints (Supplementary Tables 2 and 3) and 196 intrahelical ¹³C-¹³C contacts obtained from 250-ms 2D ¹³C spin diffusion spectra (Extended Data Fig. 6)²⁶. Initial calculation using directionally ambiguous interhelical contacts where the observed helix is assumed to contact either of the two neighboring helices did not converge. Since previously reported micelle-bound ETM structures show substantial variations in pore residue identities and handedness of the helical bundle, we evaluated various pentamer packing models (Extended Data Fig. 7 and Supplementary Table 4) for their agreement with experimentally measured constraints, including the water and lipid accessibilities, interhelical Phe–Phe contact in the ¹³C-¹⁹F REDOR data and ¹³C secondary chemical shifts. A single pentamer model, characterized by having Asn15 and Val25 at similar pore-facing orientations and all three Phe residues facing lipids, was found to best describe the experimental data. This model was subsequently used to disambiguate the direction of interhelical contacts.

The lowest-energy structure ensemble, calculated using XPLOR-NIH (Supplementary Table 5 and Table 1), shows a long and tight 5-helix bundle with a vertical length of ~35 Å for residues Val14–Leu34. The structure resolution is higher for the middle of the TM domain, where ¹³C-¹⁹F REDOR distance restraints are available, and lower for the two termini, where fewer distance restraints are available (Fig. 3a and Extended Data Fig. 8a,b). The side chain rotamers are not precisely defined, especially for side chains well away from the central three Phe residues (Extended Data Fig. 8b). The channel diameter, represented by backbone Cα–Cα distances between helices i and i + 2 for pore-facing residues, varies from 11 Å to 14 Å. The helix is tilted by a small angle of 5–10° from the bilayer normal (Fig. 3b), but the orientation is not uniform along the length of the peptide, because the helix is not ideal: it exhibits a rotation angle change, or twist, between residues Phe20 and Phe23 (refs. ^10,16). Consistent with the small tilt angle, the helical bundle does not display a strong handedness. The pore of the channel is occupied by predominantly hydrophobic residues, including Asn15, Leu18, Leu21, Val25, Leu28, Ala32 and Thr35 (Fig. 3b,c and Extended Data Fig. 8a,b), explaining the poor hydration of the protein. The N-terminal pore is constricted by Asn15, which forms interhelical side chain hydrogen bonds (Fig. 3g)²⁷. The pore-facing positions of Asn15 and Val25 are consistent with single-channel conductance data showing that p.N15A and p.V25F abolish cation conductance^3,7. The helix–helix interface is stabilized by aromatic stacking of Phe23 and Phe26 (Fig. 3e,g) and van der Waals packing among methyl-rich residues such as the Val29–Leu31–Ile33 triad (Fig. 3f). These extensive hydrophobic interactions give rise to a tighter helical bundle than do the viroporins influenza BM2 and HIV-1 Vpu (Extended Data Fig. 8d).

Table 1 NMR and refinement statistics

**Fig. 3: Structure of SARS-CoV-2 envelope protein’s transmembrane domain in ERGIC-mimetic lipid bilayers.**

ETM interactions with hexamethylene amiloride and amantadine

To investigate how ETM interacts with drugs, we measured the chemical shifts of the protein in the presence of HMA and [3-¹⁹F]amantadine. At a drug:protein molar ratio of 4:1, HMA caused significant chemical shift perturbations (CSPs) to N-terminal residues, including Thr9, Gly10, Thr11, Ile13 and Ser16, followed by more modest CSPs for the C-terminal Ala36 and Leu37 (Fig. 4a–c). This trend is consistent with the micelle data^10,16, but the CSPs in lipid bilayers are much larger, with the N-terminal ⁹TGT¹¹ triplet giving per-residue CSPs of 0.35–0.70 ppm. Moreover, the CSPs in lipid bilayers were measured under only fourfold drug excess, while in micelles, the smaller CSPs were measured under higher drug excesses of 10- to 31-fold^10,16.

**Fig. 4: Effects of HMA and AMT binding to ETM in DMPC: DMPG membranes.**

The higher sensitivity of ETM to HMA in lipid bilayers strongly suggests that the bilayer-bound protein conformation is more native. A docking pose based on these CSPs found that HMA intercalates shallowly into the N-terminal lumen with a distribution of orientations (Fig. 4d and Extended Data Fig. 9), suggesting a dynamic binding mode wherein HMA exchanges between multiple helices and inhibits cation conduction by steric occlusion of the pore. Within the ensemble of docked structures, more HMA molecules point the guanidinium into the pore and the hexamethylene ring towards the lipid headgroups than in the reverse orientation. AMT caused smaller CSPs than HMA (Fig. 4c and Extended Data Fig. 10a,b), but the binding site remains at the N terminus. Using the 3-¹⁹F label on adamantane, we measured protein-drug proximities using ¹³C-¹⁹F REDOR. The spectra showed modest dephasing for the N-terminal Asn15 and C-terminal Ile33 (Extended Data Fig. 10c–e), in qualitative agreement with the observed CSPs. The CSPs of HMA are larger than those of AMT and are consistent with the stronger affinity of HMA⁶ than AMT⁷ for SARS-CoV E, as well as with the micromolar half-maximal effective concentration (EC₅₀) reported for HMA against other human coronavirus E proteins²⁸.

Discussion

The current lipid-bilayer-based structural model of SARS-CoV-2 ETM has similarities with, but also considerable differences from, micelle-derived structural models (PDB 5X29)¹⁶. In LMPG micelles, the TM domain of a longer E construct (residues 8–65) also displays a kinked helix and a disordered N terminus, but the helical bundle is right-handed¹⁶, and the helices are more tilted and loosely packed (Extended Data Fig. 8c). In comparison, the bilayer-based ETM structural model does not have a strong handedness, consistent with the small helical tilt angle, and both reflect the measured interhelical distance restraints (Supplementary Tables 2 and 3). The heavy-atom r.m.s. deviation (r.m.s.d.) for residues 14–34 between the 2 structural models is 6.1 Å, and the positions of various important residues differ. For example, in the LMPG-derived structural model, Phe26 is pore-facing and Thr30 is interhelical¹⁶, but in the bilayer-derived structure model, both residues point to lipids. The lipid-facing position of Thr30 in the current model is supported by single-channel conductance data showing that mutations of residues such as Thr30 and Thr11 to Ala do not affect the channel activity³. Another structural model of ETM determined in DPC micelles¹⁰ showed a left-handed and coiled helical bundle that differs qualitatively from the LMPG-bound model. These structural differences likely result from a combination of insufficient experimental restraints as well as an inherent conformational plasticity of the ETM. The LMPG-based structural model was obtained from ten unambiguous interhelical distances but no orientational restraints¹⁶, whereas the DPC-based structural model was built with orientational restraints but no unambiguous interhelical distance restraints¹⁰. For comparison, the current bilayer-derived ETM structure model was calculated from 87 interhelical distance constraints (Table 1).

Apart from experimental limitations, ETM’s oligomeric structure may be intrinsically sensitive to the membrane environment²⁹ because the highly hydrophobic nature of the long central portion of the TM segment makes interhelical interactions non-specific. Indeed, SARS-CoV viruses with a p.V25F mutation develop escape mutants p.L27S, p.L19A, p.T30I and p.L37R in mice, implying that E’s channel activity is restored by these compensatory double mutations¹². We speculate this could result from moderate changes of the helix rotation angle to give rise to alternate packing of the helical bundle. Future studies of E mutants are required to elucidate the structural basis for the loss and restoration of ion-channel activity.

How does the SARS-CoV-2 ETM structure compare with the structures of equivalent viroporins of influenza and HIV-1 viruses in lipid bilayers? The ETM helical bundle is compact and rigid, while AM2 and BM2’s TM domains, which have a higher percentage of polar residues such as His and Ser, form wider and more hydrated pores (Extended Data Fig. 8d)^9,25. The HIV-1 Vpu TM domain has a high percentage of hydrophobic residues, similarly to SARS-CoV-2 E, but forms a shorter (~20 Å vertical length) pentameric helical bundle with more tilted helices (~20°)^30,31. The ETM helical bundle is more immobilized than M2 and Vpu helical bundles³², and does not undergo rigid-body fast uniaxial rotation at high temperatures in DMPX membranes (Extended Data Fig. 2). This immobilization suggests that ETM may interact extensively with lipids³. Finally, the helix distortion at residues Phe20–Phe23 may cause the two halves of the protein to respond semi-independently to environmental factors such as pH, charge, membrane composition and other viral and host proteins.

Which structural features of this ETM helical bundle might be responsible for cation conduction? We hypothesize that the N terminus, which contains a (E/D/R)₈X(G/A/V)₁₀XXhh(N/Q)₁₅ motif (Fig. 1b), where h is a hydrophobic residue, contains the cation selectivity filter. In this conserved motif, the most exposed residue, Glu8, belongs to a dynamic N terminus whose residues (for example Thr9 and Gly10) manifest intensities only at high temperature (Extended Data Fig. 2d–f). The Glu8 side chain carboxyl is deprotonated at neutral pH and protonated at acidic pH, as manifested by ¹³C chemical shifts (Extended Data Fig. 2c). We speculate that the protonation equilibria of this loose ring of five Glu8 residues, together with the anionic lipids in the ERGIC membrane, may regulate the ion selectivity of ETM at the channel entrance. Such rings of negatively charged Glu residues have been observed as selectivity filters in the hexameric Ca²⁺-selective Orai channels³³ and designed K⁺ channels³⁴. The third residue of the motif (G/A/V) is conserved among coronaviruses to be small and flexible (Fig. 1b), which might permit N-terminus motion and/or prevent occlusion of the channel lumen. The last residue of the motif is conserved to be either Asn or Gln, whose polar side chains can coordinate ions and participate in interhelical hydrogen bonds to stabilize the channel²⁷. At the C-terminal end of the TM segment, the conserved small residues Ala32 and Thr35 provide an open cavity for ions. In contrast to these small polar residues, the central portion of the TM domain contains four layers of hydrophobic residues, Leu18, Leu21, Val25 and Leu28, which narrow the pore radius to ~2 Å (Fig. 3d). This narrow pore can permit only a single file of water molecules, thus partially dehydrating any ions that move through the pore. Therefore, the structure determined here may represent the closed state of SARS-CoV-2 E, while the open state might have a larger and more hydrated pore.
Narrow pores with multiple hydrophobic layers have also been observed in larger ion channels, including the tetrameric K⁺ channel TMEM175 (ref. ³⁵) and the pentameric bestrophin channels^36,37. Thus, it is possible to achieve charge stabilization and ion selectivity in such a hydrophobic environment, although the detailed mechanisms remain to be understood.

The present membrane-bound ETM structure suggests that small-molecule drugs should have high-affinity binding to both the acidic Glu8 and the polar Asn15 in order to occlude the N-terminal entrance of the protein. The membrane topology of SARS-CoV-2 E is now recognized to be N_lumen–C_cyto on the basis of antibody-detected selective permeabilization assays³⁸ and glycosylation data³⁹. This orientation may prime the protein to conduct Ca²⁺ out of the ERGIC lumen to activate the host inflammasome⁵. Thus, small-molecule drugs should ideally be targeted and delivered to the Golgi and ERGIC of host cells to maximally inhibit SARS-CoV-2 E’s channel activity⁴⁰.

Methods

Cloning of recombinant ETM(8–38)

The gene encoding full-length SARS-CoV-2 E protein (NCBI reference sequence YP_009724392.1, residues 1–75) was purchased from Genewiz. The gene encoding the TM domain (residues 8–38, ETGTLIVNSVLLFLAFVVFLLVTLAILTALR) was isolated using PCR and cloned into a Champion pET-SUMO plasmid (Invitrogen). The plasmid was transformed into E. coli BL21 (DE3) cells (Invitrogen) to express the SUMO–ETM fusion protein containing an N-terminal His₆ tag (Extended Data Fig. 1a). The construct’s DNA sequence was verified by Sanger sequencing (Genewiz).
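Because residue numbers from the full-length protein are used throughout, it can help to map the construct sequence onto that numbering. The following is a small illustrative helper, not part of the study's workflow; the names `ETM_SEQ`, `FIRST_RESIDUE`, `residue_at` and `positions_of` are hypothetical. It indexes the 31-residue construct starting at residue 8.

```python
# Construct sequence and its first residue number, as given in the text above.
ETM_SEQ = "ETGTLIVNSVLLFLAFVVFLLVTLAILTALR"
FIRST_RESIDUE = 8


def residue_at(number):
    """Return the one-letter amino acid code at a full-length residue number."""
    index = number - FIRST_RESIDUE
    if not 0 <= index < len(ETM_SEQ):
        raise ValueError(f"residue {number} is outside the construct")
    return ETM_SEQ[index]


def positions_of(amino_acid):
    """Return the full-length residue numbers at which an amino acid occurs."""
    return [FIRST_RESIDUE + i for i, aa in enumerate(ETM_SEQ) if aa == amino_acid]


print(len(ETM_SEQ))                    # 31 residues (8 through 38)
print(positions_of("F"))               # [20, 23, 26]
print(residue_at(15), residue_at(25))  # N V
```

The three Phe positions returned by this numbering match the regularly spaced Phe20, Phe23 and Phe26 described in the Results.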

Expression and purification of [¹³C,¹⁵N]ETM

A glycerol cell swab stored at –70 °C was used to start a 10-ml LB culture containing 50 μg ml^–1 kanamycin. The starter culture was used to inoculate 2 l of LB medium. Cells were grown at 37 °C until an optical density at 600 nm (OD₆₀₀) of 0.6–0.8 was reached, and were collected by centrifugation for 10 min at 20 °C and 4,400g. The LB-grown cells were resuspended in 1 l of M9 medium (pH 7.8, 48 mM Na₂HPO₄, 22 mM KH₂PO₄, 8.6 mM NaCl, 4 mM MgSO₄, 0.2 mM CaCl₂, 50 mg kanamycin) containing 1 g l^–1 ¹⁵N-NH₄Cl. The cells were incubated in M9 medium for 30 min at 18 °C, then 1 g l^–1 [U-¹³C]glucose dissolved in 5 ml sterile H₂O and 3 ml 100× MEM vitamins were added. The cells were grown for another 30 min, then protein expression was induced by addition of 0.4 mM IPTG along with 2 g l^–1 [U-¹³C]glucose in 10 ml sterile H₂O. Additional IPTG was added after 1 h to bring the final concentration to 0.8 mM. Protein expression proceeded overnight for 16 h at 18 °C, reaching an OD₆₀₀ of 2.5.

The cells were spun down at 4 °C and 5,000 r.p.m. for 10 min and resuspended in 35 ml Lysis Buffer I (pH 8.0, 50 mM Tris-HCl, 100 mM NaCl, 1.0% Triton X-100, 0.5 mg ml^–1 lysozyme, 10 μl benzonase nuclease, 1 mM Mg²⁺, 10 mM imidazole). Cells were lysed at 4 °C by sonication (5 s on and 5 s off) for 1 h using a probe sonicator. The soluble fraction of the cell lysate was separated from the inclusion bodies by centrifugation at 17,000g for 20 min at 4 °C. The supernatant was loaded onto a gravity-flow chromatography column containing ~6 ml nickel affinity resin (Profinity IMAC, BioRad) that was pre-equilibrated with Lysis Buffer I. The supernatant was bound to the resin for 1 h by gentle rocking at 4 °C. The column was washed with 50 ml of Wash Buffer I (pH 8.0, 50 mM Tris-HCl, 100 mM NaCl, 0.1% DDM, 30 mM imidazole). SUMO–ETM was eluted with 10–15 ml elution buffer (pH 8.0, 50 mM Tris-HCl, 100 mM NaCl, 0.1% DDM, 250 mM imidazole) (Extended Data Fig. 1b). The eluted protein was diluted to one-third of the original concentration by adding twice the elution volume of dilution buffer (pH 8.0, 50 mM Tris-HCl, 100 mM NaCl, 0.1% DDM) to reduce the imidazole concentration before protease cleavage. Approximately 20% of the protein was found in the insoluble membrane and inclusion body fraction. To purify this fraction, the pelleted mass was resuspended in Lysis Buffer II (Lysis Buffer I with added 6 M urea) and rocked gently at 4 °C overnight. Soluble protein was isolated by centrifugation at 17,000g for 20 min at 4 °C. Nickel affinity column chromatography proceeded as described above for the soluble fraction, except that Wash Buffer II (Wash Buffer I with added 3 M urea) was used in place of Wash Buffer I.

The purified SUMO–ETM from both the soluble and inclusion body fractions was cleaved by adding 1:10 (wt/wt) SUMO protease:SUMO–ETM and 5 mM TCEP for 2 h at room temperature with gentle rocking. The cleavage efficiency was assessed by analytical HPLC to be ~75%. ETM was purified using preparative RP–HPLC on a Varian ProStar 210 System using an Agilent C3 column (5-μm particle size, 21.2 mm × 150 mm). The protein was eluted using a linear gradient of 5–99% (9:1, acetonitrile:isopropanol):water containing 0.1% trifluoroacetic acid over 35 min at a flow rate of 10 ml min^–1 (Extended Data Fig. 1c). The purified protein was dried down to a film with a stream of nitrogen gas and placed under vacuum overnight. The protein film was stored at −20 °C. The yield of the purified protein was 10 mg l^–1 of M9 medium. Labeling efficiency was ~94%, as estimated by MALDI mass spectrometry (Extended Data Fig. 1d). [U-¹³C]ETM and [U-¹⁵N]ETM were expressed and purified using the same protocol but substituting [¹⁵N]NH₄Cl or [¹³C]glucose with unlabeled reagents.

Expression of 4-¹⁹F-Phe fluorinated ETM

A glycerol cell swab was used to start a 10-ml LB culture containing 50 μg ml^–1 kanamycin. The starter culture was then used to inoculate 2 l of M9 medium (pH 7.8, 48 mM Na₂HPO₄, 22 mM KH₂PO₄, 8.6 mM NaCl, 4 mM MgSO₄, 0.2 mM CaCl₂, 50 mg kanamycin) containing 3 g l^–1 unlabeled glucose and 1 g l^–1 unlabeled NH₄Cl. The cells were grown in M9 medium at 37 °C for 8 h until an OD₆₀₀ of 0.5 was reached. The cells were collected by centrifugation at 4,400g for 10 min at 20 °C, then concentrated into a fresh 1-l M9 culture and incubated at 30 °C for 60 min. Subsequently, 1.5 g l^–1 glyphosate was added to halt the pentose phosphate pathway⁴¹, followed by addition of 115 mg L-Trp, 115 mg L-Tyr and 400 mg of 4-¹⁹F-L-Phe to the culture. After 30 min, IPTG was added to a final concentration of 0.4 mM, and protein expression proceeded at 30 °C for 5.5 h. The cells were collected by centrifugation at 4,400g for 10 min at 4 °C. The pellet was stored at −70 °C until purification. Cell lysis and protein purification followed the same protocol as for [¹³C,¹⁵N]ETM, except that the ETM peak during preparative HPLC was collected in 2 fractions of ~1 min each. Fluorine incorporation in the two fractions was measured using MALDI mass spectrometry. In the first fraction, which had the higher incorporation level, 83% of the protein had all 3 Phe residues labeled with ¹⁹F, indicating a per-residue labeling efficiency of 94% (Extended Data Fig. 1e). Only this fraction was used to prepare the mixed ¹³C- and ¹⁹F-labeled protein for distance measurement. The final yield of the Phe-fluorinated ETM expression was 1.5 mg l^–1 of M9 medium. When the protocol was originally tested using 100 mg l^–1 4-¹⁹F-Phe, 1.0 g l^–1 glyphosate, 6 g l^–1 unlabeled glucose and with expression at 18 °C for 5.5 h, a much lower per-residue labeling efficiency of ~35% was obtained.
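The relation between the fraction of fully fluorinated protein and the per-residue efficiency quoted above can be checked with a short calculation. This sketch assumes that the three Phe sites are labeled independently and with equal probability, which is a simplifying assumption rather than something stated in the text; the function names are illustrative.

```python
def per_residue_efficiency(fraction_all_labeled, n_sites):
    """Per-site labeling probability, given the fraction with every site labeled."""
    return fraction_all_labeled ** (1.0 / n_sites)


def fraction_fully_labeled(per_residue, n_sites):
    """Fraction of molecules with every site labeled, given the per-site probability."""
    return per_residue ** n_sites


# 83% of molecules carrying three 19F-Phe corresponds to ~94% per residue.
print(round(per_residue_efficiency(0.83, 3), 2))   # 0.94

# Under the same assumption, ~35% per residue leaves only ~4% fully labeled.
print(round(fraction_fully_labeled(0.35, 3), 3))   # 0.043
```

Under this assumption, the ~35% per-residue efficiency of the initial protocol would leave only a few percent of molecules with all three Phe residues fluorinated, which illustrates why the optimized conditions matter for the distance measurements.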

Membrane sample preparation

Eight protein samples were prepared for this study. Five membrane samples contained [¹³C,¹⁵N]ETM and one contained [¹³C]ETM. Another sample contained a 1:1 mixture of ¹³C-labeled protein:¹⁵N-labeled protein. The last sample contained a 1:1 mixture of ¹³C-labeled protein:4-¹⁹F-Phe-labeled protein. Six of the 8 samples were prepared in a pH 7.5 Tris buffer (20 mM Tris-HCl, 5 mM NaCl, 2 mM EDTA and 0.2 mM NaN₃). One sample was prepared in a pH 5 citrate buffer with calcium (20 mM citrate, 5 mM CaCl₂ and 0.2 mM NaN₃), while the final sample was prepared in the same pH 5 citrate buffer without calcium chloride. Further details about membrane sample preparation and 3-¹⁹F-amantadine synthesis are given in Supplementary Note 1.

Solid-state NMR experiments

Most solid-state NMR spectra were measured on a Bruker AVANCE NEO 900 MHz (21.1 T) spectrometer and an Avance II 800 MHz (18.8 T) spectrometer using 3.2 mm HCN probes. ¹³C-¹⁹F REDOR experiments were conducted on an Avance III HD 600 MHz (14.1 T) spectrometer using a 1.9 mm HFX probe. Magic-angle-spinning (MAS) frequencies were 11.8 kHz for 900-MHz experiments and 14 kHz for the 800- and 600-MHz experiments. Radiofrequency (RF) field strengths on the 3.2-mm probes were 50–91 kHz for ¹H, 50–63 kHz for ¹³C and 33–42 kHz for ¹⁵N. RF field strengths on the 1.9-mm MAS probe were 83–130 kHz for ¹H, 62.5 kHz for ¹³C and 71 kHz for ¹⁹F. Sample temperatures are direct readings from the probe thermocouple, whereas actual sample temperatures are 5–15 K higher at the MAS frequencies employed. ¹³C chemical shifts are reported on the tetramethylsilane scale using the adamantane CH₂ chemical shift at 38.48 ppm as an external standard. ¹⁵N chemical shifts are reported on the liquid ammonia scale using the N-acetylvaline peak at 122.00 ppm as an external standard.
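As a quick consistency check on the instrument parameters listed above, the nominal ¹H frequencies can be related to the magnetic field strengths, and the MAS frequencies to rotor periods. The sketch below uses textbook gyromagnetic ratios (rounded, magnitudes only); the function names are hypothetical and nothing here is taken from the study's own scripts.

```python
# Gyromagnetic ratios divided by 2*pi, in MHz per tesla (rounded magnitudes).
GAMMA_BAR_MHZ_PER_T = {
    "1H": 42.577,
    "13C": 10.708,
    "15N": 4.316,
    "19F": 40.078,
}


def larmor_mhz(nucleus, field_tesla):
    """Larmor frequency (MHz) of a nucleus at the given field strength."""
    return GAMMA_BAR_MHZ_PER_T[nucleus] * field_tesla


def rotor_period_us(mas_khz):
    """Duration of one rotor period (microseconds) at a given MAS frequency."""
    return 1000.0 / mas_khz


for field in (21.1, 18.8, 14.1):
    print(f"{field} T -> 1H at {larmor_mhz('1H', field):.0f} MHz")

for mas in (11.8, 14.0):
    print(f"{mas} kHz MAS -> rotor period {rotor_period_us(mas):.1f} us")
```

The computed ¹H frequencies land within a few megahertz of the nominal 900, 800 and 600 MHz labels, as expected for commercial magnets named by their rounded proton frequency.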

2D ¹³C-¹³C correlation experiments were conducted using combined-driven (CORD) mixing⁴² for ¹³C spin diffusion. 2D and 3D ¹⁵N-¹³C correlation spectra, namely, NCACX, NCOCX and CONCA⁴³, were measured on the 900-MHz spectrometer. These experiments used spectrally induced filtering in combination with cross-polarization (SPECIFIC-CP)⁴⁴ for heteronuclear polarization transfer. Water-edited 2D ¹⁵N-¹³Cα correlation spectra were measured under 11.8-kHz MAS^20,21 using ¹H mixing times of 9 ms and 100 ms. 2D ¹⁵N-¹³C correlation spectra were measured using an out-and-back transferred-echo double resonance (TEDOR) pulse sequence on the 800-MHz spectrometer⁴⁵. Intermolecular 2D NHHC correlation spectra⁴⁶ were measured using 0.5 ms and 1 ms ¹H-¹H mixing. 1D and 2D ¹³C-¹⁹F REDOR experiments^19,47,48 were used to measure distances between 4-¹⁹F-Phe-labeled and ¹³C-labeled ETM, and between ¹³C-labeled ETM and 3-¹⁹F-AMT. Detailed parameters for the solid-state NMR experiments are given in Supplementary Table 6. Details for the ¹³C-¹⁹F REDOR simulations and fitting are given in Supplementary Notes.

NMR spectral analysis

NMR spectra were processed in the TopSpin software and chemical shifts were assigned in Sparky⁴⁹. TALOS-N⁵⁰ was used to calculate torsion angles (ϕ, ψ) after converting the ¹³C chemical shifts to the DSS scale. Residue-specific chemical shift differences (Δδ) between drug-bound and apo samples were calculated from the measured ¹³C and ¹⁵N chemical shifts (δ) according to:

$$\Delta\delta = \sqrt{\sum_{C_i}\left(\delta_{C_i}^{\mathrm{drug}} - \delta_{C_i}^{\mathrm{apo}}\right)^2 + \frac{\left(\delta_N^{\mathrm{drug}} - \delta_N^{\mathrm{apo}}\right)^2}{2.5}} \tag{1}$$
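Equation (1) can be evaluated per residue with a few lines of Python. The following is a minimal sketch, not the authors' script: the function name, argument layout and the example shifts are hypothetical, and only the formula itself (squared ¹³C differences summed over the carbons of a residue, plus the squared ¹⁵N difference divided by 2.5) is taken from the text.

```python
import math

def chemical_shift_perturbation(c_drug, c_apo, n_drug, n_apo, n_scale=2.5):
    """Return the perturbation (ppm) of one residue following equation (1).

    c_drug, c_apo: 13C shifts (ppm) of the same carbons, in the same order.
    n_drug, n_apo: backbone 15N shifts (ppm).
    n_scale: divisor applied to the squared 15N difference (2.5 in the text).
    """
    if len(c_drug) != len(c_apo):
        raise ValueError("drug-bound and apo 13C lists must have equal length")
    carbon_term = sum((d - a) ** 2 for d, a in zip(c_drug, c_apo))
    nitrogen_term = (n_drug - n_apo) ** 2 / n_scale
    return math.sqrt(carbon_term + nitrogen_term)

# Hypothetical shifts for one residue (two carbons and one nitrogen).
example = chemical_shift_perturbation([58.3, 30.1], [58.0, 30.5], 119.5, 118.5)
```

With these made-up inputs the carbon term is 0.25 ppm² and the nitrogen term is 0.4 ppm², so the result is the square root of 0.65, about 0.81 ppm.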

2D heatmaps of normalized water-edited 2D NCA spectra were generated using an in-house Python script that removes spectral noise while calculating intensity ratios. The intensities of the 9 ms and 100 ms spin diffusion spectra of the ERGIC-bound ETM were read using the NMRglue package⁵¹. The spectra were noise-filtered by setting any intensity lower than 3.5 times the average noise level in an empty region of the 2D spectrum to zero for the S spectrum and to a large number for the S₀ spectrum²⁴,²⁵, so that noise-dominated points give a ratio near zero. The intensities were divided and scaled by the number of scans to obtain a 2D contour map that reflects the peak intensity ratios between the 9-ms and 100-ms spectra.
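The noise-filtering and ratio step can be sketched as follows. This is an illustration under stated assumptions, not the in-house script: the function and parameter names are hypothetical, the noise levels are passed in rather than measured from an empty spectral region, and plain nested lists stand in for the arrays that a package such as NMRglue would return.

```python
def water_edited_ratio(s, s0, noise_s, noise_s0, scans_s, scans_s0,
                       threshold=3.5, large=1e12):
    """Return the per-point S/S0 ratio map of two 2D spectra.

    s, s0: 2D intensity grids (lists of rows) of equal shape.
    noise_s, noise_s0: average noise level of each spectrum.
    scans_s, scans_s0: number of scans, used to normalize intensities.
    Points below threshold * noise are set to 0 in S and to `large` in S0,
    so that noise-dominated points end up with a ratio close to zero.
    """
    ratio = []
    for row_s, row_s0 in zip(s, s0):
        out_row = []
        for a, b in zip(row_s, row_s0):
            a = a if a >= threshold * noise_s else 0.0
            b = b if b >= threshold * noise_s0 else large
            out_row.append((a / scans_s) / (b / scans_s0))
        ratio.append(out_row)
    return ratio

# Hypothetical 2x2 spectra: S has twice as many scans as S0.
s_demo = [[10.0, 1.0], [8.0, 20.0]]
s0_demo = [[20.0, 30.0], [2.0, 40.0]]
ratio_demo = water_edited_ratio(s_demo, s0_demo, 1.0, 1.0, 2, 1)
```

Replacing weak S₀ points with a large number rather than zero avoids division by zero and keeps noise in the denominator from producing spuriously high ratios.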

The water accessibility data for the high-pH influenza BM2 proton channel (Fig. 2f) were originally measured in 2D ¹³C-¹³C correlation spectra with 4 ms (S) and 100 ms (S₀) ¹H spin diffusion²⁵. To allow comparison with the ETM spectra measured at 9 ms and 100 ms mixing, we scaled the BM2 S (4 ms)/S₀ (100 ms) ratios by the integrated aliphatic intensity ratio of 1.976 between the 1D BM2 water-edited spectra measured with 9 ms and 4 ms of mixing. This scaling factor was verified to be accurate for two resolved sites, Thr24 and Gly26, in the 1D ¹³C spectra.
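Written out, the rescaling amounts to a single multiplication. The notation below is introduced here for illustration only and does not appear in the original text: $r$ denotes an S/S₀ intensity ratio, with the superscript giving the ¹H mixing time of the S spectrum.

$$r_{\mathrm{BM2}}^{9\,\mathrm{ms}} \approx 1.976 \times r_{\mathrm{BM2}}^{4\,\mathrm{ms}}$$

For example, a hypothetical BM2 site with a measured 4-ms ratio of 0.20 would be compared with the ETM data as $1.976 \times 0.20 \approx 0.40$.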

XPLOR-NIH structure calculations and analysis

Initial structure calculation using ambiguous interhelical restraints, where each helix can contact both neighboring helices, did not converge. Thus, we generated parallel pentameric models to specify the direction of ¹³C-¹⁹F and NHHC distance restraints where possible. The models take into account the water- and lipid-edited spectra to qualitatively identify the pore- versus lipid-facing orientations of the residues. The best-case ideal helix model (Extended Data Fig. 7a), with 3.5 residues per helical turn, places Asn15 at the pore-facing d position and Phe20 at the lipid-facing b position, in agreement with the water-edited spectra. However, the model conflicts substantially with other data. For example, Thr35(c) (Thr35 at position c) and Leu31(f) are lipid-facing in this model, which contradicts the water-edited spectra; Val29(d) and Phe26(a) are pore-facing, which contradicts the water- and lipid-edited spectra. The arc of Phe20(b), Phe23(e) and Phe26(a) on the helical wheel makes interhelical Phe–Phe contacts unlikely, thus contradicting the ¹³C-¹⁹F distance data.
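The heptad assignments quoted for the ideal-helix model follow from counting residues modulo seven from the anchor Asn15(d). The short sketch below reproduces that bookkeeping; the function name and defaults are hypothetical, and it covers only the ideal 3.5-residues-per-turn case, not the models that introduce a helix twist.

```python
HEPTAD = "abcdefg"

def heptad_position(residue, anchor_residue=15, anchor_position="d"):
    """Return the heptad letter of `residue` in an ideal helix.

    The register is fixed by one anchor residue (Asn15 at position d
    in the ideal-helix model) and advances one letter per residue.
    """
    offset = HEPTAD.index(anchor_position)
    return HEPTAD[(residue - anchor_residue + offset) % 7]

# Positions of the residues discussed in the text.
ideal_model = {res: heptad_position(res) for res in (20, 23, 26, 29, 31, 35)}
```

With Asn15 fixed at d, this gives Phe20(b), Phe23(e), Phe26(a), Val29(d), Leu31(f) and Thr35(c), matching the positions listed for the ideal-helix model.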

Since the ideal-helix geometry cannot agree with all experimental data, we sought better models by including slight deviations from an ideal helix. We turned to the measured chemical shifts to determine where such a deviation is most likely to occur. The Cβ chemical shift of L21 is 1.4 ppm downfield from the average of all other helical Leu residues (Extended Data Fig. 3b), suggesting that the helix is disordered between Phe20 and Phe23. Indeed, such a disorder was already noted in previous solution NMR data¹⁰. We generated four alternative pentamer models with varying positions and degrees of helix disorder (Extended Data Fig. 7b-e and Supplementary Table 4). Only one model (model 5), generated by a small rotation angle advance of ~50° at Phe23, adequately reproduces all key features of the experimental data. This model places Asn15(d) and Val25(d) at the same pore-facing position and the three aromatic residues at the arc of Phe20(c), Phe23(f) and Phe26(b). This model was then used to disambiguate the NHHC and ¹³C-¹⁹F distance restraints (Supplementary Tables 2 and 3) by mainly considering only residues that are fewer than four residues away in the primary sequence and that are in close proximity between two helical wheels. With this approach, 42 of the 87 interhelical restraints were set to be unambiguous. In principle, the handedness of the helical bundle can be determined from the registry of interhelical contacts if the positions of interfacial residues are known. However, remaining ¹³C and ¹⁵N chemical-shift overlap among the many hydrophobic residues precluded unequivocal determination of the handedness of the helical bundle. Orthogonal experimental constraints, such as backbone N-H bond orientations, which would directly probe the helix tilt angle, will be needed to obtain a higher-resolution structure.

As has been previously described²⁵, the ETM structure was calculated using XPLOR-NIH²⁶ hosted on NMRbox⁵². The calculation contained two stages. In the first stage, five extended ETM monomers were placed in a parallel pentamer geometry with each monomer located 20 Å from the center of the pentamer. A total of 120 independent simulated annealing runs were performed with 5,000 steps of torsion angle dynamics at 5,000 K, followed by annealing to 20 K in decrements of 20 K with 100 steps at each temperature. After the annealing, energy minimizations in torsion angle and Cartesian coordinates were carried out. The five monomers were restrained to be identical in the annealing step using the non-crystallographic symmetry term PosDiffPot and the translational symmetry term DistSymmPot. Chemical-shift-derived torsion angles (ϕ, ψ) predicted by TALOS-N were implemented with the dihedral-angle restraint term CDIH, with ranges set to the higher value between twice the TALOS-N predicted uncertainty and 20°. Measured interhelical distance restraints were implemented using the NOE potential. Distance upper limits were set to 9.0 Å and 11.5 Å for 500 μs and 1,000 μs of ¹H-¹H mixing for the NHHC constraints. Negative REDOR contacts, that is, ¹³C sites without dephasing, were implemented as two NOEs: one to each neighboring helix. Implicit hydrogen bonds using the hydrogen-bonding database potential term HBDB were implemented during annealing to favor the formation of the α-helical conformation. Finally, standard XPLOR potentials were used to restrain the torsion angles using a structural database with the term TorsionDB, and standard bond angles and lengths were set with terms BOND, ANGL, IMPR and RepelPot. The structures were sorted by energy, using all the potentials in the calculation. The scales for all potentials are given in Supplementary Table 5.
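Two numerical rules in this stage, the cooling schedule and the dihedral restraint range, are easy to make concrete. The sketch below is plain Python and does not use the XPLOR-NIH API; the function names are hypothetical, and it assumes that both the starting and final temperatures are included in the schedule, which the text does not state explicitly.

```python
def annealing_temperatures(t_start, t_end, decrement):
    """List the temperatures (K) visited when cooling in fixed decrements.

    Assumption: both t_start and t_end are part of the schedule.
    """
    temperatures = []
    t = t_start
    while t >= t_end:
        temperatures.append(t)
        t -= decrement
    return temperatures

def dihedral_range(talos_uncertainty_deg, floor_deg=20.0):
    """Restraint range (degrees): the larger of twice the TALOS-N
    uncertainty and a 20-degree floor, as described in the text."""
    return max(2.0 * talos_uncertainty_deg, floor_deg)

# First-stage schedule: 5,000 K to 20 K in 20 K decrements, 100 steps each.
stage1 = annealing_temperatures(5000, 20, 20)
stage1_cooling_steps = 100 * len(stage1)
```

Under this assumption the first stage visits 250 temperatures, or 25,000 cooling steps after the initial 5,000 high-temperature steps. A TALOS-N uncertainty of 5° gives a range of 20°, whereas 15° gives 30°.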

In the second stage, the three lowest-energy structures from the annealing stage were used as independent inputs for structure refinement. A total of 64 independent XPLOR-NIH runs from each of the three starting structures were performed with 5,000 steps of torsion angle dynamics at 1,000 K followed by annealing to 20 K in decrements of 10 K with 100 steps at each temperature. This was followed by energy minimizations in torsion angle and Cartesian coordinates. All the potentials employed in annealing were also used during refinement, with two additions. The ¹³C-¹³C correlations were implemented as intramolecular NOE restraints with an upper limit of 8.0 Å. Inter-residue cross-peaks to long hydrophobic side chains, such as Phe, Ile, and Leu, were sometimes violated. Consequently, the upper limits for these 5% of restraints were increased to 12.0 Å. Explicit hydrogen bonds for residues Ile13 (hydrogen-bonded to Val17)–Asn15 (hydrogen-bonded to Leu19) and Phe23 (hydrogen-bonded to Leu27)–Thr30 (hydrogen-bonded to Leu34) were substituted for implicit hydrogen bonds using the same HBDB potential. Finally, the scales of the NOE, Repel and TorsionDB potentials were increased (Supplementary Table 5). All 192 structures from the three independent runs were pooled and sorted using the CDIH, NOE, HBDB, BOND, ANGL, IMPR, Repel and Repel14 potentials, while excluding PosDiffPot, DistSymmPot and TorsionDB potentials. The ten structures with the lowest energies across the specified potentials were included in the final structural ensemble. Where single-structure images are shown, the most representative conformer, selected as the model with the lowest average r.m.s.d. for residues 10–36 with respect to all the other structural models, is shown. The Ramachandran plot statistics for the final structure ensemble are as follows: 93% of residues are in favored regions, 5% of residues are in allowed regions and 2% of residues are in disallowed regions. 
The only outlier is Leu37, which is outside the TM helix, near the C terminus.
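Selecting the most representative conformer, as described above, amounts to finding the model with the lowest mean r.m.s.d. to all other models. A minimal sketch follows, assuming the pairwise r.m.s.d. values (for residues 10–36, after superposition) have already been computed by some other tool; the function name and the example matrix are hypothetical.

```python
def most_representative(rmsd_matrix):
    """Return (index, mean_rmsd) of the model closest to all the others.

    rmsd_matrix: square, symmetric list of lists of pairwise r.m.s.d.
    values (angstroms) with zeros on the diagonal.
    """
    n = len(rmsd_matrix)
    if n < 2:
        raise ValueError("need at least two models")
    means = [
        sum(row[j] for j in range(n) if j != i) / (n - 1)
        for i, row in enumerate(rmsd_matrix)
    ]
    best = min(range(n), key=means.__getitem__)
    return best, means[best]

# Hypothetical r.m.s.d. matrix for three models.
demo_matrix = [
    [0.0, 1.0, 2.0],
    [1.0, 0.0, 1.5],
    [2.0, 1.5, 0.0],
]
best_index, best_mean = most_representative(demo_matrix)
```

In this toy matrix the second model (index 1) has the lowest mean r.m.s.d., 1.25 Å, and would be the one shown in single-structure images.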

Graphical images depicting the structures were generated in PyMOL v2.3.4. The reported channel radii were calculated using the HOLE program⁵³, and represent the radii of the largest sphere that can be accommodated from exclusion of the van der Waals diameter of all atoms at each XY plane along the Z channel coordinate, which is collinear with the bilayer normal and the putative direction of ion permeation. The cutoff radius for the calculation was 5 Å. The HOLE output was visualized in PyMOL by setting the van der Waals radius of the HOLE-generated spheres ‘SPH’ to the B-factor values of the SPH output. Details of HMA docking to ETM are given in Supplementary Notes.
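The quantity reported as the channel radius can be illustrated with a simplified calculation: for a given point, the largest sphere that fits is limited by the nearest atomic van der Waals surface. The sketch below is not the HOLE program and omits its search for the best sphere center within each plane; the function name, the fixed on-axis center and the toy atom ring are all assumptions made for illustration.

```python
import math

def pore_radius_at(center, atoms):
    """Radius (angstroms) of the largest sphere centered at `center`
    that does not overlap any atom.

    center: (x, y, z) coordinates of the sphere center.
    atoms: iterable of (x, y, z, vdw_radius) tuples.
    """
    return min(
        math.dist(center, (x, y, z)) - vdw_radius
        for x, y, z, vdw_radius in atoms
    )

# Toy example: four carbon-like atoms (1.7 angstrom radius) on a ring
# 5 angstroms from the channel axis, probed at the ring center.
ring = [(5.0, 0.0, 0.0, 1.7), (-5.0, 0.0, 0.0, 1.7),
        (0.0, 5.0, 0.0, 1.7), (0.0, -5.0, 0.0, 1.7)]
radius_demo = pore_radius_at((0.0, 0.0, 0.0), ring)
```

Here the radius is 5.0 − 1.7 = 3.3 Å. Repeating such a calculation at successive positions along the Z coordinate gives a radius profile of the kind that HOLE reports.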

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

NMR chemical shifts, distance and torsion-angle restraints have been deposited in the Biological Magnetic Resonance Bank (BMRB) with ID number 30795. The structural coordinates for ETM have been deposited in the Protein Data Bank with the accession code 7K3G.

Code availability

NMR pulse programs and in-house Python scripts used for structure calculation and data analysis such as water-edited spectral analysis are available upon request.

References

Weiss, S. R. & Navas-Martin, S. Coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus. Microbiol. Mol. Biol. Rev. 69, 635–664 (2005).
Wilson, L., McKinlay, C., Gage, P. & Ewart, G. SARS coronavirus E protein forms cation-selective ion channels. Virology 330, 322–331 (2004).
Verdiá-Báguena, C. et al. Coronavirus E protein forms ion channels with functionally and structurally-involved membrane lipids. Virology 432, 485–494 (2012).
Schoeman, D. & Fielding, B. C. Coronavirus envelope protein: current knowledge. Virol. J. 16, 69 (2019).
Nieto-Torres, J. L. et al. Severe acute respiratory syndrome coronavirus E protein transports calcium ions and activates the NLRP3 inflammasome. Virology 485, 330–339 (2015).
Li, Y., Surya, W., Claudine, S. & Torres, J. Structure of a conserved Golgi complex-targeting signal in coronavirus envelope proteins. J. Biol. Chem. 289, 12535–12549 (2014).
Torres, J. et al. Conductance and amantadine binding of a pore formed by a lysine-flanked transmembrane domain of SARS coronavirus envelope protein. Protein Sci. 16, 2065–2071 (2007).
Hong, M. & DeGrado, W. F. Structural basis for proton conduction and inhibition by the influenza M2 protein. Protein Sci. 21, 1620–1633 (2012).
Cady, S. D. et al. Structure of the amantadine binding site of influenza M2 proton channels in lipid bilayers. Nature 463, 689–692 (2010).
Pervushin, K. et al. Structure and inhibition of the SARS coronavirus envelope protein ion channel. PLoS Pathog. 5, e1000511 (2009).
DeDiego, M. L. et al. A severe acute respiratory syndrome coronavirus that lacks the E gene is attenuated in vitro and in vivo. J. Virol. 81, 1701–1713 (2007).
Nieto-Torres, J. L. et al. Severe acute respiratory syndrome coronavirus envelope protein ion channel activity promotes virus fitness and pathogenesis. PLoS Pathog. 10, e1004077 (2014).
Torres, J., Wang, J., Parthasarathy, K. & Liu, D. X. The transmembrane oligomers of coronavirus protein E. Biophys. J. 88, 1283–1290 (2005).
Parthasarathy, K. et al. Expression and purification of coronavirus envelope proteins using a modified β-barrel construct. Protein Expr. Purif. 85, 133–141 (2012).
Arbely, E. et al. A highly unusual palindromic transmembrane helical hairpin formed by SARS coronavirus E protein. J. Mol. Biol. 341, 769–779 (2004).
Surya, W., Li, Y. & Torres, J. Structural model of the SARS coronavirus E channel in LMPG micelles. Biochim. Biophys. Acta 1860, 1309–1317 (2018).
Torres, J. et al. Model of a putative pore: the pentameric α-helical bundle of SARS coronavirus E protein in lipid bilayers. Biophys. J. 91, 938–947 (2006).
Parthasarathy, K. et al. Structural flexibility of the pentameric SARS coronavirus envelope protein ion channel. Biophys. J. 95, L39–L41 (2008).
Gullion, T. & Schaefer, J. Rotational echo double resonance NMR. J. Magn. Reson. 81, 196–200 (1989).
Dregni, A. J. et al. In vitro 0N4R tau fibrils contain a monomorphic β-sheet core enclosed by dynamically heterogeneous fuzzy coat segments. Proc. Natl Acad. Sci. USA 116, 16357–16366 (2019).
Williams, J. K. & Hong, M. Probing membrane protein structure using water polarization transfer solid-state NMR. J. Magn. Reson. 247, 118–127 (2014).
Lesage, A., Emsley, L., Penin, F. & Bockmann, A. Investigation of dipolar-mediated water-protein interactions in microcrystalline Crh by solid-state NMR spectroscopy. J. Am. Chem. Soc. 128, 8246–8255 (2006).
Doherty, T. & Hong, M. 2D ¹H-³¹P solid-state NMR studies of the dependence of inter-bilayer water dynamics on lipid headgroup structure and membrane peptides. J. Magn. Reson. 196, 39–47 (2009).
Dregni, A. J., Duan, P. & Hong, M. Hydration and dynamics of full-length tau amyloid fibrils investigated by solid-state nuclear magnetic resonance. Biochemistry 59, 2237–2248 (2020).
Mandala, V. S., Loftis, A. R., Shcherbakov, A. A., Pentelute, B. L. & Hong, M. Atomic structures of closed and open influenza B M2 proton channel reveal the conduction mechanism. Nat. Struct. Mol. Biol. 27, 160–167 (2020).
Schwieters, C. D., Kuszewski, J. J., Tjandra, N. & Clore, G. M. The Xplor-NIH NMR molecular structure determination package. J. Magn. Reson. 160, 65–73 (2003).
Choma, C., Gratkowski, H., Lear, J. D. & DeGrado, W. F. Asparagine-mediated self-association of a model transmembrane helix. Nat. Struct. Biol. 7, 161–166 (2000).
Wilson, L., Gage, P. & Ewart, G. Hexamethylene amiloride blocks E protein ion channels and inhibits coronavirus replication. Virology 353, 294–306 (2006).
Chipot, C. et al. Perturbations of native membrane protein structure in alkyl phosphocholine detergents: a critical assessment of NMR and biophysical studies. Chem. Rev. 118, 3559–3607 (2018).
Park, S. H. et al. Three-dimensional structure of the channel-forming trans-membrane domain of virus protein “u” (Vpu) from HIV-1. J. Mol. Biol. 333, 409–424 (2003).
Lu, J. X., Sharpe, S., Ghirlando, R., Yau, W. M. & Tycko, R. Oligomerization state and supramolecular structure of the HIV-1 Vpu protein transmembrane segment in phospholipid bilayers. Protein Sci. 19, 1877–1896 (2010).
Cady, S. D., Goodman, C., Tatko, C. D., DeGrado, W. F. & Hong, M. Determining the orientation of uniaxially rotating membrane proteins using unoriented samples: a ²H, ¹³C, and ¹⁵N solid-state NMR investigation of the dynamics and orientation of a transmembrane helical bundle. J. Am. Chem. Soc. 129, 5719–5729 (2007).
Hou, X., Pedi, L., Diver, M. M. & Long, S. B. Crystal structure of the calcium release-activated calcium channel Orai. Science 338, 1308–1313 (2012).
Xu, C. et al. Computational design of transmembrane pores. Nature 585, 129–134 (2020).
Lee, C. et al. The lysosomal potassium channel TMEM175 adopts a novel tetrameric architecture. Nature 547, 472–475 (2017).
Kane Dickson, V., Pedi, L. & Long, S. B. Structure and insights into the function of a Ca²⁺-activated Cl⁻ channel. Nature 516, 213–218 (2014).
Yang, T. et al. Structure and selectivity in bestrophin ion channels. Science 346, 355–359 (2014).
Nieto-Torres, J. L. et al. Subcellular location and topology of severe acute respiratory syndrome coronavirus envelope protein. Virology 415, 69–82 (2011).
Duart, G. et al. SARS-CoV-2 envelope protein topology in eukaryotic membranes. Open Biol. 10, 200209 (2020).
Abramson, A. et al. An ingestible self-orienting system for oral delivery of macromolecules. Science 363, 611–615 (2019).
Lehninger, A. L., Nelson, D. L. & Cox, M. M. Principles of Biochemistry (Worth Publishers, 1993).
Hou, G., Yan, S., Trebosc, J., Amoureux, J. P. & Polenova, T. Broadband homonuclear correlation spectroscopy driven by combined R2_n^v sequences under fast magic angle spinning for NMR structural analysis of organic and biological solids. J. Magn. Reson. 232, 18–30 (2013).
Rienstra, C. M., Hohwy, M., Hong, M. & Griffin, R. G. 2D and 3D ¹⁵N-¹³C-¹³C NMR chemical shift correlation spectroscopy of solids: assignment of MAS spectra of peptides. J. Am. Chem. Soc. 122, 10979–10990 (2000).
Baldus, M., Petkova, A. T., Herzfeld, J. & Griffin, R. G. Cross polarization in the tilted frame: assignment and spectral simplification in heteronuclear spin systems. Mol. Phys. 95, 1197–1207 (1998).
Hong, M. & Griffin, R. G. Resonance assignment for solid peptides by dipolar-mediated ¹³C/¹⁵N correlation solid-state NMR. J. Am. Chem. Soc. 120, 7113–7114 (1998).
Lange, A., Luca, S. & Baldus, M. Structural constraints from proton-mediated rare-spin correlation spectroscopy in rotating solids. J. Am. Chem. Soc. 124, 9704–9705 (2002).
Shcherbakov, A. A. & Hong, M. Rapid measurement of long-range distances in proteins by multidimensional ¹³C-¹⁹F REDOR NMR under fast magic-angle spinning. J. Biomol. NMR 71, 31–43 (2018).
Shcherbakov, A. A., Roos, M., Kwon, B. & Hong, M. Two-dimensional ¹⁹F-¹³C correlation NMR for ¹⁹F resonance assignment of fluorinated proteins. J. Biomol. NMR 74, 193–204 (2020).
Lee, W., Tonelli, M. & Markley, J. L. NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy. Bioinformatics 31, 1325–1327 (2014).
Shen, Y. & Bax, A. Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks. J. Biomol. NMR 56, 227–241 (2013).
Helmus, J. J. & Jaroniec, C. P. Nmrglue: an open source Python package for the analysis of multidimensional NMR data. J. Biomol. NMR 55, 355–367 (2013).
Maciejewski, M. W. et al. NMRbox: a resource for biomolecular NMR computation. Biophys. J. 112, 1529–1534 (2017).
Smart, O. S., Neduvelil, J. G., Wang, X., Wallace, B. A. & Sansom, M. S. HOLE: a program for the analysis of the pore dimensions of ion channel structural models. J. Mol. Graph 14, 354–360 (1996).

Acknowledgements

This research is funded by National Institutes of Health grant GM088204 to M.H. and the MIT School of Science Sloan Fund to V.S.M. and A.A.S. The experiments made use of NMR spectrometers at the MIT/Harvard Center for Magnetic Resonance, supported by NIH grant P41 GM132079. Structure calculation made use of NMRbox, supported by NIH grant P41 GM111135.

Author information

Authors and Affiliations

Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
Venkata S. Mandala, Matthew J. McKay, Alexander A. Shcherbakov, Aurelio J. Dregni & Mei Hong
Department of Pharmaceutical Chemistry, National and Kapodistrian University of Athens, Panepistimioupolis Zografou, Athens, Greece
Antonios Kolocouris

Authors

Venkata S. Mandala
Matthew J. McKay
Alexander A. Shcherbakov
Aurelio J. Dregni
Antonios Kolocouris
Mei Hong

Contributions

M.H. designed the project and supervised experiments and data analysis. V.S.M., M.J.M. and A.A.S. cloned, expressed and purified the protein and conducted the solid-state NMR experiments. V.S.M. and M.J.M. assigned and analyzed the spectra. V.S.M. calculated the structure with contributions from A.J.D. and M.J.M. A.A.S. conducted ¹⁹F NMR experiments, simulations and docking. A.K. synthesized fluorinated amantadine. All authors discussed the results of the study and wrote the paper.

Corresponding author

Correspondence to Mei Hong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Peer reviewer reports are available. Florian Ullrich and Inês Chen were the primary editors on this article and managed its editorial process and peer review in collaboration with the rest of the editorial team. Nature Structural and Molecular Biology thanks Tatyana Polenova and Jaume Torres for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Cloning, purification and characterization of ETM.

a, Amino acid sequence of SUMO-tagged ETM. b, SDS-PAGE gel showing purification of ETM by nickel affinity column chromatography. The flowthrough contains all soluble proteins that have low affinity for nickel. The column was washed with 30 mM imidazole, and SUMO-ETM (18 kDa band) was eluted at >90% purity with 250 mM imidazole. High molecular-weight SUMO-ETM oligomers are visible as a minor species. ETM was cleaved from the SUMO tag using SUMO protease. c, Preparative reverse-phase HPLC chromatogram after protease cleavage. ETM elutes at 37.5 min. d, MALDI mass spectrum of purified U-¹³C, ¹⁵N labeled ETM. e, MALDI mass spectrum of purified 4-¹⁹F-Phe labeled ETM. The measured masses are in good agreement with the theoretical masses. 83% of the 4-¹⁹F-Phe labeled ETM monomers have all three Phe residues fluorinated, indicating a per-site labeling efficiency of 94%.
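The per-site efficiency quoted in panel e follows from the fraction of fully labeled monomers if the three Phe sites are assumed to be fluorinated independently and with equal probability. That independence assumption is ours for this sketch, as is the function name.

```python
def per_site_efficiency(fully_labeled_fraction, n_sites):
    """Per-site labeling probability p such that p ** n_sites equals the
    fraction of molecules labeled at every site (independent sites assumed)."""
    return fully_labeled_fraction ** (1.0 / n_sites)

# 83% of monomers carry fluorine at all three Phe residues.
p_site = per_site_efficiency(0.83, 3)
```

The cube root of 0.83 is about 0.94, consistent with the 94% per-site efficiency stated in the caption.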

Extended Data Fig. 2 Effects of temperature and membrane composition on ETM structure.

a, ¹³C and ¹⁵N CP-MAS spectra of ERGIC-bound ETM. The spectra show high sensitivity and resolution, indicating a well ordered and rigid protein. b, ¹³C and ¹⁵N CP-MAS spectra of ETM in DMPX membranes from 303 K to 263 K. The spectral intensities and linewidths are insensitive to temperature, indicating that the protein is mostly immobilized at ambient temperature. c, ¹³C direct-polarization (DP) spectra of DMPX-bound ETM. The E8 sidechain carboxyl chemical shift changes between high and low pH, indicating that this residue is protonated at low pH. d-f, 2D ¹⁵N-¹³C (left) and ¹³C-¹³C (right) correlation spectra of ETM at high and low temperatures and in ERGIC versus DMPX membranes. Yellow rectangles highlight peaks with clear chemical shift or intensity changes. d, 2D spectra of ERGIC-bound ETM (orange) at 293 K and DMPX-bound ETM at 303 K (green). The chemical shifts are similar, indicating that the protein conformation is unaffected by the presence of POPS, POPI and cholesterol. T11 and L12 signals are not detected in the ERGIC sample at this temperature, suggesting that the N-terminus is mobile under these conditions. e, 2D spectra of ERGIC-bound ETM at 293 K (orange) and 263 K (blue). Moderate chemical shift changes are observed for C-terminal residues from T35 to R38, while the I13 signal is not visible at low temperature. f, 2D spectra of DMPX-bound ETM at 303 K (green) and 263 K (purple). The C-terminal residues exhibit temperature-dependent chemical shifts, similar to the ERGIC-bound peptide. The N-terminal residues of T9 to I13 do not exhibit signals at 263 K, indicating that the N-terminus undergoes intermediate-timescale motion at this temperature. Thus, the C-terminal conformation is temperature-dependent while the N-terminus is dynamic at high temperature.

Extended Data Fig. 3 Chemical shift assignment and secondary structure of ETM.

a, Representative strips from 3D NCACX (magenta), CONCA (green) and NCOCX (blue) spectra of ERGIC-membrane bound ETM. These spectra allow full assignment of the ¹³C and ¹⁵N chemical shifts. b, Cα (blue) and Cβ (orange) secondary chemical shifts compared to random coil chemical shifts. Most residues show positive Cα and negative Cβ secondary shifts, indicating an α-helical conformation. c, (φ, ψ) torsion angles calculated using TALOS-N. Residues G10 to L34 show α-helical conformation. Error bars represent the precision of the TALOS-N prediction, defined as one standard deviation for the (φ, ψ) angles among the best-matched peptides for each residue.

Extended Data Fig. 4 Effects of pH and ions on the chemical shifts of DMPX-membrane bound ETM. Where cations are present, the ion concentration is 5 mM.

a, 2D ¹⁵N-¹³Cα correlation spectra of high-pH ETM with 5 mM NaCl and low-pH ETM with 5 mM CaCl₂. Chemical shift changes are observed for C-terminal residues such as R38, L37 and L34 (yellow highlight). b, 2D ¹³C-¹³C correlation spectra of low-pH ETM with CaCl₂ and high-pH ETM with NaCl. c, 2D ¹³C-¹³C correlation spectrum of low-pH ETM with CaCl₂ and low-pH ETM without salt. These spectra indicate that the chemical shift changes mainly result from pH changes.

Extended Data Fig. 5 Additional ¹³C-¹⁹F REDOR spectra and water-edited spectra to determine the interhelical packing of ETM.

a, 2D ¹³C-¹³C correlation spectrum of mixed 4-¹⁹F-Phe labeled and U-¹³C,¹⁵N-labeled ETM (black). The ¹³C chemical shifts of most residues are similar to the ¹³C,¹⁵N-labeled protein (red), indicating that fluorination does not perturb the ETM conformation. F20/23 Cβ, F26 Cβ, and T30 Cγ2 show small chemical shift changes (blue) of 0.3–0.6 ppm. The spectra were measured at 293 K. b, 1D ¹³C-¹⁹F REDOR control (S₀), dephased (S), and difference (ΔS) spectra. The difference peaks result from carbons that are in close proximity to a fluorine in a neighboring helix. The broadband REDOR spectra (left) show both sidechain and backbone ¹³C signals whereas the Cα-selective REDOR spectra (right) detect only Cα signals. c, Representative ¹³C-¹⁹F REDOR dephasing curves for broadband and Cα-selective C-F REDOR spectra. The S/S₀ values have been corrected for the isotopic dilution factor (50%) and the peak-overlap factor. Best-fit distance curves are shown as solid lines, and lower and upper distance bounds are shown as dashed lines. Error bars represent random uncertainty of the measured S/S₀ values, which were propagated from the signal-to-noise ratios of the S₀ and S spectra. d, Water-edited 2D ¹⁵N-¹³Cα correlation spectra to detect well hydrated residues. The spectra were measured at 293 K under 11.8 kHz MAS using ¹H-¹H mixing times of 9 ms (red) and 100 ms (blue).

Extended Data Fig. 6 Inter-residue correlations obtained from 250 ms 2D ¹³C spin diffusion spectra of ERGIC-membrane bound ETM.

a, Representative strips from a well-resolved 3D NCACX spectrum recorded with 250 ms ¹³C spin diffusion. Inter-residue cross peaks are assigned in black and intra-residue resonances are marked in blue. b, Overlay of 2D ¹³C-¹³C correlation spectra measured with 250 ms mixing (black) and 20 ms (orange). Representative inter-residue cross peaks are assigned in blue. All spectra were measured at 293 K under 11.8 kHz MAS.

Extended Data Fig. 7 ETM pentameric models analyzed to disambiguate the direction of interhelical constraints used for structure calculation.

For each model, the heptad repeat positions (abcdefg) of every residue from L12 to T35 are indicated on the helical wheel for at least one subunit. On the two neighboring helices, residue positions that violate measured ¹³C-¹⁹F correlations are shown in pink, while residue positions that violate the water and lipid accessibility data are shown in green. The positions of Phe residues that satisfy the interhelical contacts are shown in blue. a, Model 1 places N15 at heptad position d without a twist, and is thus an ideal helix model. b, Model 2 places N15 at d with a twist such that F23 moves from position e to c. c, Model 3 places N15 at position e with a twist such that F23 moves from f to b. d, Model 4 places N15 at position a with a twist such that F23 moves from b to c. e, Model 5 places N15 at position a with a twist such that F23 moves from b to f. Model 5 does not violate any experimental data and was thus chosen to disambiguate intermolecular contacts for structure calculation. To make the interhelical contacts explicit, model 5 shows the residue positions for three consecutive helices in the pentamer.

Extended Data Fig. 8 Lipid-bilayer bound SARS-CoV-2 ETM structure (PDB code: 7K3G) and its comparison with ETM structure solved in micelles and with other viroporin structures.

a, N-terminal top views of various residues in the ETM pentamer. Most residues are hydrophobic, including both pore-facing and lipid-facing residues. The most representative structure of the lowest-energy ensemble is shown. b, Top views of representative pore-facing residues in the lowest-energy ensemble. The structure distribution is likely due to a combination of the sparseness of experimental restraints and true protein conformational disorder. c, Comparison of the ERGIC-membrane bound ETM structure model (slate and red) and the LMPG-micelle-bound ETM structure model (gray and salmon)¹⁶. Side view depicts differences in helix orientation and helical bundle handedness, while top view shows differences in pore radii. d, Structural comparison of the pentameric ETM channel, the closed tetrameric influenza BM2 proton channel²⁵, and the pentameric HIV-1 Vpu channel³⁰. The ETM pentamer is longer and tighter than the BM2 and Vpu helical bundles.

Extended Data Fig. 9 Additional docking poses of HMA to SARS-CoV-2 E, shown in side view (left) and N-terminal top view (right).

a, Structure with hexamethylene ring up and HMA vertical, obtained from docking in DMSO. b, Structure with hexamethylene ring down and HMA vertical, obtained from docking in DMSO. c, Structure with HMA across the channel entrance, bridging two helices, obtained from docking in water. The lipid-facing I13 and pore-occluding N15 are shown in sticks to guide the eye.

Extended Data Fig. 10 Effects of amantadine binding on ETM.

The peptide is reconstituted in DMPC:DMPG membranes with an AMT:ETM monomer molar ratio of 8:1. a, 2D ¹⁵N-¹³Cα correlation spectra of apo (blue) and AMT-bound ETM (magenta). The spectra were measured at 305 K under 14 kHz MAS. Zoomed-in areas show peaks with significant CSPs. b, 2D ¹³C-¹³C correlation spectra with 20 ms mixing of apo (blue) and AMT-bound ETM (magenta). The spectra were measured at 263 K. Zoomed-in areas show peaks with significant CSPs. The perturbed residues are concentrated in the N- and C-termini of the protein. c, 1D ¹⁹F direct-polarization spectra of 3F-AMT with and without the peptide in DMPX membranes. The spectra were measured at 270 K under 14 kHz MAS. d, ¹³Cα selective ¹⁹F-dephased REDOR spectra of AMT-bound ETM in DMPC:DMPG membranes. The ΔS spectra show dephasing at 65.5 ppm, 63.6 ppm, 56 ppm and 54 ppm. e, Broadband ¹³C-¹⁹F REDOR spectra. The ΔS spectra show ¹³C dephasing for sidechains that belong to residues that show Cα dephasing in (d).

Supplementary information

Supplementary Information

Supplementary Tables 1–6 and Supplementary Note 1.

Reporting Summary

Peer Review Information

About this article

Cite this article

Mandala, V.S., McKay, M.J., Shcherbakov, A.A. et al. Structure and drug binding of the SARS-CoV-2 envelope protein transmembrane domain in lipid bilayers. Nat Struct Mol Biol 27, 1202–1208 (2020). https://doi.org/10.1038/s41594-020-00536-8

Received: 16 September 2020
Accepted: 28 October 2020
Published: 11 November 2020
Issue Date: December 2020
DOI: https://doi.org/10.1038/s41594-020-00536-8
