Megahertz data collection from protein microcrystals at an X-ray free-electron laser

Grünbein, Marie Luise; Bielecki, Johan; Gorel, Alexander; Stricker, Miriam; Bean, Richard; Cammarata, Marco; Dörner, Katerina; Fröhlich, Lars; Hartmann, Elisabeth; Hauf, Steffen; Hilpert, Mario; Kim, Yoonhee; Kloos, Marco; Letrun, Romain; Messerschmidt, Marc; Mills, Grant; Nass Kovacs, Gabriela; Ramilli, Marco; Roome, Christopher M.; Sato, Tokushi; Scholz, Matthias; Sliwa, Michel; Sztuk-Dambietz, Jolanta; Weik, Martin; Weinhausen, Britta; Al-Qudami, Nasser; Boukhelef, Djelloul; Brockhauser, Sandor; Ehsan, Wajid; Emons, Moritz; Esenov, Sergey; Fangohr, Hans; Kaukher, Alexander; Kluyver, Thomas; Lederer, Max; Maia, Luis; Manetti, Maurizio; Michelat, Thomas; Münnich, Astrid; Pallas, Florent; Palmer, Guido; Previtali, Gianpietro; Raab, Natascha; Silenzi, Alessandro; Szuba, Janusz; Venkatesan, Sandhya; Wrona, Krzysztof; Zhu, Jun; Doak, R. Bruce; Shoeman, Robert L.; Foucar, Lutz; Colletier, Jacques-Philippe; Mancuso, Adrian P.; Barends, Thomas R. M.; Stan, Claudiu A.; Schlichting, Ilme

doi:10.1038/s41467-018-05953-4

Download PDF

Article
Open access
Published: 28 August 2018

Megahertz data collection from protein microcrystals at an X-ray free-electron laser

Marie Luise Grünbein¹,
Johan Bielecki²,
Alexander Gorel¹,
Miriam Stricker¹,
Richard Bean²,
Marco Cammarata ORCID: orcid.org/0000-0003-3013-1186³,
Katerina Dörner²,
Lars Fröhlich⁴,
Elisabeth Hartmann¹,
Steffen Hauf²,
Mario Hilpert¹,
Yoonhee Kim²,
Marco Kloos¹,
Romain Letrun ORCID: orcid.org/0000-0002-0569-5193²,
Marc Messerschmidt^2,5,
Grant Mills^2,6,
Gabriela Nass Kovacs¹,
Marco Ramilli²,
Christopher M. Roome¹,
Tokushi Sato^2,7,
Matthias Scholz⁴,
Michel Sliwa ORCID: orcid.org/0000-0002-5073-8180⁸,
Jolanta Sztuk-Dambietz²,
Martin Weik⁹,
Britta Weinhausen²,
Nasser Al-Qudami²,
Djelloul Boukhelef²,
Sandor Brockhauser ORCID: orcid.org/0000-0002-9700-4803^2,10,
Wajid Ehsan²,
Moritz Emons²,
Sergey Esenov²,
Hans Fangohr²,
Alexander Kaukher²,
Thomas Kluyver²,
Max Lederer²,
Luis Maia²,
Maurizio Manetti²,
Thomas Michelat ORCID: orcid.org/0000-0002-5689-8759²,
Astrid Münnich²,
Florent Pallas²,
Guido Palmer²,
Gianpietro Previtali²,
Natascha Raab²,
Alessandro Silenzi²,
Janusz Szuba²,
Sandhya Venkatesan²,
Krzysztof Wrona²,
Jun Zhu²,
R. Bruce Doak¹,
Robert L. Shoeman¹,
Lutz Foucar¹,
Jacques-Philippe Colletier⁹,
Adrian P. Mancuso²,
Thomas R. M. Barends¹,
Claudiu A. Stan ORCID: orcid.org/0000-0002-9809-4521¹¹ &
…
Ilme Schlichting ORCID: orcid.org/0000-0002-0936-7496¹

Nature Communications volume 9, Article number: 3487 (2018) Cite this article

8221 Accesses
92 Citations
111 Altmetric
Metrics details

Subjects

Abstract

X-ray free-electron lasers (XFELs) enable novel experiments because of their high peak brilliance and femtosecond pulse duration. However, non-superconducting XFELs offer repetition rates of only 10–120 Hz, placing significant demands on beam time and sample consumption. We describe serial femtosecond crystallography experiments performed at the European XFEL, the first MHz repetition rate XFEL, delivering 1.128 MHz X-ray pulse trains at 10 Hz. Given the short spacing between pulses, damage caused by shock waves launched by one XFEL pulse on sample probed by subsequent pulses is a concern. To investigate this issue, we collected data from lysozyme microcrystals, exposed to a ~15 μm XFEL beam. Under these conditions, data quality is independent of whether the first or subsequent pulses of the train were used for data collection. We also analyzed a mixture of microcrystals of jack bean proteins, from which the structure of native, magnesium-containing concanavalin A was determined.

Megahertz pulse trains enable multi-hit serial femtosecond crystallography experiments at X-ray free electron lasers

Article Open access 11 August 2022

Single-shot X-ray absorption spectroscopy at X-ray free electron lasers

Article Open access 24 October 2023

3D atomic structure from a single X-ray free electron laser pulse

Article Open access 01 February 2024

Introduction

X-ray free-electron lasers (XFELs) are novel X-ray sources that provide femtosecond pulses of a peak brilliance that exceeds that of synchrotron sources by nine orders of magnitude. The short duration of the pulses matches the chemical time scale of femtoseconds, allowing the investigation of the dynamics of matter in a time-resolved manner^1,2,3, and enables the analysis of highly radiation-sensitive objects^4,5. The high intensity of the pulses enables the study of weakly scattering objects such as very small crystals^6,7,8 and the coherence of the beam enables the imaging of non-crystalline particles^9,10. In line with these transformative capabilities, demand for beam time at XFELs is very high. For this reason, MHz repetition rate XFELs have been awaited eagerly, since they can deliver X-ray pulses with an up to ~10,000-fold higher maximum repetition rate than the first hard X-ray FEL that came online in 2009¹¹. An increase in pulse rate is expected to speed up data collection, thereby accommodating more users and allowing the collection of enough data to study systems with very weak signals. Also, high pulse rates make far better use of the often highly valuable samples that are generally delivered continuously into the X-ray beam by means of liquid jets, aerosols or molecular beams. However, data collection at MHz rates brings with it many new challenges including the rapid delivery of samples to present fresh material for each pulse, and the development of high frame rate detectors, allowing fast data acquisition and storage¹².

The European XFEL (EuXFEL) in Germany is the first MHz XFEL. Its unique design values of up to 27,000 pulses per second (delivered in 10 trains per second with a 4.5 MHz repetition rate within each train) and a peak brilliance of 5 × 10³³ photons s^–1 mm^–2 mrad⁻² (0.1% bandwidth)^13,14,15 provide unprecedented possibilities for experiments in biology, materials science, chemistry and physics by increasing the average pulse rate almost 300-fold compared to any previous XFEL. Here we report serial femtosecond crystallography (SFX) experiments on protein microcrystals carried out at MHz data acquisition rate at the SPB/SFX instrument of the EuXFEL^16,17 (June 2018, proposal number 2038). As well as addressing the challenges associated with MHz data repetition rates using a model system (lysozyme protein crystals), we investigated a microcrystalline preparation of jack bean proteins precipitated with acetone, a preparation described by James Sumner, who used this technique for the first crystallization of an enzyme (urease) in 1926¹⁸, resulting in a Nobel Prize in 1946¹⁹. That work ultimately showed that enzymes are proteins. We demonstrate here that it is possible to separate the data of the three types of protein crystals in such a microcrystalline mixture of jack bean proteins (urease, concanavalin A and B), and to determine the structures of the two concanavalins, using data collected at the first MHz XFEL.

Results

Injection and data collection

Full exploitation of the MHz repetition rate for SFX data collection requires three conditions to be fulfilled: (i) when using microjets for sample delivery, high-intensity XFEL pulses induce explosions that generate a gap in the liquid jet²⁰ and a fresh section of the running jet must advance to the X-ray interaction region before arrival of the subsequent XFEL pulse. (ii) Sample that is exposed to an XFEL pulse should not have been exposed to (stray) X-rays from the previous pulse, as this can cause radiation damage. (iii) It has been shown that the impact of an XFEL pulse on the liquid jet may launch shock waves travelling upstream of the jet before onset of the explosion²⁰. These may cause mechanical damage to crystals before they even reach the interaction region, which must be prevented for a successful measurement. Issue (i) can be addressed by using a sufficiently high speed of the jet. Challenge (ii) requires more displacement of the sample than the size of the X-ray beam and its wings. Problem (iii) is far less trivial as it is not a local effect but one capable of affecting samples far away from the actual exposure site. It is therefore critical to verify that, at the short spacing between two X-ray pulses at MHz repetition rate (~1 µs), such shock waves do not affect the sample under investigation. While a model exists to predict the jet gap size in case (i)²⁰ and the region affected in case (ii) is given by the X-ray beam properties, no predictions exist for the shock wave damage in case (iii). Therefore, now that the first MHz XFEL has become available, we investigated this issue at current EuXFEL operating parameters, using microcrystals of the model protein lysozyme.

Lysozyme microcrystals were injected in a thin liquid microjet into the XFEL beam^21,22. The sample was probed using trains of 50 XFEL pulses with an 886 ns interval between pulses (1.128 MHz intra-train repetition rate which was the highest available operating rate at the time of the experiment). The time interval between each 50-pulse train was ~100 ms ensuring that the first pulse in a train always probed an undamaged sample. To identify possible damage due to the short pulse intervals within a train, we compared the quality of data collected from the first pulse in a train with those from subsequent pulses in a train.

Diffraction data was recorded at 7.47 and 9.22 keV photon energy using a 1 megapixel Adaptive Gain Integrating Pixel Detector (AGIPD)¹². The X-ray focal size was ~15 µm diameter at 7.47 keV and ~28 µm diameter at 9.22 keV, and each pulse had ~0.9–1.5 mJ pulse energy. The pulse length was likely around 50 fs (FWHM) based on electron beam diagnostics. Since a sufficiently high jet velocity is critical to avoid sample damage by the previous pulses, we used a time-resolved optical imaging system using a fs optical pump laser²³ for jet illumination to observe the effects of the interaction with the XFEL pulses, and to determine the jet velocity during each XFEL pulse train. To avoid radiation damage from the previous pulse, the jet must advance by at least the XFEL beam diameter between pulses. The velocity required to reestablish a jet in time for the next pulse after the XFEL-induced explosion was 40–50 m s⁻¹ and was determined by recording images of the jet 13 or 124 ns after the second pulse in the train. These time delays were chosen such that the jet gaps caused by both the first and second pulses were clearly visible in the optical image. Figure 1a shows a typical jet image for a jet carrying lysozyme crystals. The presence of two distinct gaps displaced by 2.7 beam diameters and separated by a section of contiguous jet indicates that the second pulse indeed intercepted a recovered jet, and the distance between the centers of the gaps is given by the jet velocity multiplied by the in-train pulse interval. Moreover, consecutive X-ray pulses in the same train were observed to probe different crystals, further verifying that jet speed was indeed high enough to transport new sample into the interaction region (Fig. 1b, c).

The presence of shock waves could not be detected optically in the small jets used for crystal delivery since the visibility of shocks decreases rapidly with jet diameter²⁰. Due to the X-ray focus being much larger than the jet diameter, the shock waves were considerably weaker than the ones launched at similar pulse energies by beams smaller than the jet diameter, as shown in Supplementary Fig. 1 for two larger diameter water jets (12 and 27 µm).

Correlating data quality to pulse number within pulse train

The issues of potential radiation damage and shock wave effects were addressed by collecting extensive data sets on the model protein lysozyme at both 7.47 and 9.22 keV photon energy (87,000 and 45,000 indexed images, respectively) to compare the different amounts of energy deposited in the sample at these photon energies, and by then comparing the data obtained from each pulse number within a train for a given photon energy (see Fig. 2 and Supplementary Information). The diffraction data were indexed and integrated using CrystFEL^24,25.

Notably, the quality of the data is very good, allowing observation of the anomalous sulphur signal as shown in Fig. 2a. Statistics for the full data sets are given in Table 1.

Table 1 Data collection and refinement statistics

Full size table

Since the integrity of disulfide bonds is a sensitive marker for radiation damage in lysozyme crystals²⁶, we compared the bond length of the disulfide bridges derived from data collected using the first pulse and those of later pulses, respectively. Since these refined to the same value within experimental error (Supplementary Table 1), significant radiation damage caused by previous pulses appears unlikely.

We then investigated other statistical indicators of diffraction data quality to check for shock wave-induced damage. Importantly, at both 7.47 and 9.22 keV photon energy, the diffraction resolution, as well as other quality measures such as R_work/R_free and CC*²⁷ (Fig. 2b–d and Supplementary Figures 2 and 3) do not show a dependence on the position in the pulse train. In particular, there is no difference in resolution between the diffraction data collected by the first and second X-ray pulse (Fig. 2c, Supplementary Fig. 4), both distributions having a peak at the same resolution (~2.2 Å in case of the 7.47 keV data). The hit rate (the ratio between the number of detected diffraction patterns and the total number of images) shows some variation (Fig. 2e, and Supplementary Fig. 2c), as does the signal-to-noise ratio (Supplementary Fig. 3a, e), R_split (Supplementary Fig. 3c, g) and the Wilson B factor (Supplementary Fig. 3b, f); however, this can be explained by the variation of the pulse intensity over the train, which decreases with pulse number (Fig. 2d, e, Supplementary Fig. 2). For some quality indicators, the data statistics show a discontinuity between the first and the last half of the pulse train (Supplementary Figure 3c, g, Supplementary Figure 5, Supplementary Figure 6 and Supplementary Note 1), for reasons that are currently unclear. Since the pulse energy does not show a similarly abrupt behavior (Fig. 2d, e and Supplementary Fig. 2b, c), possible explanations could include a change in e.g. the calibration and/or offset parameters for the various memory cells of the detector. Indeed, we found that in the detector calibration data recorded using copper fluorescence in a series of flat field measurements, stronger intensity signals occur much more frequently for the second half of the memory cells. This increase in the calibration measurements shows the same trend with the memory cell number as the jump we observed using diffraction data. Moreover, given that radiation- and/or shock wave-induced damage should affect the whole pulse train either as a smooth trend (i.e., as a cumulative effect over the pulses) or as a sudden change from the first to the second pulse in the train, it is highly unlikely that the observed effect is caused by damage to the sample.

Analysis of microcrystals of jack bean proteins

In addition to the comprehensive analysis of SFX data collected for each shot in the pulse train using microcrystals of the well-established model system lysozyme, we were also interested to explore whether the data collected at MHz rate using a novel X-ray detector would be of sufficient quality to permit the analysis of an uncharacterized, complex system. To this end, we collected SFX data of a microcrystalline mixture of jack bean proteins, crystallized using acetone as published previously¹⁸. The microcrystalline slurry contained at least three different crystal forms (Fig. 3a). Due to their small size (on the order of 5–10 μm) the crystals could not be further characterized before the beam time and it thus remained unclear which proteins had in fact crystallized. Indexing with CrystFEL^24,25 revealed the presence of three different crystal lattices corresponding to one protein each. In line with expectations from the purification protocol^28,29 and SDS PAGE of the microcrystals (Supplementary Fig. 7), a search for known unit cell constants from the PDB resulted in identification of diffraction patterns from urease, concanavalin A and concanavalin B crystals. In contrast to the concanavalin A and B microcrystals, which diffracted strongly, urease microcrystals diffracted only to low resolution and with low signal-to-noise ratio, and no structure was refined. In total, 1,333,750 images were collected, and the final number of indexed diffraction patterns was 76,803 for concanavalin A and 23,719 for concanavalin B, with the resolution limit of the Monte-Carlo integrated data being 2.1 Å for concanavalin A and 2.2 Å for concanavalin B. The structures of concanavalin A and B were solved by molecular replacement, and data and refinement statistics are shown in Table 1.

Given the lysozyme results, it appears unlikely that our jack bean protein data are compromised by radiation damage or by shock wave effects in the current experiment. Indeed, the electron density maps are of excellent quality (Fig. 3b, c) and the overall structures are virtually identical to those determined using macroscopic crystals, with core RMSDs on Cα atoms against reference structures of 0.31 Å for concanavalin A (vs. PDB entry 1JBC³⁰) and 0.24 Å for concanavalin B (vs. PDB entry 1CNV³¹). Notably, in contrast with all other concanavalin A structures in the PDB, our structure contains a magnesium ion in one of the main metal binding sites, as expected given the metal content of native concanavalin A³². In structures determined from macroscopic crystals this site is typically occupied by a manganese ion, which increases diffraction quality³³.

Discussion

Taken together, our present data suggest that under the conditions used, neither protein structure nor crystal quality is affected by previous X-ray pulses. While these results are very promising for MHz data collection, it must be noted that the conditions of the experiment were fairly mild in terms of X-ray exposure due to the current large focus spot size of ~15 µm (FWHM) and an intra-train repetition rate of 1.128 MHz. The final design characteristics of the SPB/SFX instrument at EuXFEL foresee an X-ray focus of hundreds of nanometers (nanofocus) to a few micrometers (microfocus) resulting in a much higher fluence, combined with the possibility of a 4.5 MHz intra-train repetition rate^13,14 which will therefore require a reassessment of this issue.

Our results demonstrate that MHz XFELs can be used to collect high-quality serial femtosecond crystallography data, and that under the conditions used in the current experiment, shock waves caused by the interaction between the sample jet and the XFEL pulses do not compromise the data to a measurable extent. The data are good enough to evaluate a previously uncharacterized sample. As shown by the electron density maps of the concanavalin A and B structures determined in this study, the data are of high quality and this is likely to improve as experience with ultrafast detectors such as the AGIPD increases. Moreover, in just one shift (12 h) of data collection, ~77,000 indexed images were collected of concanavalin A and ~24,000 of concanavalin B, despite this being one of the very first experiments at a new facility.

The findings presented here are of interest for a large and continuously growing community of scientists interested in using MHz XFELs. The possibility of recording data >100 times faster than previously possible means that XFEL technology will in the near future become available to many more scientists since the cost of these measurements will decrease greatly as the time spent to acquire the data is reduced. Notably, the techniques we used in our present work and the underlying physics of operating sequential experiments at MHz rates are also directly relevant to other subfields of XFEL science, from physicists interested in extreme interactions between radiation and matter, to chemists focused on ultrafast reactions, and to other scientists interested in “big data” measurements.

Methods

Crystallization

Jack bean meal was obtained as a fine powder from Sigma (J0125). Proteins were extracted following published procedures^28,29. To this end, 50 g of jack bean powder were suspended in 200 ml of phosphate buffer (100 mM, pH 8.0) and stirred for 1 h at 4 °C. After centrifugation, acetone was added to the supernatant to yield 28% and incubated over night at 4 °C. After centrifugation the acetone concentration was increased to 31.6% and stirred for 1 h at RT. Upon further centrifugation the acetone concentration was increased to 50% and stirred 1 h at RT. After a final round of centrifugation the pellet was dissolved in 50 ml 50 mM Tris pH 8.0. This solution was dialyzed for 48 h against water at 4 °C. Rod- and rugby-ball-shaped crystals appeared overnight. After 2 weeks of storage at 4 °C, needle-shaped crystals appeared. Lysozyme microcrystals were grown by rapidly mixing 2.5 ml of protein solution (hen egg white lysoyme (Sigma) in 0.1 M sodium acetate buffer pH 3.0) and 7.5 ml precipitate solution (20% NaCl, 6% PEG 6000, 0.1 M sodium acetate pH 3.0). The mixture was left over night on a slowly rotating wheel shaker. After gravity-induced settling, the crystalline pellet was washed several times in crystal storage solution (10% NaCl, 0.1 M sodium acetate buffer, pH 4.0). The microcrystal size depends on protein concentration and temperature: ~1 µm crystals were obtained using a protein concentration of 32 mg ml⁻¹ at 4 °C²²; the microcrystals were slightly larger (~2 × 2 × 3 µm) when using a protein concentration of 50 mg ml⁻¹ at room temperature³⁴.

Injection

A suspension of microcrystals in their mother liquor was injected into the X-ray interaction region via a liquid microjet produced by a gas dynamic virtual nozzle (GDVN)³⁵ using helium as the focusing gas. The sample flow rate was 30–40 µl min⁻¹, and gas pressure 400–500 psi at the inlet of the GDVN’s gas supply line, corresponding to a flow rate of 140–250 ml min⁻¹. All samples were 20 µm filtered prior to injection, and the suspension was adjusted to contain 10–15% (v/v) settled crystalline material. During injection the sample was kept in a rotating temperature-controlled reservoir (20 °C for lysozyme microcrystals, 4 °C for jack bean protein microcrystals) to prevent crystal settling³⁶.

Jet speed is a parameter of utmost importance for our experiment, determining not only the rate at which sample is replenished in the X-ray interaction region but also the distance of microcrystals probed by consecutive X-ray pulses. Jet speed was therefore measured in situ (described below) during data collection both on a regular basis and for each change in flow conditions (e.g., new sample, crystal concentration, change in liquid flow rate or helium pressure, new GDVN, etc.). To enable comparison of all data collected in a liquid jet, jet speed was always set to a value of 40–50 m s⁻¹, typically ~45 m s⁻¹, by adjusting the sample flow rate and the pressure of the focusing gas.

Imaging the jet

The liquid jet was imaged from an off-axis perspective (orthogonal to both X-rays and jet flow direction) using a 10× infinity-corrected objective in combination with a 200 mm tube lens and a camera (Basler pilot pIA2400-17gm, Basler AG, Germany). The optical resolution of the imaging system, determined with a resolution target (Edmund Optics), was 1.6 µm. During data collection the camera pixels were 2 × 2-binned, resulting in recorded images with a scale of 0.68 µm pixel⁻¹. To illuminate the jet for high time resolution imaging while preventing motion-induced blurring, which may preclude any speed analysis for liquid jets running at the speeds required for MHz data collection³⁷, the femtosecond (fs) SASE1 optical pump laser²³ was employed for jet illumination as described in ref. ²⁰. The fs laser pulse and the camera were triggered by the EuXFEL global trigger (10 Hz) that indicates the arrival of an X-ray pulse train, thus the images were recorded at a set delay relative to the arrival of the pulse train. We set this delay to image the jet shortly after the second pulse generated a visible gap in the jet, thus imaging the effect of the first two pulses on the jet (see Fig. 1a). The optical images were recorded 124 ns (lysozyme) and 13.4 ns (jack bean proteins) after the second XFEL pulse. The imaging time delay was chosen such that the gap made by the second pulse in the jet was clearly visible during the experiment, in order to provide feedback for the proper alignment of the jet (i.e., the best alignment occurred when the gap size was maximized, which indicates that the jet acquired the maximum possible radiation dose). The jets carrying lysozyme, because they were less stable in shape than the urease/concanavalin jets, required a longer imaging delay such that a larger gap size compensated for the jet’s shot-to-shot jitter.

Jet speed determination

In situ measurement of jet speed is constantly required for MHz data collection to verify that the gap produced by one X-ray pulse has moved downstream before arrival of the subsequent X-ray pulse. This is particularly important when flow conditions change. Measuring jet speed is generally done by tracking a feature over time. In our case, the tracked “feature” was the center of the gap produced by the XFEL interaction with the jet, which is flushed downstream by the subsequently injected sample at the speed of the jet itself²⁰. Imaging two gaps in the jet that are produced by two X-ray pulses therefore allows determining jet speed in a single image provided the imaging quality and time resolution is high enough to determine the center of both gaps: If the two gaps are located at distance d from each other, and the corresponding X-ray pulses were spaced by Δt, then jet speed v is obtained as v = d/Δt (see Fig. 1a).

Data collection

The experiment was performed at the SPB/SFX instrument of the EuXFEL^16,17. Ten pulse trains per second consisting of 50 pulses at 1.128 MHz intra-train repetition rate (886 ns spacing between pulses as measured during the experiment) were used for data collection. We note that during our experiment, the EuXFEL could deliver up to 60 pulses per train to SPB/SFX and that the accelerator was indeed running in 60-pulse mode, with the first 10 pulses used for electron orbit feedback and then being sent to the pre-undulator dump, without lasing. While an increase in the electron orbit stability has been observed in the accelerator with this procedure, possible increases in positional or intensity stability of X-rays at the SPB/SFX instrument have not yet been determined. We chose to discard the first 10 pulses before the sample, due to the possibility that the first pulses in a full train currently may have lower intensities due to the feedback loop requiring the first pulses in a train to optimize beam properties over the remainder of a given train. Thus, using the first several pulses in a maximum-size train could have led to an underestimate of sample damage. The photon energy was tuned to 7.47 and 9.22 keV for the lysozyme control data sets and to 7.48 keV for the jack bean protein microcrystals data set. For the lysozyme control data, the crystal size was chosen so as to have the diffraction limit fall within the boundary of the detector, to be able to see any damage effects on the diffraction resolution (1 × 1 × 1 μm for the 7.47 keV data, 2 × 2 × 3 μm for the 9.22 keV data). At the beginning of each shift the X-ray focus size was minimized by adjusting the photon energy, and then measured, by imaging the size of the fluorescent spot produced by single focused XFEL pulses on a YAG screen (Ce:YAG, 20 µm thickness, Crytur) placed at the interaction region. The X-ray focus was ~15 µm for the 7.47 keV lysozyme and the 7.48 keV jack bean protein data and ~28 µm for the 9.22 keV lysozyme data. For each individual X-ray pulse, the pulse energy was recorded by an X-ray gas monitor detector (XGMD) upstream of the experimental hutch. Microcrystals were injected into the X-ray interaction region using a GDVN as described above.

Data processing and structure solution

Data from the AGIP detector was calibrated using the calibration pipeline established at EuXFEL^38,39, with constants provided by the facility and the AGIPD consortium. CASS⁴⁰ was used for online data analysis, hit identification and data preprocessing. Indexing and integration were performed with CrystFEL version 0.6.3. The detector distance was the same for each of the five shifts of data collection. The position of the sample jet was continuously adjusted to maximize the hit rate. The positions and orientations of individual sensor modules of the AGIPD were refined as described¹. The quality of the lysozyme control data was investigated using custom-written python- and mathematica scripts as well as programmes from the CCP4 suite⁴¹. Lysozyme structures were refined against the 7.47 and 9.22 keV datasets using PHENIX⁴² (including simulated annealing), after molecular replacement with PHASER⁴³. In the 7.47 keV structure, 99.2%, 0.8%, and 0.0% of residues are in the preferred, allowed and disallowed regions of the Ramachandran plot, respectively. For the 9.22 keV structure, these numbers are 99.2%, 0.8%, and 0.0%. Complete data and model statistics are given in Table 1.

The concanavalin A data were phased by molecular replacement with PHASER⁴³, using PDB entry 1JBC³⁰ as the search model after removal of the waters and the metal ions. A clear solution (TFZ = 8.9) was found, and the structure was refined by iterative cycles of rebuilding in COOT⁴⁴ and refinement in PHENIX⁴², including simulated annealing. The final model has excellent geometry, with 97.4% of residues in the preferred regions of the Ramachandran plot, 2.6% in allowed regions and 0.0% in disallowed regions. A phased anomalous difference density map was calculated to help identify the metal ions bound to the protein. As expected at the photon energy used for data collection (7.48 keV), clear anomalous difference density (6.9σ peak height) was found at the position of the calcium ion, but none was found for the other metal ion, which was modeled as a magnesium ion based on the lack of anomalous signal, the coordination as well as the expected metal content for native concanavalin A³².

The concanavalin B data were treated by AMBIGATOR to remove the indexing ambiguity^25,45. These data were then also phased by molecular replacement with PHASER using PDB entry 1CNV³¹ as the search model after removal of the waters, again resulting in a very clear solution (TFZ = 12.6). The final structure was obtained using iterative cycles of rebuilding in COOT and refinement in PHENIX (including simulated annealing), resulting in a model with excellent geometry, with 97.9% of residues in preferred regions, 1.7% in allowed regions and 0.4% (1 residue) in disallowed regions of the Ramachandran plot. This latter residue is in a highly strained part of the main chain, involved in a cis peptide known to occur in concanavalin B.

Simulated annealing composite omit maps of representative regions of both structures are shown in Fig. 3. Atomic coordinates and structure factor amplitudes have been deposited in the Protein Data Bank under entry codes 6GW9 (concanavalin A) and 6GWA (concanavalin B). Data and model statistics for both structures are given in Table 1.

Code availability

Analysis scripts are available from the authors upon request.

Data availability

Coordinates and structure factor amplitudes have been deposited in the Protein Data Bank under accession codes 6H0K (lysozyme, 7.47 keV), 6H0L (lysozyme, 9.22 keV), 6GW9 (concanavalin A) and 6GWA (concanavalin B). Data from this experiment have been registered under DOI 10.22003/XFEL.EU-DATA-002038-00. Other data are available from the corresponding authors upon reasonable request.

References

Barends, T. R. M. et al. Direct observation of ultrafast collective motions in CO myoglobin upon ligand dissociation. Science 350, 445–450 (2015).
Article ADS CAS Google Scholar
Pande, K. et al. Femtosecond structural dynamics drives the trans/cis isomerization in photoactive yellow protein. Science 352, 725–729 (2016).
Article ADS CAS Google Scholar
Coquelle, N. et al. Chromophore twisting in the excited state of a photoswitchable fluorescent protein captured by time-resolved serial femtosecond crystallography. Nat. Chem. 10, 31–37 (2018).
Article CAS Google Scholar
Kern, J. et al. Simultaneous femtosecond X-ray spectroscopy and diffraction of photosystem II at room temperature. Science 340, 491–495 (2013).
Article ADS CAS Google Scholar
Suga, M. et al. Native structure of photosystem II at 1.95 angstrom resolution viewed by femtosecond X-ray pulses. Nature 517, 99–103 (2015).
Article ADS CAS Google Scholar
Colletier, J. P. et al. De novo phasing with X-ray laser reveals mosquito larvicide BinAB structure. Nature 539, 43–47 (2016).
Article ADS CAS Google Scholar
Sawaya, M. R. et al. Protein crystal structure obtained at 2.9 Ångstrom resolution from injecting bacterial cells into an X-ray free-electron laser beam. Proc. Natl Acad. Sci. USA 111, 12769–12774 (2014).
Article ADS CAS Google Scholar
Ginn, H. M. et al. Structure of cpv17 polyhedrin determined by the improved analysis of serial femtosecond crystallographic data. Nat. Commun. 6, 6435 (2015).
Article CAS Google Scholar
Seibert, M. M. et al. Single mimivirus particles intercepted and imaged with an X-ray laser. Nature 470, 78–81 (2011).
Article ADS CAS Google Scholar
Kassemeyer, S. et al. Optimal mapping of X-ray laser diffraction patterns into three dimensions using routing algorithms. Phys. Rev. E 88, 042710 (2013).
Article ADS Google Scholar
Bostedt, C. et al. Linac coherent light source: the first five years. Rev. Mod. Phys. 88, 015007 (2016).
Article ADS Google Scholar
Henrich, B. et al. The adaptive gain integrating pixel detector AGIPD: a detector for the European XFEL. Nucl. Instrum. Meth. Phys. Res. A 633, S11–S14 (2011).
Article CAS Google Scholar
Altarelli, M. The European X-ray free-electron laser facility in Hamburg. Nucl. Instrum. Meth. Phys. Res. B 269, 2845–2849 (2011).
Article ADS CAS Google Scholar
Altarelli, M. The European X-ray free-electron laser: toward an ultra-bright, high repetition-rate X-ray source. High Power Laser Sci. Eng. 3, https://doi.org/10.1017/hpl.2015.17 (2015).
Tschentscher, T. et al. Photon beam transport and scientific instruments at the European XFEL. Appl. Sci. 7, 592 (2017).
Article Google Scholar
Mancuso, A. P. Conceptual Design Report: Scientific Instrument Single Particles, Clusters, and Biomolecules (SPB). XFEL.EU Technical Report 1–96, https://doi.org/10.3204/XFEL.EU/TR-2011-007 (2011).
Mancuso, A. P., Aquila, A., Borchers, G., Giewekemeyer, K. & Reimers, N. Technical Design Report: Scientific Instrument Single Particles, Clusters, and Biomolecules (SPB). XFEL.EU Technical Report 1–232, https://doi.org/10.3204/XFEL.EU/TR-2013-004 (2013).
Sumner, J. B. The isolation and crystallization of the enzyme urease. Prelim. Pap. J. Biol. Chem. 69, 435–441 (1926).
CAS Google Scholar
Edsall, J. T. 50 years ago: James Sumner and the crystallization of urease. Trends Biochem. Sci. 1, 21 (1976).
Article Google Scholar
Stan, C. A. et al. Liquid explosions induced by X-ray laser pulses. Nat. Phys. 12, 966–971 (2016).
Article CAS Google Scholar
Chapman, H. N. et al. Femtosecond X-ray protein nanocrystallography. Nature 470, 73–77 (2011).
Article ADS CAS Google Scholar
Boutet, S. et al. High-resolution protein structure determination by serial femtosecond crystallography. Science 337, 362–364 (2012).
Article ADS CAS Google Scholar
Pergament, M. et al. Versatile optical laser system for experiments at the European X-ray free-electron laser facility. Opt. Express 24, 29349–29359 (2016).
Article ADS CAS Google Scholar
White, T. A. et al. CrystFEL: a software suite for snapshot serial crystallography. J. Appl. Crystallogr 45, 335–341 (2012).
Article CAS Google Scholar
White, T. A. et al. Recent developments in CrystFEL. J. Appl. Crystallogr. 49, 680–689 (2016).
Article CAS Google Scholar
Southworth-Davies, R. J., Medina, M. A., Carmichael, I. & Garman, E. F. Observation of decreased radiation damage at higher dose rates in room temperature protein crystallography. Structure 15, 1531–1541 (2007).
Article CAS Google Scholar
Karplus, P. A. & Diederichs, K. Linking crystallographic model and data quality. Science 336, 1030–1033 (2012).
Article ADS CAS Google Scholar
McPherson, A., Geller, J. & Rich, A. Crystallographic studies on concanavalin-B. Biochem. Biophys. Res. Commun. 57, 494–499 (1974).
Article CAS Google Scholar
Jabri, E., Lee, M. H., Hausinger, R. P. & Karplus, P. A. Preliminary crystallographic studies of urease from jack bean and from Klebsiella aerogenes. J. Mol. Biol. 227, 934–937 (1992).
Article CAS Google Scholar
Parkin, S., Rupp, B. & Hope, H. Atomic resolution structure of concanavalin A at 120 K. Acta Crystallogr. D 52, 1161–1168 (1996).
Article CAS Google Scholar
Hennig, M., Jansonius, J. N., Terwisscha Van Scheltinga, A. C., Dijkstra, B. W. & Schlesier, B. Crystal-structure of concanavalin-B at 1.65 Ångstrom resolution - an inactivated chitinase from seeds of Canavalia ensiformis. J. Mol. Biol. 254, 237–246 (1995).
Article CAS Google Scholar
Yariv, J., Kalb, A. J. & Levitzki, A. Interaction of concanavalin A with methyl alpha-d-glucopyranoside. Biochim. Biophys. Acta 165, 303–30 (1968).
Article CAS Google Scholar
Kalb, A. J., Yariv, J., Helliwell, J. R. & Papiz, M. Z. The effect of metal-ion homogeneity on the diffraction limit of orthorhombic (I222) crystals of concanavalin-A. J. Cryst. Growth 88, 537–540 (1988).
Article ADS CAS Google Scholar
Gorel, A. et al. Multi-wavelength anomalous diffraction de novo phasing using a two-colour X-ray Free-Electron Laser with wide tunability. Nat. Commun. 8, 1170 (2017).
Weierstall, U., Spence, J. C. H. & Doak, R. B. Injector for scattering measurements on fully solvated biospecies. Rev. Sci. Instrum. 83, 035108 (2012).
Lomb, L. et al. An anti-settling sample delivery instrument for serial femtosecond crystallography. J. Appl. Crystallogr. 45, 674–678 (2012).
Article CAS Google Scholar
Grünbein, M. L., Shoeman, R. L. & Doak, R. B. Velocimetry of fast microscopic liquid jets by nanosecond dual-pulse laser illumination for megahertz X-ray free-electron lasers. Opt. Express 26, 7190–7203 (2018).
Article ADS Google Scholar
Kuster, M. et al. Detectors and calibration concept for the European XFEL. Synchrotron Radiat. News 27, 35–38 (2014).
Article Google Scholar
Fangohr, H. et al. Data analysis support in KARABO at European XFEL. TUCPA01 (2018). https://doi.org/10.18429/JACoW-ICALEPCS2017-TUCPA01
Foucar, L. et al. CASS-CFEL-ASG software suite. Comput. Phys. Commun. 183, 2207–2213 (2012).
Article ADS CAS Google Scholar
The CCP4 suite. Programs for protein crystallography. Acta Crystallogr. D 50, 760–763 (1994).
Article Google Scholar
Adams, P. D. et al. Phenix: a comprehensive python-based system for macromolecular structure solution. Acta Crystallogr. D 66, 213–221 (2010).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Cryst. 40, 658–674 (2007).
Article CAS Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004).
Article Google Scholar
Brehm, W. & Diederichs, K. Breaking the indexing ambiguity in serial crystallography. Acta Crystallogr. D 70, 101–109 (2014).
Article CAS Google Scholar

Download references

Acknowledgements

We acknowledge European XFEL in Schenefeld, Germany, for provision of X-ray free-electron laser beamtime at the SPB/SFX instrument and thank the instrument group and facility staff for their great assistance. We thank Oleksandr Yefanov for helpful discussions on detector geometry optimization, as well as Harald Sinn and Winfried Decking for helpful discussions on beam- and pulse train properties. We thank the AGIPD consortium for providing some of the calibration constants for the detector. We also thank Melanie Müller for peptide mass fingerprinting. The authors are indebted to the SFX User Consortium for the provision of instrumentation and personnel that has enabled this experiment. This research was supported by the Max Planck Society and travel grants from the European XFEL. J.-P.C., M.W., M.C. and M.Sl. acknowledge support from ANR Grant (BioXFEL). J.-P.C. acknowledges financial support by CEA, CNRS, Université Grenoble Alpes, and the Agence Nationale de la Recherche (Grants Nr. ANR-15-CE18-0005-02, ANR-17-CE11-0018-01). C.A.S. was supported by startup funds from Rutgers University Newark. G.M. acknowledges the support of the Australian Research Council Centre of Excellence in Advanced Molecular Imaging [CE140100011]. G.M. also acknowledges the Australian Nuclear Science and Technology Organisation (ANSTO) in providing financial support for this work. We also acknowledge the travel funding provided by the International Synchrotron Access Program (ISAP) managed by the Australian Synchrotron, part of ANSTO, and funded by the Australian Government.

Author information

Authors and Affiliations

Max Planck Institute for Medical Research, Jahnstrasse 29, 69120, Heidelberg, Germany
Marie Luise Grünbein, Alexander Gorel, Miriam Stricker, Elisabeth Hartmann, Mario Hilpert, Marco Kloos, Gabriela Nass Kovacs, Christopher M. Roome, R. Bruce Doak, Robert L. Shoeman, Lutz Foucar, Thomas R. M. Barends & Ilme Schlichting
European XFEL GmbH, Holzkoppel 4, 22869, Schenefeld, Germany
Johan Bielecki, Richard Bean, Katerina Dörner, Steffen Hauf, Yoonhee Kim, Romain Letrun, Marc Messerschmidt, Grant Mills, Marco Ramilli, Tokushi Sato, Jolanta Sztuk-Dambietz, Britta Weinhausen, Nasser Al-Qudami, Djelloul Boukhelef, Sandor Brockhauser, Wajid Ehsan, Moritz Emons, Sergey Esenov, Hans Fangohr, Alexander Kaukher, Thomas Kluyver, Max Lederer, Luis Maia, Maurizio Manetti, Thomas Michelat, Astrid Münnich, Florent Pallas, Guido Palmer, Gianpietro Previtali, Natascha Raab, Alessandro Silenzi, Janusz Szuba, Sandhya Venkatesan, Krzysztof Wrona, Jun Zhu & Adrian P. Mancuso
Department of Physics, UMR 625, UBL, University of Rennes 1, 35042, Rennes, France
Marco Cammarata
Deutsches Elektronensynchrotron DESY, Notkestraße 85, 22607, Hamburg, Germany
Lars Fröhlich & Matthias Scholz
BioXFEL STC, 700 Ellicott Street, Buffalo, NY, 14203, USA
Marc Messerschmidt
ARC Centre of Excellence for Advanced Molecular Imaging, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, VIC, 3086, Australia
Grant Mills
Center for Free-Electron Laser Science, Deutsches Elektronensynchrotron, Notkestraße 85, 22607, Hamburg, Germany
Tokushi Sato
Laboratoire de Spectrochimie Infrarouge et Raman, CNRS, UMR 8516, Université de Lille, 59000, Lille, France
Michel Sliwa
Institut de Biologie Structurale, Université Grenoble Alpes, CEA, CNRS, 38044, Grenoble, France
Martin Weik & Jacques-Philippe Colletier
Biological Research Centre (BRC), Hungarian Academy of Sciences, Temesvári krt. 62, Szeged, 6726, Hungary
Sandor Brockhauser
Department of Physics, Rutgers University Newark, 101 Warren Street, Newark, NJ, 07102, USA
Claudiu A. Stan

Authors

Marie Luise Grünbein
View author publications
You can also search for this author in PubMed Google Scholar
Johan Bielecki
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Gorel
View author publications
You can also search for this author in PubMed Google Scholar
Miriam Stricker
View author publications
You can also search for this author in PubMed Google Scholar
Richard Bean
View author publications
You can also search for this author in PubMed Google Scholar
Marco Cammarata
View author publications
You can also search for this author in PubMed Google Scholar
Katerina Dörner
View author publications
You can also search for this author in PubMed Google Scholar
Lars Fröhlich
View author publications
You can also search for this author in PubMed Google Scholar
Elisabeth Hartmann
View author publications
You can also search for this author in PubMed Google Scholar
Steffen Hauf
View author publications
You can also search for this author in PubMed Google Scholar
Mario Hilpert
View author publications
You can also search for this author in PubMed Google Scholar
Yoonhee Kim
View author publications
You can also search for this author in PubMed Google Scholar
Marco Kloos
View author publications
You can also search for this author in PubMed Google Scholar
Romain Letrun
View author publications
You can also search for this author in PubMed Google Scholar
Marc Messerschmidt
View author publications
You can also search for this author in PubMed Google Scholar
Grant Mills
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela Nass Kovacs
View author publications
You can also search for this author in PubMed Google Scholar
Marco Ramilli
View author publications
You can also search for this author in PubMed Google Scholar
Christopher M. Roome
View author publications
You can also search for this author in PubMed Google Scholar
Tokushi Sato
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Scholz
View author publications
You can also search for this author in PubMed Google Scholar
Michel Sliwa
View author publications
You can also search for this author in PubMed Google Scholar
Jolanta Sztuk-Dambietz
View author publications
You can also search for this author in PubMed Google Scholar
Martin Weik
View author publications
You can also search for this author in PubMed Google Scholar
Britta Weinhausen
View author publications
You can also search for this author in PubMed Google Scholar
Nasser Al-Qudami
View author publications
You can also search for this author in PubMed Google Scholar
Djelloul Boukhelef
View author publications
You can also search for this author in PubMed Google Scholar
Sandor Brockhauser
View author publications
You can also search for this author in PubMed Google Scholar
Wajid Ehsan
View author publications
You can also search for this author in PubMed Google Scholar
Moritz Emons
View author publications
You can also search for this author in PubMed Google Scholar
Sergey Esenov
View author publications
You can also search for this author in PubMed Google Scholar
Hans Fangohr
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Kaukher
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Kluyver
View author publications
You can also search for this author in PubMed Google Scholar
Max Lederer
View author publications
You can also search for this author in PubMed Google Scholar
Luis Maia
View author publications
You can also search for this author in PubMed Google Scholar
Maurizio Manetti
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Michelat
View author publications
You can also search for this author in PubMed Google Scholar
Astrid Münnich
View author publications
You can also search for this author in PubMed Google Scholar
Florent Pallas
View author publications
You can also search for this author in PubMed Google Scholar
Guido Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Gianpietro Previtali
View author publications
You can also search for this author in PubMed Google Scholar
Natascha Raab
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Silenzi
View author publications
You can also search for this author in PubMed Google Scholar
Janusz Szuba
View author publications
You can also search for this author in PubMed Google Scholar
Sandhya Venkatesan
View author publications
You can also search for this author in PubMed Google Scholar
Krzysztof Wrona
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zhu
View author publications
You can also search for this author in PubMed Google Scholar
R. Bruce Doak
View author publications
You can also search for this author in PubMed Google Scholar
Robert L. Shoeman
View author publications
You can also search for this author in PubMed Google Scholar
Lutz Foucar
View author publications
You can also search for this author in PubMed Google Scholar
Jacques-Philippe Colletier
View author publications
You can also search for this author in PubMed Google Scholar
Adrian P. Mancuso
View author publications
You can also search for this author in PubMed Google Scholar
Thomas R. M. Barends
View author publications
You can also search for this author in PubMed Google Scholar
Claudiu A. Stan
View author publications
You can also search for this author in PubMed Google Scholar
Ilme Schlichting
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.L.G., M.St., M.K., R.L.S. and R.B.D. performed injection, E.H. and I.S. prepared crystals, M.H. and L.Fou. performed online data analysis, A.G., J.-P.C., and T.R.M.B. performed offline analysis. J.B., M.Me., K.D., R.B., T.S.,Y. K., A.P.M. and R.L. performed data collection. M.Me., J.S.-D., S.H., N.R., A.Mü., A.K. and M.R. developed, controlled and operated the detector. M.Sch. and L.Fr. prepared and tested the specific repetition rates of the accelerator. H.F., T.M., A.S., S.B., J.Z., T. K., S. E. and W.E. developed control and online analysis software. K.W., D.B., L.M., G.P., J.S., N.Al Q. and M. Ma. performed data management. T.S., J.B., R.L., M. L. G. and C.A.S. set up femtosecond jet imaging, I.S. and C.A.S. conceived the experiment. I.S., C.A.S, J.B., B.W. and A.P.M. designed the experiment. M.L.G., A.G., M.St., R.B., J.B., M.C., K.D., E.H., M.H., Y.K., M.K., R.L., J.S.-D., A.P.M., M.Me., G.M., G.N.K., C.M.R., T.S., M.Sl., M.W., B.W., G. M., M.R., R.B.D., R.L.S., L.Fou., J.-P.C., T.R.M.B., C.A.S. and I.S. prepared the experiment. F.P., S.V., G.P., M.E. and M.L. designed and set up the femtosecond laser. M.L.G., T.R.M.B., C.A.S. and I.S. wrote the paper with input from all authors.

Corresponding authors

Correspondence to Thomas R. M. Barends, Claudiu A. Stan or Ilme Schlichting.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Grünbein, M.L., Bielecki, J., Gorel, A. et al. Megahertz data collection from protein microcrystals at an X-ray free-electron laser. Nat Commun 9, 3487 (2018). https://doi.org/10.1038/s41467-018-05953-4

Download citation

Received: 11 July 2018
Accepted: 01 August 2018
Published: 28 August 2018
DOI: https://doi.org/10.1038/s41467-018-05953-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.