Multi-wavelength anomalous diffraction de novo phasing using a two-colour X-ray free-electron laser with wide tunability

Gorel, Alexander; Motomura, Koji; Fukuzawa, Hironobu; Doak, R. Bruce; Grünbein, Marie Luise; Hilpert, Mario; Inoue, Ichiro; Kloos, Marco; Kovácsová, Gabriela; Nango, Eriko; Nass, Karol; Roome, Christopher M.; Shoeman, Robert L.; Tanaka, Rie; Tono, Kensuke; Joti, Yasumasa; Yabashi, Makina; Iwata, So; Foucar, Lutz; Ueda, Kiyoshi; Barends, Thomas R. M.; Schlichting, Ilme

doi:10.1038/s41467-017-00754-7

Download PDF

Article
Open access
Published: 27 October 2017

Multi-wavelength anomalous diffraction de novo phasing using a two-colour X-ray free-electron laser with wide tunability

Alexander Gorel¹,
Koji Motomura^2,3,
Hironobu Fukuzawa^2,3,
R. Bruce Doak¹,
Marie Luise Grünbein¹,
Mario Hilpert¹,
Ichiro Inoue³,
Marco Kloos¹,
Gabriela Kovácsová¹,
Eriko Nango^3,4,
Karol Nass¹,
Christopher M. Roome¹,
Robert L. Shoeman¹,
Rie Tanaka³,
Kensuke Tono⁵,
Yasumasa Joti⁵,
Makina Yabashi ORCID: orcid.org/0000-0002-2472-1684³,
So Iwata^3,4,
Lutz Foucar¹,
Kiyoshi Ueda^2,3,
Thomas R. M. Barends¹ &
…
Ilme Schlichting¹

Nature Communications volume 8, Article number: 1170 (2017) Cite this article

3322 Accesses
27 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Serial femtosecond crystallography at X-ray free-electron lasers (XFELs) offers unprecedented possibilities for macromolecular structure determination of systems prone to radiation damage. However, de novo structure determination, i.e., without prior structural knowledge, is complicated by the inherent inaccuracy of serial femtosecond crystallography data. By its very nature, serial femtosecond crystallography data collection entails shot-to-shot fluctuations in X-ray wavelength and intensity as well as variations in crystal size and quality that must be averaged out. Hence, to obtain accurate diffraction intensities for de novo phasing, large numbers of diffraction patterns are required, and, concomitantly large volumes of sample and long X-ray free-electron laser beamtimes. Here we show that serial femtosecond crystallography data collected using simultaneous two-colour X-ray free-electron laser pulses can be used for multiple wavelength anomalous dispersion phasing. The phase angle determination is significantly more accurate than for single-colour phasing. We anticipate that two-colour multiple wavelength anomalous dispersion phasing will enhance structure determination of difficult-to-phase proteins at X-ray free-electron lasers.

Serial femtosecond crystallography

Article 04 August 2022

Thomas R. M. Barends, Benjamin Stauch, … Ilme Schlichting

Megahertz pulse trains enable multi-hit serial femtosecond crystallography experiments at X-ray free electron lasers

Article Open access 11 August 2022

Susannah Holmes, Henry J. Kirkwood, … Connie Darmanin

Time-resolved serial femtosecond crystallography at the European XFEL

Article 18 November 2019

Suraj Pandey, Richard Bean, … Marius Schmidt

Introduction

The bright femtosecond X-ray pulses of X-ray free-electron lasers (XFELs) provide novel opportunities for macromolecular structure determination¹. In particular, by using a ‘diffraction before destruction’ approach^{2, 3}, they allow structure determination of systems prone to radiation damage such as nano- and microcrystals^4,5,6 or, in many cases, crystals with high-solvent content. The molecules themselves can often be highly radiation sensitive, for example, owing to the presence of metals^7,8,9 or other redox-sensitive cofactors.

To date, most crystal structures determined via XFEL data collection were solved by molecular replacement using prior structural information for phasing. This approach is suitable when seeking specific information about known protein structures, such as the undamaged active site of a metalloenzyme^7,8,9,10,11 or the nature of a short-lived reaction species as probed in a time-resolved experiment^{12,13,14,15,16,17}. In the long run, however, as XFEL-based data collection matures and also becomes more accessible (with several new XFEL sources coming online this year alone), more and more systems will be studied for which no previous structural information is available. De novo phasing then becomes mandatory. De novo phasing of XFEL data has recently been demonstrated for several model systems, employing a variety of methods based on anomalous signals^{18,19,20,21,22,23} utilising element-specific scattering at X-ray absorption edges or isomorphous differences between native and heavy atom derivatized crystals^{5, 24}. Importantly, a previously unknown structure has now also been solved de novo with XFEL data⁵.

Despite these successes, de novo phasing of XFEL data remains challenging. This is due to the stochastic nature of XFEL sources and methods of data collection, compounded by current detectors and analysis programmes that limit the accuracy of the integrated diffraction intensities. In contrast to conventional rotation data acquisition, the femtosecond exposure time at XFELs precludes any rotation during exposure and thus results in the collection of still images that contain only partial reflections. Since exposure to the full XFEL beam destroys the illuminated crystal (or at least the illuminated portion thereof), a new crystal, (or a fresh portion), is required for the next exposure. In the case of microcrystals, this must necessarily be a fresh, randomly oriented crystal, leading to a data collection approach termed serial femtosecond crystallography (SFX). The size and quality of microcrystals can vary, however. Moreover, the crystals can intersect the focused XFEL beam anywhere between the low intensity periphery and the high intensity centre of the of X-ray focal spot. Hence, diffraction intensities vary from shot to shot even for identical microcrystals in identical orientations. In addition, the XFEL pulse and photon energy distribution (intensity and wavelength) vary from shot to shot. Together, all of this results in significant fluctuations in the measured intensities that must be averaged out. Consequently, a great deal of data must be collected; the multiplicity of measurements for a given reflection being typically several 100- to 1,000-fold depending on the phasing method and signal strength. This demands not only significant quantities of sample but also of XFEL beam time, both of which are typically precious and often limiting. Improved use of one or both is essential to future evolution of XFEL-based structural biology. To double data collection efficiency, Hunter et al.²² employed two interaction chambers in series, collecting SFX data using the primary XFEL beam and then ‘reusing’ the ‘spent’ XFEL beam after it had passed the first sample and detector²⁵. However, this type of data collection does not reduce sample consumption.

The recently established two-colour operation of the SPring-8 Angstrom Compact free-electron LAser (SACLA) in Japan²⁶ opened up a novel possibility of collecting two SFX datasets simultaneously, without doubling the amount of sample used. Owing to the unprecedentedly large energy separation of the two tuneable colours of that XFEL beam²⁶ two distinct and spatially well separated diffraction patterns can be recorded simultaneously on one diffraction image of the same crystal. The simultaneous arrival of the two XFEL pulses precludes damage effects from the first pulse affecting the diffraction of the second pulse²⁷. This allows simultaneous same-crystal acquisition of two-wavelength datasets for multiple wavelength anomalous dispersion (MAD) phasing. (This is in marked contrast to data collection at synchrotron sources where they are typically collected sequentially.).

In principle, given the availability of more information, MAD phase angles are expected to be more accurate than those from single wavelength anomalous dispersion (SAD) experiments. To explore whether this can be put to use for XFEL-based de novo phasing with the added benefit of halved sample consumption, we performed a two-colour SFX experiment at SACLA. Using microcrystals of the well-established model system lysozyme, in complex with a lanthanide compound we demonstrate here that simultaneously collected two-colour SFX diffraction data can be analysed and phased de novo. The two colours (7 and 9 keV) were chosen to be above the M-edges and L-edges, respectively (see Fig. 1a). We compare phasing via multiple wavelength and SAD of the two and single-colour SFX data, respectively, and show that the phases are significantly more accurate, facilitating model building for the two-colour data.

Results

Experimental set-up and parameter determination

To test whether two-colour data offer advantages for de novo phasing of SFX data, we used microcrystals of a well-characterised lysozyme heavy atom derivative that gives a strong anomalous signal from two gadolinium atoms per asymmetric unit²⁸. This is the same system we employed previously to establish that de novo phasing of SFX data is possible¹⁸. Lysozyme microcrystals were soaked in gadoteridol, an organic gadolinium complex, and then embedded in a grease matrix²⁹ for high-viscosity extrusion injection³⁰ into the XFEL beam at SACLA. Two-colour data collection was performed at beamline 3 (BL3) in the DAPHNIS chamber³¹ using a multiport charge coupled device (MPCCD) detector³² (see Fig. 1b). SACLA operated at 30 Hz and simultaneously delivered two-colour X-ray pulses of 10 fs duration and nominally 7.0 keV (λ = 1.770 Å) and 9.0 keV (λ = 1.378 Å) photon energy of 0.14 mJ average power. The focal spot-size was measured to be 1.4 µm (vertical) × 1.6 µm (horizontal) in FWHM and spatial overlap of the two colours was confirmed. To account for the at times higher pulse energy of the 7 keV beam as well as the higher detector quantum efficiency (DQE 0.7 at 7 keV and 0.4 at 9 keV³² (http://xfel.riken.jp/users/mpccd_detector/instructions_ver1.0_revised.pdf)) and scattering cross sections, we inserted a 25 µm Al filter upstream of the sample that transmitted 60% and 80 % of the 7 keV and 9 keV photons, respectively. We collected 570,000 diffraction patterns in ~12 h. Online data analysis was performed with CASS³³ and the Graphic User Interface to the offline data processing pipeline Cheetah Dispatcher³⁴ was used to identify 208,373 hits (37% hit rate), using the pipe-line generated geometry file. We used powder patterns of silicon nanocrystals for the accurate determination of the detector distance by applying an interest point algorithm and distance score function optimisation as described in detail in the Supplementary Methods. A wide-range inline spectrometer was used to simultaneously record the spectral information for the 7 keV and 9 keV colours for each XFEL pulse as described in the Methods Section and the Supplementary Note 1. Software modules were implemented to integrate the spectrometer readout into data processing by a Python interface of the SACLA API (application programming interface) to the metadata database (write_spectra.py) and to add the correct wavelength to the respective diffraction image (write_calib_color.py) so that it can be accessed by the processing software (the indexamajig module from CrystFEL³⁵). Supplementary Fig. 1 shows a flowchart of the data analysis. Using the corrected values of the wavelengths and the refined detector distance increased the indexing rate significantly. Out of 208,373 hits we could index 15,243 (7.3 %) in 7 keV, 23,860 (11.4 %) in 9 keV and 2,129 (1%) in both colours (see Fig. 1c, Table 1, Supplementary Figs. 9–11).

Table 1 Indexing rate of the 208,373 hits at the various stages of the analysis

Full size table

Efficient two-colour data processing

The two-colour beam is generated in a split undulator operation of the SACLA XFEL. The pulse energies of the two colours can be balanced or adjusted relatively by changing the number of undulators²⁶. We aimed at equal distribution, but the pulse energy distribution of the two colours varied during the experiment, with the consequence that the diffraction images typically contained a strong and a weak diffraction pattern (see Supplementary Fig. 9). This made peak finding more challenging, given that the analysis software identifies spots in a diffraction pattern by use of intensity thresholding. The strong Bragg reflections from the brighter colour are more likely to lie above the threshold than those of the weaker colour and consequently the list of diffraction spots compiled for indexing will be dominated by spots from the strong pattern. Initial indexing was performed separately for the two colours (see Supplementary Note 2) and yielded the expected unit cell parameters (a = b = 78.3 Å, c = 39.1 Å, α = β = γ = 90°) for gadoteridol-derivatized lysozyme¹⁸, which were subsequently imposed loosely on indexing. A median filter was applied (see Methods Section for details) to reduce the effects of background and to increase both indexing accuracy and resolution by including weak high resolution reflections into orientation matrix calculation. This resulted in the indexing of the strong diffraction pattern.

To process the second, weaker diffraction pattern in the image, the threshold and minimum I/σ values of the peak search parameters were lowered to include weak Bragg reflections (see Supplementary Fig. 10 in Supplementary Note 2). As some of the identified peaks are part of the stronger diffraction pattern, these need to be removed from the peak list for the search for peaks from the second diffraction pattern. To this end, the write_subtract.py module was implemented to remove all spots from the peak list that were closer than 10 pixels to spot positions indexed with the first colour. The remaining peak positions were then passed directly to CrystFEL’s indexamajig module³⁵ for indexing. This procedure significantly increased the two-colour indexing rate (11.1 % (23,144 images of 208,373), see Table 1). The final structure factors for the two colours were calculated from diffraction images that were two-colour indexable (see Table 2 for data statistics and Supplementary Fig. 11).

Table 2 SFX data statistics

Full size table

Phasing

Phases were determined automatically using AutoSHARP³⁶, using data to 1.9 Å resolution. This programme searches for the heavy atoms, refines their positions, B-factors and occupancies, calculates phases and performs solvent flattening. It then performs an initial round of autobuilding using BUCCANEER³⁷, followed by more solvent flattening taking the initial model into account, and then performs a final round of model building using ARP/WARP³⁸.

To investigate the usefulness of the two-colour phasing approach, SAD (using the 9 keV data only) and MAD (using both colours) automatic phasing was attempted with subsets of 9,000, 6,000, 5,000 and 4,000 images. At 4,000 images, both SAD and MAD failed as defined here by the failure of the programme to build the correct structure. All other attempts were successful in that >90% of the structure was built correctly in the second round of automatic building.

However, there was a clear improvement in the accuracy of the phase angle upon comparing the two-colour MAD results with the single-colour SAD results, as shown in Table 3. Plotting the estimate of the cosine of the phase angle error (figure of merit, FOM) as a function of resolution shows this improvement to mainly be seen at medium and low resolution (see Fig. 2). More importantly, at 5,000 images, the results of automatic building were clearly better in both the first and the second round of automatic building. This suggests that for difficult cases, the two-colour approach is superior.

Table 3 Final phasing statistics

Full size table

Discussion

Two-colour XFEL operation^{26, 39} enables new scientific applications, ranging from X-ray pump/X-ray probe experiments to the expected use for MAD phasing of SFX data³⁹. The split undulator operation at SACLA provides two-colour double X-ray pulses with large and flexible wavelength separation of more than 30%²⁶. A large wavelength separation facilitates data analysis of two-colour SFX data because it ensures that most Bragg reflections in the diffraction pattern are spatially well separated and can be integrated without deconvolution, which would compromise data quality. We describe a proof-of-concept study using two-colour XFEL pulses for MAD phasing of SFX data of a lysozyme gadolinium derivative, a well-characterised model system^{18, 28}.

The choice of the photon energies of the two pulses depends on the energy of absorption edge(s). We chose 7 keV and 9 keV, below and above the L-edges of Gd, respectively. This yields a large anomalous signal difference and a good spatial separation of the two diffraction patterns. In fact, this photon energy difference is so large that very different regions of reciprocal space are probed. In addition to the two photon energies the ratio of their pulse energies needs to be chosen. X-ray—matter cross sections depend strongly on photon energy as can detector quantum efficiencies. The lower photon energy will give stronger Bragg intensities that are often recorded more efficiently, whereas the higher photon energy will produce much weaker Bragg intensities that are then measured with lower efficiency. The intensity ratio of the two colours can be addressed either on the machine side by changing the number of undulators used to produce each colour²⁶ or by inserting a filter into the X-ray beam that absorbs and thus attenuates the colour with the lower photon energy.

The analysis of two-colour SFX data is not straight-forward. In fact, direct processing with CrystFEL³⁵ was unsuccessful, as only a minute fraction of the hits could be indexed in both colours (see Table 1). Despite aiming for similar pulse energies for the two colours and compensating for the difference in DQE by inserting an aluminium filter, the intensity distribution of the two patterns in the diffraction image varied. One diffraction pattern typically dominated and could be indexed in one colour, but indexing of the weaker second diffraction pattern in the other colour typically failed. To index the weaker diffraction pattern, the threshold for identifying peaks had to be lowered and the previously indexed peaks were eliminated from the list. Using this approach (see Supplementary Note 2) we successfully indexed and integrated 11.1 % of the hits in both colours (see Tables 1, 2).

We deliberately used a model system with an unusually strong anomalous signal. In spite of this, we see a significant increase in the data information content of the two-colour data used for MAD phasing, as evidenced by the higher figure of merit indicating more accurate initial phases, and easier model building compared with the single-colour data SAD-phasing approach. This difference is particularly striking at 5,000 images, which is a comparatively low number for SFX data collection. Hence, these data are of lower precision than those from larger number of images, as evidenced by the data statistics (Table 2). At 5,000 images, the first round of automated building essentially failed in the SAD case, whereas in the MAD case most of the structure was built (see Fig. 3). It has been suggested that for suboptimal data, density modification might more easily improve even inaccurate phases provided by MAD, which are unimodal, rather than SAD phases which are additionally compromised by a handedness ambiguity⁴⁰. This could help explain the superiority of the MAD phases during the later stages of structure determination. We expect the difference between SAD and MAD to be even larger for more challenging cases with weaker anomalous signals.

Traditionally, two-wavelength MAD phasing involves data collection at the peak and the inflection point (which are very close together) of an absorption edge, where the scattering properties are extremely sensitive to wavelength changes. This is challenging at XFELs owing to the inherent energy jitter of the self-amplified spontaneous emission beam. Although one could resolve such narrow energy gaps between two-colour pulses by sorting the data according to the measured per-pulse photon energy spectra after data collection, data analysis would still be extremely challenging because Bragg peaks would spatially overlap⁴¹. Therefore, we measured above the M- and L-edges, respectively, which, in addition, gives a large anomalous difference signal. This approach works for all elements that have more than three edges, i.e., all elements with Z ≥ 52 (Te), which includes in particular the metals used in traditional heavy atom derivatives (Hg, Pt, Au, …). But even in the absence of a second absorption edge, the two-colour approach is likely to be highly useful for systems that are difficult to phase. When collecting SFX data with a two-colour beam that has a large energy separation, very different regions of reciprocal space are in diffraction condition simultaneously. Indexing the reflections belonging to one colour yields the orientation matrix of the unit cell relative to the laboratory system. Future software may then use this matrix as a starting point for the initial indexing of the Bragg reflections of the second colour. Since they provide a different set of diffraction conditions, the matrix can be optimised for the second colour and through iterative refinement using the two sets of reflections, an extremely accurate orientation matrix can be obtained, in particular for the weak high resolution reflections. This is akin to the advantage of the rotation method where the initially determined orientation matrix is refined by minimising the difference in locations of predicted and observed reflections occurring in a different part of reciprocal space observed in later frames. Ideally, a global refinement including both colours should be performed but this is not possible with the currently available software. We expect that such improvements in analysis software together with detector developments increasing in particular the dynamic range will greatly facilitate two-colour data collection and MAD phasing at XFELs.

Given the emergence of and rapidly increasing demand for serial data collection at synchrotron sources^{30, 42,43,44,45,46,47,48,49}, this approach also requires efficient de novo phasing methods. Interestingly, it has been demonstrated previously that a dichromatic beam approach for MAD data collection is feasible at synchrotron sources⁴¹, analogously to the experiment described here. Although the data may not be radiation damage free, it would be easy to achieve enough spatial separation between reflections, maximise the phasing signal by selecting the absorption edge or inflection point. This is feasible because of the low bandwidth at synchrotron sources, which, in addition, do not suffer from fluctuations in the relative intensity of the two colour beams.

In conclusion, we have demonstrated that XFEL-based two-colour phasing is not only feasible but also advantageous. Using a well-characterised model system we show that significantly fewer indexed patterns are required for de novo phasing using two-colour data compared with single-colour data. This should reduce the required amounts of sample and beamtime requirements. We expect two-colour data collection to be particularly useful for difficult-to-phase projects where it may make the crucial difference between being able to solve the structure and not.

Methods

Sample preparation and injection

The two-colour experiment (proposal number 2015B8045) was performed in January 2016 at the Japanese XFEL SACLA in Hyogo. Lysozyme/gadoteridol microcrystals were prepared as described previously¹⁸ except that crystal growth was done at 20 °C, resulting in larger crystals (10 × 10 × 10–15 μm). In brief, 2.5 ml of protein solution (32 mg ml⁻¹ hen egg white lysoyme (Sigma) in 0.1 M sodium acetate buffer pH 3.0) and 7.5 ml precipitate solution (20 % NaCl, 6 % PEG 6,000, 0.1 M sodium acetate pH 3.0) were mixed rapidly and left over night at room temperature on a slowly rotating wheel shaker. After gravity-induced settling, the crystalline pellet was washed several times in crystal storage solution (8% NaCl, 0.1 M sodium acetate buffer, pH 4.0). At least 30 min prior to data collection, 100 mM gadoteridol (Gd³⁺:10-(2-hydroxypropyl)-1,4,7,10-tetraazacyclododecane-1,4,7-triacetic acid)²⁸ was added to the storage solution and the crystals were left to incubate at room temperature.

A total of 7 µl of microcrystalline pellet was mixed with 75 µl grease (Super Lube) and then filled into the reservoir of a High Viscosity Extrusion injector³⁰. The injector was mounted in the DAPHNIS chamber³¹ which was filled with a humid helium atmosphere. Sample was extruded at a flow rate of 0.3 µl min⁻¹. Because of the limited dynamic range of the MPCCD detector, different crystal sizes and thicknesses of aluminium attenuators were tested in order to minimise the number of saturated reflections while keeping as much as possible of the weak high resolution diffraction.

Wavelength determination using inline spectrometers

A single-shot inline spectrometer was used to measure part of the Debye–Scherrer (111) diffraction rings from a diamond powder²⁶ using a MPCCD detector and stored as an image of 1,024 × 512 pixel. For the profile parameter calculation the image was collapsed into a one dimensional image of 1024 pixel; all pixel reads from the same column were summed to give rise to a double Lorentzian beam intensity profile⁵⁰. We implemented the write_spectra.py module to perform non-linear model-fits with automatically estimated starting values for specified runs and to write the fitted parameters into a HDF5 data format file (spectra files). The energy calibration function was obtained from the comparison between the respective readings of the wide range and the narrow range inline spectrometers that were acquired during two reference runs for both photon energies (7 keV or 9 keV) (see Supplementary Note 1). The write_calib_color.py module was implemented to apply the energy calibration function to the fitted parameters obtained with the write_spectra.py module and to add the wavelength to the respective diffraction images.

Peak identification and thresholding

We used CrystFEL version 0.6.2. Peaks were identified by thresholding by CrystFEL’s indexamajig module³⁵. The initial threshold value τ was determined as the median of τ = μ + 4σ, the sum of the mean µ of the pixel intensity reads and its standard deviation σ (obtained from 1,000 diffraction images). Assuming a Cauchy distribution, this corresponds to the 0.92 quantile of the pixel read values in the image. Peaks were identified using a threshold τ of 700 arbitrary detector units (ADU), and default values for the minimal signal to noise ratio (min-snr = 5) and the minimal gradient of (min-gradient = 10,000). Indexing yielded the same unit cell parameters (a = b = 78.3 Å, c = 39.1 Å, α = β = γ = 90°) as determined previously¹⁸. These values were loosely imposed on the subsequent analysis steps. Deviations of the values of unit cell lengths and angles were restricted to 10% and 2%, respectively.

The final parameters were chosen such that over a wide resolution range the diffraction spots could be found by CrystFEL’s indexamajig module³⁵. For this purpose, the peak values and the peak background values of successfully indexed images were inspected. From the distribution of peak values above background a threshold of 200 ADU was selected. From the distribution of ratios between the peak value and the background noise a value of 5 for the signal-to-noise ratio (snr) was determined. Thus, in combination with the optional median filtering (--median-filter) the effects of the background were minimised, as the background is subtracted before thresholding takes place. From the spatial distribution of diffraction peaks (diameter 2 pixels) a mean distance of 15 pixels between two adjacent diffraction spots of a diffraction pattern was estimated. Thus a median filter with window size 16 pixels was chosen. After successful processing of the complete dataset with these values and an error analysis it was decided to increase the integration radii to (6,6,8) to compensate for the errors in diffraction spot position predictions owing to residual errors in the wavelength and detector distance estimates.

To identify the second diffraction pattern in the diffraction image, the peak search parameters were lowered to select a broader set of peaks from the image (threshold 150, min-snr 3 and min-gradient 10,000, median filter 16 pixels). The write_subtract.py module was implemented to remove all spots from the peak list that are closer than 10 pixels to peaks identified as belonging to the first diffraction pattern. The remaining peaks were indexed in the second colour.

Data analysis and phasing

Data analysis was performed on the SACLA High Performance Computing Cluster. For the purpose of visualisation, analysis, iteration and filtering within data processing routines written in Python, a parser for the CrystFEL³⁵ stream file was implemented. The stream2h5.py module scans the gigabyte-sized stream file once and transforms each line into a target data structure from which other routines (e.g., the write_subtract.py module) can extract the required information directly. The parser produces a file in HDF5 data format (which is smaller than the stream file by roughly a factor of two) to make parameters from the CrystFEL³⁵ stream file available in a standardised and time-efficient way.

Phasing was performed with AutoSHARP³⁶ Version 2.8.5, using data to 1.9 Å resolution. The 9 keV (1.38 Å wavelength) data was used either on its own for SAD phasing, or as the peak wavelength for 2-colour MAD phasing, in which case the 7 keV (1.77 Å wavelength) data were used as inflection point data. Initial estimates of f′/f″ for the 9 and 7 keV data were −4.0/11.7 e⁻ and −10.0/3.8 e⁻, respectively. AutoSHARP³⁶ searched for 2 Gd atoms using SHELXD⁵¹, and after phasing and solvent flattening with automated optimisation of the solvent content performed two cycles of autobuilding, the first with BUCCANEER³⁷ and the second with ARP/wARP³⁸, with additional automatic solvent flattening in between. A final model refined against the 5,000 image 9 keV dataset was obtained by iterative rebuilding using COOT⁵² and refinement using REFMAC5⁵³. The final model displayed excellent geometry (RMSD bond lengths 0.007 Å, RMSD angles 1.6°, no Ramachandran outliers) and good R-factors (R/Rfree 0.186/0.214).

Code availability

Our scripts can be downloaded from https://github.com/AlexanderGorel/crystallography under the GNU General Public License v3.0.

Data availability

We have deposited the diffraction data reported in this study (all images collected as well as hits only) for method development in the CXIDB.org data bank with the accession code id-66 (http://cxidb.org/id-66.html). Coordinates and structure factors derived from the 5,000 images lysozyme data have been deposited in the Protein Data Bank (http://www.wwpdb.org) under the accession code 5OER. Other data are available from the corresponding author upon reasonable request.

References

Schlichting, I. Serial femtosecond crystallography: the first five years. IUCrJ 2, 246–255 (2015).
Article CAS PubMed PubMed Central Google Scholar
Neutze, R., Wouts, R., van der Spoel, D., Weckert, E. & Hajdu, J. Potential for biomolecular imaging with femtosecond X-ray pulses. Nature 406, 752–757 (2000).
Article ADS CAS PubMed Google Scholar
Chapman, H. N., Caleman, C. & Timneanu, N. Diffraction before destruction. Phil. Trans. R. Soc. B 369, 20130313 (2014).
Article PubMed PubMed Central Google Scholar
Sawaya, M. R. et al. 2.9 Å-resolution protein crystal structure obtained from injecting bacterial cells into an x-ray free-electron laser beam. Proc. Natl. Acad. Sci. USA 111, 12769–12774 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Colletier, J. P. et al. De novo phasing with X-ray laser reveals mosquito larvicide BinAB structure. Nature 539, 43–47 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Gati, C. et al. Atomic structure of granulin determined from native nanocrystalline granulovirus using an X-ray free-electron laser. Proc. Natl. Acad. Sci. USA 114, 2247–2252 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Kern, J. et al. Simultaneous femtosecond X-ray spectroscopy and diffraction of photosystem II at room temperature. Science 340, 491–495 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Hirata, K. et al. Determination of damage-free crystal structure of an X-ray-sensitive protein using an XFEL. Nat. Methods 11, 734–736 (2014).
Article CAS PubMed Google Scholar
Chreifi, G. et al. Crystal structure of the pristine peroxidase ferryl center and its relevance to proton-coupled electron transfer. Proc. Natl. Acad. Sci. USA 113, 1226–1231 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Suga, M. et al. Native structure of photosystem II at 1.95 A resolution viewed by femtosecond X-ray pulses. Nature 517, 99–103 (2015).
Article ADS CAS PubMed Google Scholar
Fukuda, Y. et al. Redox-coupled structural changes in nitrite reductase revealed by serial femtosecond and microfocus crystallography. J. Biochem. 159, 527–538 (2016).
Article CAS PubMed PubMed Central Google Scholar
Barends, T. R. et al. Direct observation of ultrafast collective motions in CO myoglobin upon ligand dissociation. Science 350, 445–450 (2015).
Article ADS CAS PubMed Google Scholar
Pande, K. et al. Femtosecond structural dynamics drives the trans/cis isomerization in photoactive yellow protein. Science 352, 725–729 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Nango, E. et al. A three-dimensional movie of structural changes in bacteriorhodopsin. Science 354, 1552–1557 (2016).
Article ADS CAS PubMed Google Scholar
Stagno, J. R. et al. Structures of riboswitch RNA reaction states by mix-and-inject XFEL serial crystallography. Nature 541, 242–246 (2017).
Article ADS CAS PubMed Google Scholar
Suga, M. et al. Light-induced structural changes and the site of O=O bond formation in PSII caught by XFEL. Nature 543, 131–135 (2017).
Article ADS CAS PubMed Google Scholar
Young, I. D. et al. Structure of photosystem II and substrate binding at room temperature. Nature 540, 453–457 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Barends, T. R. et al. De novo protein crystal structure determination from X-ray free-electron laser data. Nature 505, 244–247 (2014).
Article ADS CAS PubMed Google Scholar
Nakane, T. et al. Native sulfur/chlorine SAD phasing for serial femtosecond crystallography. Acta. Crystallogr. D Biol. Crystallogr. 71, 2519–2525 (2015).
Article CAS PubMed PubMed Central Google Scholar
Nass, K. et al. Protein structure determination by single-wavelength anomalous diffraction phasing of X-ray free-electron laser data. IUCrJ 3, 180–191 (2016).
Article CAS PubMed PubMed Central Google Scholar
Batyuk, A. et al. Native phasing of x-ray free-electron laser data for a G protein-coupled receptor. Sci. Adv. 2, e1600292 (2016).
Article ADS PubMed PubMed Central Google Scholar
Hunter, M. S. et al. Selenium single-wavelength anomalous diffraction de novo phasing using an X-ray-free electron laser. Nat. Commun. 7, 13388 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Nakane, T. et al. Membrane protein structure determination by SAD, SIR, or SIRAS phasing in serial femtosecond crystallography using an iododetergent. Proc. Natl. Acad. Sci. USA 113, 13039–13044 (2016).
Article CAS PubMed PubMed Central Google Scholar
Yamashita, K. et al. An isomorphous replacement method for efficient de novo phasing for serial femtosecond crystallography. Sci. Rep. 5, 14017 (2015).
Article ADS PubMed PubMed Central Google Scholar
Boutet, S. et al. Characterization and use of the spent beam for serial operation of LCLS. J. Synchrotron. Radiat. 22, 634–643 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hara, T. et al. Two-colour hard X-ray free-electron laser with wide tunability. Nat. Commun. 4, 2919 (2013).
Article PubMed CAS Google Scholar
Inoue, I. et al. Observation of femtosecond X-ray interactions with matter using an X-ray-X-ray pump-probe scheme. Proc. Natl. Acad. Sci. USA 113, 1492–1497 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Girard, E., Chantalat, L., Vicat, J. & Kahn, R. Gd-HPDO3A, a complex to obtain high-phasing-power heavy-atom derivatives for SAD and MAD experiments: results with tetragonal hen egg-white lysozyme. Acta. Crystallogr. D Biol. Crystallogr. 58, 1–9 (2002).
Article PubMed CAS Google Scholar
Sugahara, M. et al. Grease matrix as a versatile carrier of proteins for serial crystallography. Nat. Methods 12, 61–63 (2015).
Article CAS PubMed Google Scholar
Botha, S. et al. Room-temperature serial crystallography at synchrotron X-ray sources using slowly flowing free-standing high-viscosity microstreams. Acta Crystallogr. D Biol. Crystallogr. 71, 387–397 (2015).
Article CAS PubMed Google Scholar
Tono, K. et al. Diverse application platform for hard X-ray diffraction in SACLA (DAPHNIS): application to serial protein crystallography using an X-ray free-electron laser. J. Synchrotron. Radiat. 22, 532–537 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kameshima, T. et al. Development of an X-ray pixel detector with multi-port charge-coupled device for X-ray free-electron laser experiments. Rev. Sci. Instrum. 85, 033110 (2014).
Article ADS PubMed CAS Google Scholar
Foucar, L. et al. CASS - CFEL-ASG software suite. Comput. Phys. Commun. 183, 2207–2213 (2012).
Article ADS CAS Google Scholar
Nakane, T. et al. Data processing pipeline for serial femtosecond crystallography at SACLA. J. Appl. Crystallogr. 49, 1035–1041 (2016).
Article CAS PubMed PubMed Central Google Scholar
White, T. A. et al. CrystFEL: a software suite for snapshot serial crystallography. J. Appl. Cryst. 45, 335–341 (2012).
Article CAS Google Scholar
Vonrhein, C., Blanc, E., Roversi, P. & Bricogne, G. Automated structure solution with autoSHARP. Methods Mol. Biol. 364, 215–230 (2007).
CAS PubMed Google Scholar
Cowtan, K. The Buccaneer software for automated model building. 1. Tracing protein chains. Acta. Crystallogr. D Biol. Crystallogr. 62, 1002–1011 (2006).
Article PubMed CAS Google Scholar
Langer, G., Cohen, S. X., Lamzin, V. S. & Perrakis, A. Automated macromolecular model building for X-ray crystallography using ARP/wARP version 7. Nat. Protoc. 3, 1171–1179 (2008).
Article CAS PubMed PubMed Central Google Scholar
Marinelli, A. et al. High-intensity double-pulse X-ray free-electron laser. Nat. Commun. 6, 6369 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gonzalez, A. A comparison of SAD and two-wavelength MAD phasing for radiation-damaged Se-MET crystals. J. Synchrotron. Radiat. 14, 43–50 (2007).
Article CAS PubMed Google Scholar
Kumasaka, T., Yamamoto, M., Yamashita, E., Moriyama, H. & Ueki, T. Trichromatic concept optimizes MAD experiments in synchrotron X-ray crystallography. Structure 10, 1205–1210 (2002).
Article CAS PubMed Google Scholar
Martin-Garcia, J. M. et al. Serial millisecond crystallography of membrane and soluble protein microcrystals using synchrotron radiation. IUCrJ 4, 439–454 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hasegawa, K. et al. Development of a dose-limiting data collection strategy for serial synchrotron rotation crystallography. J. Synchrotron. Radiat. 24, 29–41 (2017).
Article CAS PubMed PubMed Central Google Scholar
Owen, R. L. et al. Low-dose fixed-target serial synchrotron crystallography. Acta. Crystallogr. D Biol. Crystallogr. 73, 373–378 (2017).
Article CAS Google Scholar
Kovácsová, G. et al. Viscous hydrophilic injection matrices for serial crystallography. IUCrJ 4, 400–410 (2017).
Article PubMed PubMed Central Google Scholar
Nogly, P. et al. Lipidic cubic phase serial millisecond crystallography using synchrotron radiation. IUCrJ 2, 168–176 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gati, C. et al. Serial crystallography on in vivo grown microcrystals using synchrotron radiation. IUCrJ 1, 87–94 (2014).
Article CAS PubMed PubMed Central Google Scholar
Stellato, F. et al. Room-temperature macromolecular serial crystallography using synchrotron radiation. IUCrJ 1, 204–212 (2014).
Article CAS PubMed PubMed Central Google Scholar
Roedig, P. et al. High-speed fixed-target serial virus crystallography. Nat. Meth. 14, 805–810 (2017).
Article CAS Google Scholar
Tamasaku, K. et al. Inline spectrometer for shot-by-shot determination of pulse energies of a two-color X-ray free-electron laser. J. Synchrotron. Radiat. 23, 331–333 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schneider, T. R. & Sheldrick, G. M. Substructure solution with SHELXD. Acta. Crystallogr. D Biol. Crystallogr. 58, 1772–1779 (2002).
Article PubMed CAS Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta. Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
Article PubMed CAS Google Scholar
Murshudov, G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta . Crystallogr. D Biol. Crystallogr. 67, 355–367 (2011).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the X-ray Free-Electron Laser Priority Strategy Program (Ministry of Education, Culture, Sports, Science and Technology of Japan) and partially by the Strategic Basic Research Program (JST) and RIKEN Pioneering Project Dynamic Structural Biology. We acknowledge computational support from the SACLA High Performance Computing system. The research was supported by the Max Planck Society and Dynamic Alliance for Open Innovation Bridging Human, Environment and Materials and TAGEN project of Tohoku University. We thank Dr. Roland van Gessel, Bracco Imaging Deutschland, Konstanz, Germany, for the very generous gift of the sample of gadoteridol.

Author information

Authors and Affiliations

Max-Planck-Institut für medizinische Forschung, Jahnstrasse 29, Heidelberg, 69120, Germany
Alexander Gorel, R. Bruce Doak, Marie Luise Grünbein, Mario Hilpert, Marco Kloos, Gabriela Kovácsová, Karol Nass, Christopher M. Roome, Robert L. Shoeman, Lutz Foucar, Thomas R. M. Barends & Ilme Schlichting
Institute of Multidisciplinary Research for Advanced Materials, Tohoku University, Sendai, 980-8577, Japan
Koji Motomura, Hironobu Fukuzawa & Kiyoshi Ueda
RIKEN SPring-8 Center, Kouto 1-1-1, Sayo, Hyogo, 679-5148, Japan
Koji Motomura, Hironobu Fukuzawa, Ichiro Inoue, Eriko Nango, Rie Tanaka, Makina Yabashi, So Iwata & Kiyoshi Ueda
Department of Cell Biology, Graduate School of Medicine, Kyoto University, Yoshidakonoe-cho, Sakyo-ku, Kyoto, 606-8501, Japan
Eriko Nango & So Iwata
Japan Synchrotron Radiation Research Institute, 1-1-1 Kouto, Sayo-cho, Sayo-gun, Hyogo, 679-5198, Japan
Kensuke Tono & Yasumasa Joti

Authors

Alexander Gorel
View author publications
You can also search for this author in PubMed Google Scholar
Koji Motomura
View author publications
You can also search for this author in PubMed Google Scholar
Hironobu Fukuzawa
View author publications
You can also search for this author in PubMed Google Scholar
R. Bruce Doak
View author publications
You can also search for this author in PubMed Google Scholar
Marie Luise Grünbein
View author publications
You can also search for this author in PubMed Google Scholar
Mario Hilpert
View author publications
You can also search for this author in PubMed Google Scholar
Ichiro Inoue
View author publications
You can also search for this author in PubMed Google Scholar
Marco Kloos
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela Kovácsová
View author publications
You can also search for this author in PubMed Google Scholar
Eriko Nango
View author publications
You can also search for this author in PubMed Google Scholar
Karol Nass
View author publications
You can also search for this author in PubMed Google Scholar
Christopher M. Roome
View author publications
You can also search for this author in PubMed Google Scholar
Robert L. Shoeman
View author publications
You can also search for this author in PubMed Google Scholar
Rie Tanaka
View author publications
You can also search for this author in PubMed Google Scholar
Kensuke Tono
View author publications
You can also search for this author in PubMed Google Scholar
Yasumasa Joti
View author publications
You can also search for this author in PubMed Google Scholar
Makina Yabashi
View author publications
You can also search for this author in PubMed Google Scholar
So Iwata
View author publications
You can also search for this author in PubMed Google Scholar
Lutz Foucar
View author publications
You can also search for this author in PubMed Google Scholar
Kiyoshi Ueda
View author publications
You can also search for this author in PubMed Google Scholar
Thomas R. M. Barends
View author publications
You can also search for this author in PubMed Google Scholar
Ilme Schlichting
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.S., M.K., G.K. prepared and characterised samples, R.B.D., R.L.S., G.K., M.L.G., M.K. designed and operated sample injection hardware, M.Y., Y.J, S.I. were involved in preparations for the experiment, R.B.D., R.L.S., G.K., M.L.G., M.K., I.S., M.H., C.M.R., K.N., T.R.M.B., K.M., H.F., K.U., I.I., K.T., E.N., R.T. performed the experiment, C.M.R., M.H., K.N., T.R.M.B and L.F. performed online processing, A.G. performed off-line processing, T.R.M.B., A.G. phased the data, L.F., T.R.M.B., I.S. jointly supervised the work, T.R.M.B. coordinated the beamtime at SACLA, I.S. designed and coordinated the project, A.G., T.R.M.B. and I.S. wrote the manuscript with input from all the authors.

Corresponding author

Correspondence to Ilme Schlichting.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

41467_2017_754_MOESM1_ESM.pdf

Supplementary Information Supplementary figures, supplementary table, supplementary notes, supplementary methods and supplementary references

Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gorel, A., Motomura, K., Fukuzawa, H. et al. Multi-wavelength anomalous diffraction de novo phasing using a two-colour X-ray free-electron laser with wide tunability. Nat Commun 8, 1170 (2017). https://doi.org/10.1038/s41467-017-00754-7

Download citation

Received: 26 May 2017
Accepted: 25 July 2017
Published: 27 October 2017
DOI: https://doi.org/10.1038/s41467-017-00754-7

This article is cited by

Generation of time-synchronized two-color X-ray free-electron laser pulses using phase shifters
- Myung-Hoon Cho
- Teyoun Kang
- Chi Hyun Shim
Scientific Reports (2023)
Viscosity-adjustable grease matrices for serial nanocrystallography
- Michihiro Sugahara
- Koji Motomura
- Tetsuya Ishikawa
Scientific Reports (2020)
Illumination guidelines for ultrafast pump–probe experiments by serial femtosecond crystallography
- Marie Luise Grünbein
- Miriam Stricker
- Ilme Schlichting
Nature Methods (2020)
Megahertz data collection from protein microcrystals at an X-ray free-electron laser
- Marie Luise Grünbein
- Johan Bielecki
- Ilme Schlichting
Nature Communications (2018)
Two-colour serial femtosecond crystallography dataset from gadoteridol-derivatized lysozyme for MAD phasing
- Alexander Gorel
- Koji Motomura
- Ilme Schlichting
Scientific Data (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Experimental set-up and parameter determination

Efficient two-colour data processing

Phasing

Discussion

Methods

Sample preparation and injection

Wavelength determination using inline spectrometers

Peak identification and thresholding

Data analysis and phasing

Code availability

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links