Introduction

X-ray crystallography has been the workhorse method for determining the high-resolution structures of proteins and protein complexes. However, conventional protein crystallography requires large (1000s of μm3), well-ordered protein crystals and often requires measurements at cryogenic temperatures to mitigate radiation damage. The advent of x-ray free-electron lasers (XFELs), such as the Linac Coherent Light Source (LCLS)1,2 overcomes the need for large crystals and cryogenic temperatures. XFELs produce ultra-short (~10–50 fs) and ultra-bright x-ray pulses, enabling the collection of high-resolution diffraction patterns from protein crystals before the onset of conventional radiation damage (“diffraction-before-destruction”). Diffraction-before-destruction has been demonstrated with microcrystals that are too small to be considered for structure determination with conventional crystallography3,4. Each microcrystal exposed to an XFEL pulse is destroyed, necessitating a serial data acquisition approach in which a data set is collected from thousands of randomly oriented crystals, each exposed only once to the x-ray beam. First demonstrated using micrometer-sized crystals of the membrane protein photosystem I (PSI)3, serial femtosecond x-ray crystallography (SFX) has been extended to demonstrate high resolution structure determination5, has been used to observe novel high-resolution structural information6 and has been applied to solving the structure of a G-Protein Coupled Receptor (GPCR) embedded in lipidic cubic phase (LCP)7. For a recent review the use of x-ray lasers for structural and dynamic biology see8.

One of the drawbacks of the serial crystallography approach to date has been sample consumption, limiting the technique to samples that can be produced in large quantity. SFX experiments to date have used various forms of liquid jet injection and the most often used injector is a gas dynamic virtual nozzle9,10,11. The flow rate needed to maintain the thin jet results in high linear velocity of liquid. As a result, only a very small fraction of the injected protein crystal suspension is probed by x-rays. An approach to SFX that reduces sample consumption is highly desirable and may be essential for proteins that cannot be successfully over-expressed in large quantities. Slow flowing viscous jets extruded using electrospinning12, direct injection of Lipidic Cubic Phase buffer7,13 and pulsed injection provide alternatives in some cases. Yet another approach is to deposit sample on a thin membrane that is then scanned through the x-ray beam. Such a fixed-target approach allows for precise control of where the sample is measured with the x-rays resulting in a much more conservative use of the sample. The main limitation to date for fixed target approaches has been the relatively low data acquisition rate achieved so far: To date, SFX data sets require ~104 good diffraction patterns for reconstruction, each collected from randomly oriented crystals14,15. Consequently, the data collection rate is a critical parameter.

Here, we demonstrate the first proof-of-principle results from fixed-target serial crystallography using the LCLS. Using microcrystals of REP24 (Rapid encystment protein, 24 kDa; pdb16 code 4P5P), a putative virulence factor protein from the intracellular pathogen Francisella tularensis, we demonstrate the ability to collect data at repetition rates that allow, in principle, for collection of complete data on the time scale of about one 12-hour LCLS shift.

Methods

Production of protein crystals

In order to produce microcrystals of REP24, several sparse-matrix crystallization screens were generated and used to make vapor-diffusion crystallization experiments using the sitting-drop technique. The sitting-drop experiments were carried out by mixing 400-nL aliquots of a 14.4 mg/mL REP24 solution (in 50 mM NaCl and 10 mM HEPES pH 7.5) with 400-nL aliquots of the precipitant solution at room temperature. The results of several screens were examined and optimization screens for sitting-drop crystallization experiments were generated around a condition that produced well-formed macrocrystals of REP24-containing 24% polyethyleneglycol-monomethylether with average molecular weight of 750 Da (PEG-MME 750) and 100 mM Na-Acetate, pH 4.5.

The results of the optimization screen were used to generate batch crystallization experiments for the production of the REP24 crystals in bulk. REP24 crystals were grown by mixing a 14.4 mg/mL REP24 sample (in 50 mM NaCl and 10 mM HEPES pH 7.5) with a precipitation solution containing 54% (v/v) PEG-MME 750 and 100 mM Na-acetate pH 4.5 in a 1:1 ratio for final conditions of 7.2 mg/mL REP24, 27% (v/v) PEG-MME 750, 50 mM Na-acetate pH 4.5, 25 mM NaCl and 5 mM HEPES pH 7.5). The crystals of REP24 used in the experiment were between 10 μm and 12 μm in length and had the appearance of two, square-based pyramids connected at the peak, with maximum thickness of 5 μm. In order to produce the REP24 microcrystals for the experiment, 190 μL of the 14.4 mg/mL REP24 solution was used for a total protein mass of 2.74 mg. Approximately 1/5 of the protein could be used to cover an entire chip, for a total protein mass of 540 μg of protein per chip.

Oil-emulsion preservation of protein crystals

Although moving protein macrocrystals from an aqueous environment to Paratone-N is well established, embedding a shower of microcrystals in Paratone-N and removing the aqueous phase required a novel procedure. The REP24 crystal suspension in the mother liquor was filtered through a 20 μm inline filter. The pre-filtered REP24 crystal suspension was centrifuged at ~14,500 g for one minute to generate a pellet. The supernatant was removed and the sample was again centrifuged at 14,500 g for one minute and the remaining supernatant was removed. An aliquot of 30 mg of Paratone-N was added to the pellet (to achieve approximately 50% hit rate) and a small pipette tip followed by a microspear were used to thoroughly mix the sample and suspend the crystals at a crystal density ~1 × 109 crystals/mL.

Vigorous mixing with the microspear allowed for the protein crystals to be separated from the remaining bulk aqueous phase and into the Paratone-N, although crystals often remained surrounded by a small volume of aqueous mother liquor. The REP24 crystal emulsion was allowed to stand at room temperature for several days without a discernable decrease in birefringence of the REP24 microcrystals—further evidence that the crystals had not dehydrated.

Fixed target design and application of the sample to the fixed targets

Fixed-target supports allowed for high data-collection rates using the setup at the Coherent X-ray Imaging (CXI) beam line17 of the Linac Coherent Light Source (LCLS) at SLAC National Accelerator Laboratory. The fixed-target supports were silicon wafers produced by Silson Ltd (Silson Ltd, JBJ Business Park, Northampton Road, Blisworth, Northampton, NN7 3DW, England) and an individual wafer is shown in Fig. S1. The wafers were comprised of 200 μm thick silicon crystals with a 50 nm Si3N4 layer deposited on the (100) face. Long, rectangular windows measuring 200 μm × 8400 μm were etched into the silicon (leaving a 50 nm Si3N4 membrane) to allow for scanning of the sample through the LCLS x-ray beam through the use of the fixed-target stage motors. An overview of the experimental setup is shown in Fig. 1.

Figure 1
figure 1

Experimental overview.

Protein crystals embedded in Paratone-N were placed on fixed-targets and measured in vacuo at the Coherent X-ray Imaging (CXI) beam line of the LCLS.

The sample was applied to the fixed-target wafers by covering the tip of a crystal-manipulation spear with a small drop of the crystal-containing emulsion. The drop was lightly touched to the windows on the Si3N4 side of the chip and gently “painted” across the long edge of the window, leaving a thick streak of the sample. A 1 mm diameter crystal-mounting loop was used to gently spread out the streak of sample between three rows of windows typically resulting in a sample thickness around 20 μm. Once placed in the vacuum, the REP24 crystals embedded in Paratone-N were measured within 1-6 hours.

Experimental setup

REP24 samples were measured using the CXI instrument17 at LCLS using the (1.3 μm)2 focus sample environment. Fig. 1 shows a schematic overview of the experiment. The experiments were carried out using 8 keV x-rays (corresponding to a wavelength of 1.55Å) with an x-ray flux of ~2 × 1012 photons/pulse before accounting for losses due to upstream optics (~50% loss) both at CXI and in the upstream shared part of the LCLS beam line; however, the x rays were attenuated such that transmission was between 0.1% and 10% during these experiments. Accounting for losses due to the upstream optics, the flux on the sample was between 1 × 109–1 × 1011 photons/pulse for 0.1% and 10% transmission, respectively. The pulse durations used for the experiments were estimated to be ~30 fs based on recent measurements using the XTCAV diagnostics18.

Results

High data acquisition rates were achieved by scanning the target at speeds of 500 μm/s and 1000 μm/s using LCLS repetition rates of 5 Hz and 10 Hz, respectively. During each run, diffraction image data were recorded for each x-ray shot using the Cornell-SLAC Pixel Array Detector (CSPAD)19. Approximtely 1/3 of the collected data contained measureable diffraction patterns. Data were analysed using Cheetah20 and CrystFEL21 software suites.

Diffraction patterns exhibited Bragg peaks to ~2.5 Å resolution (Fig. 2). Diffraction patterns were indexed with unit cell constants of a = b = 44.44 Å, c = 183.45 Å, α = β = 90° and γ = 120°, corresponding to a unit-cell volume of 313.8 nm3 and consistent with the unit cell constants of the macrocrystals of REP24 determined at a synchrotron pdb16 code 4P5P. The time between depositing the Paratone-N embedded crystals on the wafer and placing them into the vacuum of the CXI sample chamber to the time samples were measured by x-rays varied between 1–6 hours. There was no apparent degradation of diffraction quality over this time period. X-ray diffraction is an extremely sensitive measure of crystallinity and any degradation of crystal quality would rapidly lead to decreased resolution and spot sharpness, which supports that the crystals were protected from dehydration by the Paratone-N.

Figure 2
figure 2

Typical single diffraction pattern from a REP24 crystal preserved in Paratone-N using the CXI instrument of the LCLS.

The diffraction pattern contains Bragg peaks to 2.5 Å resolution (2Å resolution is indicated by the white circle) and was indexed with unit cell parameters that were consistent with the macrocrystal unit cell of REP24.

Six fast-scan data sets were collected from 10 μm crystals at high concentration (Table 1) with each run corresponding to one 200 μm × 8400 μm window. A total of 610 shots were taken, of which 233 were identified as having Bragg peaks from protein crystals, indicating a hit rate of 38.2%. The mean hit rate was calculated to be 38.2 (+/− 6.7)% for all of the runs. The total time taken to collect images from the 575 shots was 595 s (shot rate of ~1/s), however, the final three runs were completed in 99 s (for 90 shots each run) approaching an average shot (and data acquisition) rate of 3/s and a hit rate of ~1/s.

Table 1 The breakdown of the six runs for the fixed-target serial crystallography experiments using REP24. The three final runs were performed utilizing the 10-Hz scanning method for the data collection. The total time of 590 seconds was the amount of time it took for all six runs of data collection, including some movement and alignment of the windows before starting the fast scanning and the repositioning from one end of a long membrane to the beginning of the next long membrane. Therefore, the average shot and data acquisition rate during the six runs was ~1/s. It was 3/s for the last three runs

Discussion

Two of the major challenges for FT-SFX at LCLS are the need to protect the protein crystals against dehydration as well as the current need to acquire ~104 diffraction patterns for a complete data set suitable for structure solution; previous fixed target experiments achieved data acquisition rates of 0.1 Hz or worse which is insufficient for FT-SFX.

Vacuum protection of protein microcrystals for FT-SFX experiments

Protecting protein crystals from dehydration in vacuum can be accomplished in one of two ways: 1) cryogenically cool the protein crystals, or 2) encapsulate the hydrated environment from the vacuum environment with a material that has minimal water permeability. We focused on encapsulating a hydrated environment so that room temperature data could be acquired, preserving the possibility of time resolved FT-SFX. To encapsulate a hydration environment, we embedded microcrystals in Paratone-N, an oil-based mixture with limited water permeability. Paratone-N is used in conventional protein crystallography as a cryo-protectant and was previously used for conventional structure determination of REP24 (pdb16 code 4P5P). REP24 macrocrystals embedded in Paratone-N for several days at room temperature remained visibly intact and maintained birefringence, indicating that there are no gross adverse effects of Paratone-N on REP24 crystals. Therefore, REP24 microcrystals embedded in Paratone-N were chosen as the first test sample for the FT-SFX experiments.

During initial development of the Paratone embedding procedure it was observed that at higher crystal densities, the protein crystals would cluster together and more thorough mixing was needed to increase the monodispersity of the crystal suspension. A potential solution for the clustering of the crystals would be to separate the crystals from the bulk aqueous phase in smaller batches, in a serial approach.

Data acquisition and hit rates for FT-SFX

Data were collected at a peak rate of 10 Hz and an average rate of 1/s, accounting for the time required to align the individual long membranes on the fixed target to the beam. The overall hit rate for diffraction patterns with Bragg peaks was 38%. This corresponds to ~0.3 hits/s at an average data acquisition rate of 1/s (average for all the runs discussed here) and ~1 hit/s at an average data acquisition rate of 3/s (average for the last three runs here). Assuming the “dead” time for aligning individual membranes can be significantly reduced by using optimized automated scripts for moving the sample stages and assuming LCLS operating at a shot rate of 10 Hz a hit rate of ~3 hits/s may be achievable and approach the ~5 hits/s rate previously demonstrated with lysozyme microcrystals using the liquid jet injector5. Such rates are significantly higher than typical data acquisition rates using fixed targets in other previous experiments at LCLS.

The peak data acquisition rate for FT-SFX will ultimately be determined by the linear velocity of the fixed target stage and the repetition rate of the LCLS due to a (putative) required minimum spacing between subsequent shots that will depend on the extent of the damage from the previous LCLS shot. The minimum useful spacing is dependent on the propagation of damage, but the extent to which damage caused by one LCLS pulse propagates through the sample to adjacent microcrystals is currently poorly characterized for FT-SFX, but the damage propagation will likely depend on the properties of the sample and embedding material. A minimum spacing of 100 μm was chosen for the experiments reported here to minimize effects of damage propagation, however, experiments at SACLA have indicated that damage propagation is contained with a 50 μm region from the sample22. For a given spacing between shots, the peak data acquisition rate will depend on the linear velocity of the motors. Using motors with higher linear velocities of several mm/s, FT-SFX data acquisition rates of 30 Hz or more may be possible.

For both FT-SFX and liquid jet (LJ)-SFX, crystal density is a key determinant of the achievable hit rate. For the liquid jet, the demonstrated hit rates for most samples have been ~1–10%, far below the ideal hit rate of 63% for Poisson statistics (~63% total hit rate, ~37% single crystal hits) that maximizes the single crystal hit rate. The hit rates achievable with the liquid jet are, in practice, often limited by clogging of the nozzle and/or a departure from ideal fluid behavior for suspensions with higher crystal densities. Fixed target approaches should allow for use of samples with higher crystal density23 and therefore higher hit rates than allowed with the liquid jet because there is no requirement to maintain a stable jet. There are probably other practical limits on crystal density in FT-SFX because, for example, sample heterogeneity, or crystal clustering, may limit the achievable crystal density. Although a high crystal density leads to a higher hit rate, the concomitant clustering of the crystals could lead to an increase in patterns measured with multiple hits and a divergence from Poisson statistics for the single crystal hit rate. Multi-crystal hits may also prove useful as CrystFEL21 and other software packages such as cctbx.xfel24 may be able to deconvolve and auto-index diffraction patterns from multiple lattices, provided the unit cell is known.

Time required for complete data sets in SFX

Our preliminary results showing vacuum protection of protein microcrystals and higher data acquisition rates with FT-SFX compared to previous fixed target work suggest FT-SFX is a potentially viable complementary method to the established LJ-SFX. To this end, it is useful to review the amount of sample and time required to produce complete data sets in recent LJ-SFX experiments compared to what will be achievable with FT-SFX.

The initial LJ-SFX experiments using Photosystem I13 involved 1.85 × 106 shots resulting in 8.5 Å electron density maps reconstructed from 16374 patterns, although later analysis suggested as few as 10,000 patterns may be sufficient to achieve ~100% completeness15. Subsequent LJ-SFX experiments with lysozyme involved 1.5 × 106 shots, giving 66,432 patterns identified as “hits” of which 12,247 indexed patterns were used to calculate the electron-density map at 1.9 Å resolution5. REP24 crystals have a high-symmetry space group (P2231) and therefore a similar number of indexable patterns to that required for lysozyme should be sufficient to calculate an electron-density map of similar quality. Assuming a 10% indexing success rate and a 37% single hit rate, we can estimate that up to 340,000 shots would be needed for a complete data set (this may be an overestimate – only 140,000 shots would be needed if we assume a similar indexing success rate of ~25% as reported for lysozyme ref. 5). At 1 Hz average data collection rate, this would require 95 hours of data collection and 125 fixed target wafers similar to the ones used here (Supplementary Fig. 1). To collect data from 1.5 × 106 shots5 at 120 Hz with the liquid jet would require a minimum of 3.5 hours of continuous operation. However, if the average data collection rate for the fixed-target experiments can be increased to 10/s (as compared to the 1–3/s data collection rate demonstrated in the present work), the total collection time can be reduced to less than 10 hours, which is less than one shift of LCLS beam time.

Sample consumption for SFX

Although LJ-SFX currently has a clear advantage over FT-SFX with regard to data acquisition rate, FT-SFX has the potential to drastically reduce sample consumption compared to the current LJ-SFX method. We calculate that FT-SFX could save several orders of magnitude in sample consumption, compared to the LJ-SFX (details provided in supplemental information). The LJ-SFX method with the current liquid jet injector requires a minimum of several tens of milligrams of protein to collect a complete data set3,5. For an optimized FT-SFX experiment using 700 nm protein crystals with the 1.3 μm LCLS focus, we calculate that we would need ~230 μg of protein to collect a complete data set, which would reduce the sample demand by at least two orders of magnitude and will be useful for low-abundance proteins, or proteins that are expensive to produce in milligram quantities, such as recombinantly expressed membrane proteins. (It should be noted, that in the cases where the above mentioned electrospinning jet or LCP jet can be used this sample requirement can be pushed also to below 1 mg).

Areas of improvement for FT-SFX

A persistent challenge for FT-SFX will be protecting protein crystals from the vacuum of XFEL sample chambers. Paratone-N embedding was shown to work for crystals of REP24 (and a few other soluble proteins, such as lysozyme), but is not a generally applicable vacuum protectant; membrane protein crystals dissolve in Paratone-N, for example. Mounting crystals in sealed macro hydration cells, in which the protein crystals remain in their mother liquor, could prove a more generalizable method for FT-SFX. Such a hydration cell configuration would need to be compatible with high data acquisition rates, however. Future work with FT-SFX will clearly require further development of methods to encapsulate hydration environments for microcrystals. XFEL sample chambers may also be equipped with cryo-stages, but that would detract from two of the clear utilities of XFELs for structural biology, namely ultrafast time resolved studies and the study of structures at room temperature.

Another potential issue may be preferential alignment of crystals (e.g., long, needle-like crystals preferentially laying flat on the substrate surface) that might not allow a full sampling of reciprocal space for a substrate surface perpendicular to the incident X-rays, which would impact structure determination. However, this issue, if encountered, can be easily mitigated by the use of a goniometer stage to rotate the normal vector of the fixed target support relative to the incident X-rays. Such a goniometer stage is already available at LCLS and has been used by the authors in other experiments.

It is clear, that for FT-SFX to be viable, an increase in the peak and average shot rates is required. The most straightforward way of achieving the required spacing between shots at high shot rates will be to have motors with faster translational motion for the fixed target stage, which will allow for higher repetition rates while maintaining the necessary minimum separation between shots and optimized automated scripts that move the fixed target from the end of one long membrane to the beginning of the next long membrane with minimal dead time for data acquisition. Another method to increase the average data acquisition rate would be to use windowless sample supports. Windowless supports would increase the number of LCLS shots per given area of the sample holder (the shot density) because there are no supports to impede the transmission of the x rays and windowless supports would make overall alignment time negligible.

Outlook

Both liquid jet injection and the fixed target approach have their advantages and disadvantages and thus are complementary techniques. LJ-SFX techniques offer the advantage of fully hydrated samples (microcrystals can be injected in the mother liquor) and are compatible with future XFEL sources that will have much higher repetition rate (e.g. the EXFEL with ~30 kHz repetition rate25). In contrast, in FT-SFX, microcrystal samples will need to be protected from dehydration (e.g. by embedding in oil or enclosing in hydration chambers) and the data acquisition rate may always be somewhat limited in practice (probably to a few tens per second) by motor speeds, minimum spacing between shots and required mechanical precision. On the other hand, at present, FT-SFX seems to offer a straightforward path to reducing sample consumption for the lower repetition rate XFEL sources, such as LCLS. Moreover, FT-SFX may offer advantages for certain types of time-resolved studies compared to the liquid jet, in particular, optical pump/x-ray probe experiments that involve relatively long pump-probe delay times and/or multiple optical excitation pulses as well as other types of pump-probe experiments that involve e.g., electric field pulses or chemical reactions triggered by liquid mixing by microfluidics.