Extracting time series matching a small-angle X-ray scattering profile from trajectories of molecular dynamics simulations

Shimizu, Masahiro; Okuda, Aya; Morishima, Ken; Inoue, Rintaro; Sato, Nobuhiro; Yunoki, Yasuhiro; Urade, Reiko; Sugiyama, Masaaki

doi:10.1038/s41598-022-13982-9

Download PDF

Article
Open access
Published: 15 June 2022

Extracting time series matching a small-angle X-ray scattering profile from trajectories of molecular dynamics simulations

Masahiro Shimizu¹,
Aya Okuda¹,
Ken Morishima¹,
Rintaro Inoue¹,
Nobuhiro Sato¹,
Yasuhiro Yunoki¹,
Reiko Urade¹ &
…
Masaaki Sugiyama¹

Scientific Reports volume 12, Article number: 9970 (2022) Cite this article

1641 Accesses
2 Citations
Metrics details

Subjects

Abstract

Solving structural ensembles of flexible biomolecules is a challenging research area. Here, we propose a method to obtain possible structural ensembles of a biomolecule based on small-angle X-ray scattering (SAXS) and molecular dynamics simulations. Our idea is to clip a time series that matches a SAXS profile from a simulation trajectory. To examine its practicability, we applied our idea to a multi-domain protein ER-60 and successfully extracted time series longer than 1 micro second from trajectories of coarse-grained molecular dynamics simulations. In the extracted time series, the domain conformation was distributed continuously and smoothly in a conformational space. Preferred domain conformations were also observed. Diversity among scattering curves calculated from each ER-60 structure was interpreted to reflect an open-close motion of the protein. Although our approach did not provide a unique solution for the structural ensemble of the biomolecule, each extracted time series can be an element of the real behavior of ER-60. Considering its low computational cost, our approach will play a key role to identify biomolecular dynamics by integrating SAXS, simulations, and other experiments.

Robust total X-ray scattering workflow to study correlated motion of proteins in crystals

Article Open access 03 March 2023

Energy landscapes from cryo-EM snapshots: a benchmarking study

Article Open access 25 January 2023

Retrieving functional pathways of biomolecules from single-particle snapshots

Article Open access 18 September 2020

Introduction

Many proteins are composed of multiple domains that realize complex and sophisticated functions by rearranging their domains flexibly. For example, each domain works together to introduce DNA supercoiling in DNA topoisomerases and each domain cooperates to refold substrate proteins in some foldases^1,2,3,4. However, revealing their dynamic processes in solution remains challenging.

Small-angle X-ray scattering (SAXS) is a representative experimental technique to study such behavior of biomolecules^5,6,7,8,9. The SAXS profiles include structural information in the range of approximately 1–100 nm, which is usually wide enough to study structural range of multi-domain proteins or their complexes. SAXS intensity I(Q) is calculated as a function of scattering vector (Q):

$$ {\text{Q}} = 4{\uppi }\sin {\uptheta }/{\lambda } $$

(1)

$$ {\text{I}}\left( {\mathbf{Q}} \right) = \mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{N}}} \mathop \sum \limits_{{{\text{j}} = 1}}^{{\text{N}}} {\text{f}}_{{\text{i}}} \left( {\mathbf{Q}} \right){\text{f}}_{{\text{j}}} \left( {\mathbf{Q}} \right)\frac{{\sin \left( {{\mathbf{Q}}{\text{r}}_{{{\text{ij}}}} } \right)}}{{{\mathbf{Q}}{\text{r}}_{{{\text{ij}}}} }} $$

(2)

In Eq. (1), the θ and λ are the scattering angle and the wavelength of the incident X-ray, respectively. Equation (2) is termed Debye’s equation⁹: N is the number of atoms in a system, r_ij is the distance between the i-th and j-th atoms, and f_i (Q) is an atomic form factor of i-th atom, which is regarded as a constant in the measured small-angle range.

Since all molecules in a sample solution contribute an experimental SAXS profile, the profile includes information on their structural ensemble. There could be two different types of structural sets of which ensembles reproduce the same SAXS profile. One is a “homogeneous structural set” in which all molecules have a similar structure. The other is a “heterogeneous structural set”. The latter includes the diverse structures, but their averaged SAXS profile reproduces the experimental one. It is difficult to determine which set should be adopted as the state of the molecule in solution. In other words, we cannot judge whether all molecules have similar or diverse structures only from an experimental SAXS profile.

To address this issue, two main criteria are considered in the context of structural modeling; they are reviewed as “maximum parsimony” and “maximum entropy”¹⁰. In the “maximum parsimony” approach, a structural set composed of a small number of models is selected among possible structural sets matching an experimental SAXS profile. For example, the Akaike information criterion or Bayesian information criterion is calculated for possible structural sets, and a structural set minimizing the criteria is chosen^11,12. Many algorithms, such as ensemble optimization, minimal ensemble search, sparse ensemble selection, and maximum occurrence, have been proposed and utilized^{13,14,15,16,17}.

In the “maximum entropy” approach, a free energy landscape derived from a simulation force field is adopted as the prior distribution. The resultant ensemble should match an experimental SAXS profile and be least inconsistent with the force field. A force field does not always reproduce a real structural ensemble of a molecule. Therefore, correction of a free energy landscape to match an experimental SAXS profiles is often effective. There are two ways to correct a free energy landscape while satisfying entropy maximization. One direct approach is reweighting the free energy landscape to match an experimental SAXS profile after structural sampling by molecular dynamics (MD) simulations or Monte Carlo simulations^{18,19,20,21,22}. The other approach is to perform parallel simulations with additional potential to reproduce an experimental SAXS profile^{23,24,25,26,27}.

Although both methods are effective, they are not necessarily sufficient to model any biomolecular systems. The structural set composed of a small number of models implies that the resulting structures discretely distribute in a structural space. In a largely fluctuating system, such as a multi-domain protein with intrinsic disordered regions, it is more reasonable to model a structural set which continuously distributes in a structure space.

Maximum entropy approaches are useful in that they can construct a physically reasonable structural set. It is necessary to sufficiently explore possible molecular structures in these approaches. However, sufficient stuructural sampling is often difficult for atomistic MD simulations. Coarse-grained (CG) MD simulation is a useful alternative to overcome this difficulty^19,26. In many CG models, each CG bead reproduces net charge and hydrophilicity of their corresponding atom set, resulting in roughly reasonable inter- and intra-molecular interfaces in the simulations^28,29,30. However, parameter adjustment using experimental data is often required^31,32. A general-purpose CGMD potential does not necessarily guarantee accurate dissociation constant for the interfaces of a biomolecule. Therefore, it may not always be suitable to perform entropy maximization using a free energy landscape from a given CGMD potential as the prior distribution.

Here, we propose another approach to enumerate possible behaviors of a biomolecule using an experimental SAXS profile and CGMD simulations. In this method, CGMD simulations are first performed to obtain trajectories that efficiently cover their conformational space. Then, time series that match the SAXS profile are extracted from the trajectories. If the resulting time series are long enough, they reflect information on possible preferred states of the biomolecule; the molecule stays in the stable states for a longer time than unstable ones.

We tested our method on a multi-domain protein ER-60, which is a member of protein disulfide isomerase family³. ER-60 is composed a, b, b′, a′ domains and possesses reaction Cys-Gly-His-Cys (CGHC) motifs in both the a and a′ domains (Fig. 1). The a and a′ domains are respectively connected to the b and b′ domains via short hinge regions^3,33. We focused on the domain dynamics of ER-60. We could extract multiple time series with our method. By examining domain conformation of ER-60 in each of the time series, we got overview of possible structural ensembles of the multi-domain protein. Distance between the a and a′ domains almost linearly correlated with I(Q) at each Q value. Therefore, diversity in scattering curves among structures was explained by open-close motion of ER-60. In addition, this linear relationship was indicated as two isosbestic points in Q-I(Q) plot. The actual structural ensemble of ER-60 in solution can be a mixture of these possible domain dynamics. Our method provides "elements of biomolecular motion”, which should be useful in the context of an integrated structural biology including SAXS, MD simulations, and other experiments.

Methods

Modeling strategy

To compose a series of structural models that reproduces a SAXS profile, we made the following two assumptions about CGMD:

1.
When stable inter- or intra- protein interfaces appear in CGMD simulations, they are regarded as candidates of actual interfaces, and
2.
Since atomistic-scale interactions, such as hydrogen bonds and hydrophobic interactions, cannot be expressed precisely in CGMD simulations, the accurate affinity of the interfaces is not guaranteed.

Based on these assumptions, we devised a method to collect a series of structural models that include possible stable states. This is composed of two steps. In step 1, CGMD simulations changing a possible parameter are performed. In step 2, from each of the trajectories, the longest continuous time series that reproduces the SAXS profile is extracted. In summary, we clip a part of a CGMD trajectory that matches a SAXS profile and approximates the actual behavior of the biomolecule as a repetition of the clipped time series. We designate this method “SAS-CLIP”. Although only a single region is clipped from each trajectory, we can expect that possible structural ensembles can be enumerated by applying the SAS-CLIP to multiple trajectories. This point is studied in the “Results and discussion” section below. Using the multi-domain protein ER-60 as a model system, we examined feasibility and investigated the resulting structural series of SAS-CLIP.

MD simulations

All simulations were performed using GROMACS 2020.4^34,35. Before CGMD simulations, it was necessary to clarify the structural stability of three folded regions, from Ser²⁵ to Pro¹³⁴ (system a), that from Ala¹³² to Lys³⁶⁶ (system b-b′), and from Tyr³⁶⁴ to Glu⁴⁹³ (system a′). For this purpose, single atomistic simulations for the three systems were first performed.

CGMD simulations were performed using the Martini 3 open-beta version. In the CGMD simulations, the Lennard–Jones potential between water and ER-60 was scaled, as in previous reports^19,20. The Lennard–Jones potential between i-th and j-th particles is described as:

$$ {\text{E}}_{{{\text{Lennard}} - {\text{Jones}}}} = 4{\upvarepsilon }_{{{\text{ij}}}} \left[ {\left( {\frac{{{\upsigma }_{{{\text{ij}}}} }}{{{\text{r}}_{{{\text{ij}}}} }}} \right)^{12} - \left( {\frac{{{\upsigma }_{{{\text{ij}}}} }}{{{\text{r}}_{{{\text{ij}}}} }}} \right)^{6} } \right] $$

In this study, we regarded the ε_ij as a parameter. When i-th and j-th bead pair is water-protein bead pair, the ε_ij is treated as λ_WP ε_ij,default. Here, the ε_ij,default is the default value in the Martini 3 open-beta version, and λ_WP is the scaling factor for water-protein interactions. The σ_ijs are constants depending on bead type, and r_ij is distance between i-th and j-th particles. First, we performed three 5000-ns production runs with λ_WP values of 1.0, 1.01, 1.02, 1.03, 1.04, 1.05, and 1.06. According to the results (described in “Results and discussion” section), we additionally performed two 5000-ns production runs with λ_WP of 1.035, 1.043, 1.045, 1.046, 1.049, 1.052, and 1.055. We also performed two 10,000-ns production runs with λ_WP values of 1.04, 1.043, 1.046, 1.049, 1.052, and 1.055. Snapshots taken every 2 ns were used for analysis. For simulation detail, please see the Supplementary Information.

Analysis

SAXS profiles of snapshots in the CGMD simulations were calculated by Pepsi-SAXS³⁶. Since the Pepsi-SAXS requires an atomistic model, the CGMD snapshots were reverse-mapped with the software backward³⁷ before running the Pepsi-SAXS. Detailed description of the reverse-mapping is in the Supplementary Information.

The χ² value is given as follows:

$$ {\upchi }^{2} = \frac{1}{{{\text{N}} - 1}}\mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{N}}} \left( {\frac{{{\text{I}}_{{{\text{exp}}}} \left( {{\text{Q}}_{{\text{i}}} } \right) - {\text{I}}_{{{\text{sim}}}} \left( {{\text{Q}}_{{\text{i}}} } \right)}}{{{\upsigma }\left( {{\text{Q}}_{{\text{i}}} } \right)}}} \right)^{2} $$

$$ {\text{I}}_{{{\text{sim}}}} \left( {{\text{Q}}_{{\text{i}}} } \right) = {\text{c}}\mathop \sum \limits_{{{\text{j}} = 1}}^{{\text{M}}} {\text{I}}_{{{\text{sim}},{\text{ j}}}} \left( {{\text{Q}}_{{\text{i}}} } \right) - {\text{offs}} $$

where N numbers of data points and M is the number of models in a structure set. I_exp(q_i) and I_sim,j (q_i) are the scattering intensities of experimental SAXS profile and calculated scattering intensity of j th model, respectively. The theoretical profiles are normalized by I_sim,j(0). c and offs are adjustment parameters. The latter parameter explains possible mismatch between buffer solution and sample solution in the experimental SAXS profile. The parameters are determined by the condition: $\frac{{\partial {\upchi }^{2} }}{{\partial {\text{c}}}} = 0$ and $\frac{{\partial {\upchi }^{2} }}{{\partial {\text{offs}}}} = 0$.

The distance distribution function p (r) was calculated using the reverse-mapped atomistic model. For the calculation, all electrons were considered to be in the center of an atom, and hydrogen atoms were ignored.

Since snapshots were acquired every 2 ns for analysis, the duration time of the time series L_t was calculated by multiplying the number of snapshots in the time series by 2 ns/frame.

The structure of ER-60 was analyzed based on the domain positions and orientations. Domains of ER-60 are mainly defined based on domain database Pfam³⁸, where ER-60 consists of three domains: from Ser²⁶ to Lys¹³⁰ (a domain), from Phe¹⁶⁰ to Asp³⁵⁵ (b-b′ domains), and from Pro³⁷⁷ to Arg⁴⁸² (a′ domain). In previous reports, ER-60 was treated as four domain protein^3,4, where the b-b′ domain was further divided into b and b′ domains. The θ_a-b-b′, θ_b-b′-a′, and φ_{a-b-b′-a′} are angles formed by centers of mass (COMs) of the three domains or dihedral angle formed by COMs of the four domains (Supplementary Fig. S1). φ_{b′-b-a-CGHC(a)} and φ_{b-b′-a′-CGHC(a′)} are angles formed by COMs of the three domains and the COM of the CGHC motif of either the a or a′ domain. The D_a-a′ is distance between COMs of a and a′ domains. The COM of each domain or CGHC motif is defined as the averaged coordinates of backbone beads of the corresponding residues (Supplementary Fig. S1). Probability distributions were calculated by binning a variable space and counting the number of data points in each bin.

The difference in the shape of the (θ_a-b-b′, θ_b-b′-a′) probability map between a clipped time series with the SAS-CLIP and its original trajectory (i.e., the entire trajectory before performing the SAS-CLIP) was evaluated using the following Kullback–Leibler divergence:

$$ {\text{KL}}_{{{\text{SAS}} - {\text{CLIP}}}} = \mathop \sum \limits_{{\left( {{\uptheta }_{{{\text{a}} - {\text{b}} - {\text{b}}^{\prime}}} ,{{\uptheta }}_{{{\text{b}} - {\text{b}}^{\prime} - {\text{a}}^{\prime}}} } \right) \subset {\mathbf{C}}1 \cup {\mathbf{C}}2}} {\text{P}}_{{{\text{SAS}} - {\text{CLIP}}}} \left( {{\uptheta }_{{{\text{a}} - {\text{b}} - {\text{b}}^{\prime} }} ,{{\uptheta }}_{{{\text{b}} - {\text{b}}^{\prime} - {\text{a}}^{\prime} }} } \right)\log \frac{{{\text{P}}_{{{\text{SAS}} - {\text{CLIP}}}} \left( {{\uptheta }_{{{\text{a}} - {\text{b}} - {\text{b}}^{\prime} }} ,{{\uptheta }}_{{{\text{b}} - {\text{b}}^{\prime} - {\text{a}}^{\prime} }} } \right)}}{{{\text{P}}_{{{\text{original}}}} \left( {{\uptheta }_{{{\text{a}} - {\text{b}} - {\text{b}}^{\prime} }} ,{{\uptheta }}_{{{\text{b}} - {\text{b}}^{\prime} - {\text{a}}^{\prime} }} } \right)}} $$

Here, P_SAS-CLIP (θ_a-b-b′, θ_b-b′-a′) is the probability at (θ_a-b-b′, θ_b-b′-a′) of a clipped time series, and P_original (θ_a-b-b′, θ_b-b′-a′) is the probability of the original trajectory. KL_SAS-CLIP only considers the region where P_SAS-CLIP (θ_a-b-b′, θ_b-b′-a′) > 0 and their vicinity. In other words, the sum was calculated only for ${\mathbf{C}}1 \cup {\mathbf{C}}2$, where C1 and C2 are defined as follows (a schematic is presented in Supplementary Fig. S1d):

C1. {(θ_a-b-b′, θ_b-b′-a′) | P_SAS-CLIP (θ_a-b-b′, θ_b-b′-a′) > 0}

C2. {(θ_a-b-b′, θ_b-b′-a′) | Adjacent to C1}These conditions allow KL_SAS-CLIP to be small when ER-60 stays only in a few (or one) of several stable states. Considering that the KL_SAS-CLIP ignores outside the ${\mathbf{C}}1 \cup {\mathbf{C}}2$, the P_original (θ_a-b-b′, θ_b-b′-a′) is normalized to meet the following relationship:

$$ \mathop \sum \limits_{{\left( {{\uptheta }_{{{\text{a}} - {\text{b}} - {\text{b}}\prime }} ,{{\uptheta}}_{{{\text{b}} - {\text{b}}\prime - {\text{a}}\prime }} } \right) \subset {\mathbf{C}}1 \cup {\mathbf{C}}2}} {\text{P}}_{{{\text{original}}}} \left( {{\uptheta }_{{{\text{a}} - {\text{b}} - {\text{b}}\prime }} ,{{\uptheta}}_{{{\text{b}} - {\text{b}}\prime - {\text{a}}\prime }} } \right) = 1 $$

Graphs and figures were created using gnuplot and inkscape. Images of protein structures were created using PyMOL³⁹.

Results and discussion

Origin of differences between crystal and solution structures of ER-60

Our previous study showed that the structure of ER-60 in solution differs from the crystal structure⁴ because the SAXS profile calculated from crystal structure did not reproduce the experimental structure. To clarify the origin of this discrepancy, we first examined the dynamics of each folded structure. We performed atomistic MD simulations of three parts of ER-60, including the a, b-b′, and a′ domains. In any of the three simulations, root mean square deviation (RMSD) of the folded region between a simulation snapshot and the crystal structure distributed at approximately 1.5 Å (Supplementary Figs. S2, S3, S4). In the a-, b-b′-, and a′-part simulations, the prevalence of simulation snapshot with RMSD > 2.0 Å were 12.8%, 0.25%, and 21.2%, respectively (Supplementary Fig. S4; all supplementary data are provided). These findings suggest that the structure of each four domain of ER-60 in solution is almost the same as the crystal structure. Therefore, the discrepancy between crystal and solution structures should originate from the domain conformation or domain dynamics in solution.

Feasibility of SAS-CLIP

First, we examined whether a time series with a small χ² could be clipped with SAS-CLIP. Three 5000-ns CGMD simulations were performed for each condition of λ_WP = 1.0, 1.01, 1.02, 1.03, 1.04, 1.05, and 1.06. The SAS-CLIP was applied to each trajectory with “χ² < 3.0” as the criterion for reproducing an experimental SAXS profile which we previously reported⁴. Figure 2 shows the three longest time series, #A-1, #A-2, and #A-3 provided by the SAS-CLIP. Each averaged scattering curve matched the experimental one, suggesting that SAS-CLIP worked well (Fig. 2b). The duration time (L_t) of #A-1, #A-2, and #A-3 were 2690, 2208, and 1462 ns, respectively (Table 1). The times were long enough to elucidate several preferred conformations originating from the CGMD force field (Supplementary Fig. S5). The finding indicated that SAS-CLIP can capture physically reasonable structural series. Interestingly, χ² values for each simulation snapshot in the series were distributed broadly in the range below 350 (Supplementary Fig. S6). Nevertheless, the entire structure series reproduced the experimental SAXS profile. These structural sets could not be obtained by simply collecting individual structures with small χ².

Table 1 λ_WP, L_t, χ², and KL_SAS-CLIP of time series obtained by the SAS-CLIP.

Full size table

Criteria for a time series to be consistent with both CGMD simulations and a SAXS profile

First, the criteria for a clipped time series to be consistent with CGMD simulations were examined in more detail. When L_t is too small, the conformational distribution of ER-60 in the clipped time series can be quite different from that of its original trajectory. To visualize this, several time series satisfying χ² < 3.0 were clipped from the same original trajectory as #A-2. The conformational distributions of the time series with L_t ≤ 200 ns were clearly sparse and hardly reproduced the distribution of their original 5000-ns trajectory (Fig. 3a, b). Therefore, these conformational distributions did not reproduce the distribution that naturally arises from the CGMD force field. We do not regard such a time series as reflecting a force field. Therefore, we should establish criteria to eliminate the time series. The difference in conformational distribution between a clipped time series and its original trajectory can be a measure of the relationship between the distribution and CGMD force field. Based on this idea, we defined KL_SAS-CLIP, indicating the difference in (θ_a-b-b′, θ_b-b′-a′) probability distribution between a clipped time series and its original trajectory (detailed definition is described in “Methods” section). As expected, KL_SAS-CLIPs were larger for the time series with smaller L_t (Fig. 3a). To examine the relationship between KL_SAS-CLIP and L_t, time series with χ² < 3.0 were extensively collected from the same original trajectories as #A-1, #A-2, and #A-3. In Fig. 3c, a monotonically decreasing curve is clearly observed. The slope of this curve decreased as L_t increased, and KL_SAS-CLIP remained at ~ 0.5 for the region where L_t > 700 ns. Consequently, two conditions “KL_SAS-CLIP is approximately 0.5 or less” and “L_t is larger than 700 ns” can be criteria for identifying a clipped time series that reflects the CGMD potential function well. Note that #A-1, #A-2, and #A-3 satisfy both the conditions (Table 1). The Kullback–Leibler divergence-based evaluation would be generally applied to other molecules or systems.

Second, the criterion for reproducing the experimental SAXS profile, which was initially χ² < 3.0, was reconsidered. Although the #A-1, #A-2, and #A-3 satisfy the criterion, the squared residuals $\left( {\frac{{{\text{I}}_{{{\text{exp}}}} \left( {\text{Q}} \right) - {\text{I}}_{{{\text{sim}}}} \left( {\text{Q}} \right)}}{{{\upsigma }\left( {\text{Q}} \right)}}} \right)^{2}$ exceeded 20.0 in the region 0.1 Å⁻¹ ≤ Q ≤ 0.2 Å⁻¹ (Fig. 2c, and residuals are shown in Supplementary Fig. S7a). This Q corresponds to correlation length between 31.4 Å and 62.8 Å. Considering that each of the a, b, b′, and a′ domains is a globular structure with a diameter of 20–30 Å, the I(Q) of this Q contains information on domain conformation. That is, the SAS-CLIP could extract structural series with incorrect domain conformation even with small χ². According to the observations, we improved the criteria: satisfying both “χ² < 3.0” and “the $\left( {\frac{{{\text{I}}_{{{\text{exp}}}} \left( {\text{Q}} \right) - {\text{I}}_{{{\text{sim}}}} \left( {\text{Q}} \right)}}{{{\upsigma }\left( {\text{Q}} \right)}}} \right)^{2}$ < 12.5 for Q < 0.25 Å⁻¹”.

A condition for obtaining time series with large L _t

To enumerate the possible structural ensembles of ER-60, it is important to efficiently collect ensembles with a large L_t. We focused on the relationship between λ_WP and L_t. When λ_WP was small, L_t of the clipped time series tended to be small (Supplementary Fig. S8). Therefore, it was preferable to perform simulations with λ_WP ≥ 1.03.

Enumerating possible ensemble of ER-60 with SAS-CLIP

The question whether SAS-CLIP can enumerate various ensembles of ER-60 was addressed. According to the results above, we additionally performed CGMD simulations with λ_WP between 1.03 and 1.06 and increased available trajectories. The SAS-CLIP was re-executed with the improved criteria of both “χ² < 3.0” and “$\left( {\frac{{{\text{I}}_{{{\text{exp}}}} \left( {\text{Q}} \right) - {\text{I}}_{{{\text{sim}}}} \left( {\text{Q}} \right)}}{{{\upsigma }\left( {\text{Q}} \right)}}} \right)^{2}$ < 12.5 for Q < 0.25 Å⁻¹”. We obtained six time series with L_t > 700 ns. In particular, the L_t was greater than 1 μs for each of the six time series (Table 1), which were designated as #B-1, #B-2, #B-3, #B-4, #B-5, and #B-6, respectively. KL_SAS-CLIP values were reasonably small (Table 1). Their calculated SAXS profile reproduced the experimental one quite well; The new criteria reduced not only the squared residuals (Fig. 4 and Supplementary Fig. S9, and the residuals are presented in Supplementary Fig. S7b) but also the χ² values (Table 1). We noted that the extracted time series included many individual snapshots with large χ² even with the stricter criteria (Supplementary Fig. S10).

Next, the domain structures obtained with the SAS-CLIP were examined. First, to overview the architecture of ER-60, the two angles θ_a-b-b′ and θ_b-b′-a′ were plotted (Fig. 5a). Among the six time series from #B-1 to #B-6, the θ_a-b-b′ was mainly distributed ranging from 70° to 110° and θ_b-b′-a′ ranged from 75° to 150°. The distributions were similar for the other time series that satisfied L_t > 400 ns (Supplementary Fig. S11 and Table S1). The distribution in Fig. 5a could be roughly classified into two groups. In the first group, the θ_a-b-b′ distributed at approximately 90° and several clusters were observed in the two-dimensional map (#B-1 and #B-2). In the second group, θ_a-b-b′ was distributed at < 90° (#B-3, #B-4, #B-5, #B-6). While the structural trends were common, the detailed distributions differed from each other. Additionally, we examined the dihedral angle φ_{a-b-b′-a′}. φ_{a-b-b′-a′} distributed between − 15° and 75° (Supplementary Figs. S12, S13). We could not find a clear correlation between φ_{a-b-b′-a′} and θ_a-b-b′, and between φ_{a-b-b′-a′} and θ_b-b′-a′.

Subsequently, we examined whether the a and a′ domains prefer a particular orientation in their motion. Supplementary Fig. S14 displays distributions of φ_{b′-b-a-CGHC(a)} and φ_{b-b′-a′-CGHC(a′)} for the six time series. In terms of the domain orientations, one or two preferred ones were observed for each time series. The preferred orientations also differed between the time series. In other words, the SAS-CLIP enumerated the possible distributions of the orientations of the a- and a′-domains.

Finally, the tertiary structures of ER-60 in the clipped ensembles were overviewed. Several structures featuring θ_a-b-b′ and θ_b-b′-a′ with high appearance probability are shown in Fig. 5b. All structures are U-shaped. In structures III and IV, the flexible C-terminal region (Glu⁴⁸³-Leu⁵⁰⁵) bridged a and a′ domain. These were temporary bridging in the #B-2. Similar bridging was observed in #B-1, #B-3, #B-4, and #B-5, with lifetimes varying in the range 40–160 ns. The C-terminal loop might contribute to the functional motion of ER-60 via the domain bridging.

In summary, we confirmed that SAS-CLIP can enumerate a variety of ensembles of ER-60. The four-domain architecture could be roughly classified into two classes, whereas the detailed conformational distribution differed among the clipped ensembles. The orientations of the a- and a′-domains were more diverse among the clipped ensembles.

Analysis of isosbestic points in the set of scattering curves

Reflecting the structural diversity in each clipped ensemble, their calculated scattering profiles were also diverse (Fig. 6, Supplementary Fig. S15). We next discussed significance of isosbestic points appearing in the set of scattering curves. According to a previous study⁴⁰, an isosbestic point suggests existence of a variable having a approximate linear relationship with the scattering intensity as follows;

$$ {\text{I}}\left( {{\text{Q}},{\text{X}}} \right)\sim {\text{I}}\left( {{\text{Q}},{\text{X}}_{0} } \right) + \left( {{\text{X}} - {\text{X}}_{0} } \right)\frac{{\partial {\text{I}}\left( {{\text{Q}},{\text{X}}} \right)}}{{\partial {\text{X}}}} $$

Here, X is the variable and X₀ is a parameter. The difference in I(Q) between molecular structures corresponds to difference in the X. Therefore, the X is a conformational coordinate.

In other words, a single molecular motion along the X can explain diversity of calculated SAXS profiles in a structural ensemble when an isosbestic point is found in Q-I(Q) plot. We verified this linear relationship for the ensembles of ER-60 obtained with SAS-CLIP. In Fig. 6a, the calculated scattering curves of 10 structures in the #B-1, #B-2, and #B-5 are shown, respactively. The I(Q) varied with the structure, especially around Q = 0.07 Å⁻¹ and Q = 0.2 Å⁻¹. In addition, intersections of the curves were concentrated at two regions around Q = 0.12 Å⁻¹ (shown by arrows in Fig. 6a). Distribution of I(Q) at each Q value was also calculated to get an overview of all the scattering curves (Fig. 6b). Again, sharp I(Q) distributions were observed around Q = 0.12 Å⁻¹. These results indicated that there were two approximate isosbestic points around Q = 0.12 Å⁻¹. Corresponding to the I(Q) distribution, p(r) showed sharp distribution between 50 and 60 Å (Fig. 6c). Similary, two isosbestic points were also observed in each of #B-3, #B-4, and #B-6 (Supplementary Fig. S15). The two isosbestic points were not Q values at which only intra-domain scattering appear (Supplementary Fig. S16). Based on the results, a single conformational coordinate was expected to explain the diversity of the SAXS profiles.

Indeed, the distance between centers of a and a′ domain (D_a-a′) approximately linearly correlated with I(Q) (Fig. 7). Here, the relationship between I(Q) and the D_a-a′ was shown for three points, Q = 0.07 Å⁻¹, 0.12 Å⁻¹, and 0.2 Å⁻¹, respectively. Although the linearity depending on the time series (e.g. The relation was clear in #B-5, but that was relatively weak in #B-1.), the relationship was common to the six ensembles (Supplementary Figs. S17, S18, S19).

In summary, our result supports that a single molecular motion can explain diversity of I(Q)s in a structural ensemble when an isosbestic point is found in Q-I(Q) plot. In ER-60, that was open-close motion of the a and a′ domains.

Interpretation and application of SAS-CLIP

SAS-CLIP does not provide a unique solution of the structural ensemble of a biomolecule. The structural ensemble obtained by this method differ from each other, but they all reproduce the same SAXS profile. This also means that any combination of these time series also reproduce the experimental SAXS profile. Additionally, the structural distribution of each clipped time series approximates that of a long-term MD simulation. Therefore, each time series obtained by SAS-CLIP can be an element of motion of a biomolecule, and any combination of these are candidates for the real structural ensemble.

Here, we propose a method to identify a structural ensemble of a biomolecule using clipped time series and another experimental data. With X as a quantity obtained in an experiment other than SAXS, X will be expressed as follows:

$$ {\mathbf{X}} = \mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{N}}} {\text{c}}_{{\text{i}}} {\mathbf{X}}_{{{\text{SAS}} - {\text{CLIP}},{\text{ i}}}} $$

where N is the number of structural series obtained from SAS-CLIP. The X_SAS-CLIP,i is the quantity for i-th structure series of the SAS-CLIP, which is averaged over all structures in the i-th series. c_i is a weight factor. A reasonable structural ensemble that is consistent with any of SAXS, CGMD, and another experiment can be obtained by simply determining the c_is. We can assume a variety of data are the quantity X. In fact, many experimental data represent the average quantity over all molecules in solution. In addition, many kinds of experimental data can be calculated from a given atomistic structures. For example, the profile of small-angle neutron scattering (SANS)^41,42,43,44, chemical shift for nuclear magnetic resonance (NMR)^45,46, and efficiency of fluorescence resonance energy transfer (FRET)⁴⁷ can be calculated from tertiary structures. The proposed method has at least two advantages. First, the method is advantageous for constructing realistic ensembles of highly flexible biomolecules; Combination of clipped ensembles with SAS-CLIP results in an ensemble with a very large number of structures. Second, resultant ensembles are consistent with a force field of MD simulation regardless of c_i values. The linear combination approach can also avoid the possible effect of artificial free energy minimum derived from an incorrect simulation force field, which is often a problem in the entropy maximization approach. Sufficient ensembles should be clipped to perform such an analysis.

SAS-CLIP can be applied to atomistic MD simulations. It is easier for the atomistic simulations to compare a simulation step with real time. When the SAS-CLIP is applied to atomistic simulations, extracted time series will be useful to analyze experimental data including temporal information, such as neutron spin echo.

When L_t is large, KL_SAS-CLIP is sufficiently small (Table 1). If we regard an original CGMD trajectory as a subspace of the free energy landscape, each clipped time series reproduces “a subspace of the subspace”. In other words, the SAS-CLIP provides time series that roughly trace a subspace of a free energy landscape of a biomolecule. This contrasts with the entropy maximization approach, where an entire free energy landscape is reproduced as much as possible. We designed our method not to narrow down the behavior of a biomolecule based on two facts. First, an SAXS profile does not contain enough information to identify structural ensemble of a biomolecule with high resolution. Second, simulation force fields contain incorrectness. Instead of narrowing an ensemble down, SAS-CLIP is designed to be easily combined with other experimental data.

In practice, there are three major advantages of SAS-CLIP. First, time series can be extracted from relatively short MD simulations with low computational cost. We can obtain many time series at the same time when multiple simulations are performed in parallel. Second, obtained structural sets reflect a force field of MD simulations, and thus they are candidates of “element of motions”. Third, this method makes it easy to obtain a structural ensemble which matches SAXS, MD, and another experiment with simple linear model. This method will provide a new way to study biomolecules by integrating various type of experiments.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Code availability

GROMACS: https://www.gromacs.org/.

GROMACS 2020.4: https://doi.org/10.5281/zenodo.4054979.

PyMOL: https://pymol.org/2/.

Martini v3 beta: http://cgmartini.nl/index.php/martini3beta.

martinize2: https://github.com/marrink-lab/vermouth-martinize.

Pepsi-SAXS: https://team.inria.fr/nano-d/software/pepsi-saxs/.

Backward: http://cgmartini.nl/index.php/tools2/resolution-transformation.

Gnuplot: http://www.gnuplot.info/.

Inkscape: https://inkscape.org/.

References

Basu, A. et al. Dynamic coupling between conformations and nucleotide states in DNA gyrase. Nat. Chem. Biol. 14, 565–574. https://doi.org/10.1038/s41589-018-0037-0 (2018).
Article CAS PubMed Google Scholar
Mills, M., Tse-Dinh, Y. C. & Neuman, K. C. Direct observation of topoisomerase IA gate dynamics. Nat. Struct. Mol. Biol. 25, 1111–1118. https://doi.org/10.1038/s41594-018-0158-x (2018).
Article CAS PubMed PubMed Central Google Scholar
Kozlov, G., Määttänen, P., Thomas, D. Y. & Gehring, K. A structural overview of the PDI family of proteins. FEBS J. 277, 3924–3936. https://doi.org/10.1111/j.1742-4658.2010.07793.x (2010).
Article CAS PubMed Google Scholar
Okuda, A. et al. Solution structure of multi-domain protein ER-60 studied by aggregation-free SAXS and coarse-grained-MD simulation. Sci. Rep. 11, 5655. https://doi.org/10.1038/s41598-021-85219-0 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Bernadó, P., Shimizu, N., Zaccai, G., Kamikubo, H. & Sugiyama, M. Solution scattering approaches to dynamical ordering in biomolecular systems. Biochim. Biophys. Acta Gen. Subj. 1862, 253–274. https://doi.org/10.1016/j.bbagen.2017.10.015 (2018).
Article CAS PubMed Google Scholar
Borges, J. C., Seraphim, T. V., Dores-Silva, P. R. & Barbosa, L. R. S. A review of multi-domain and flexible molecular chaperones studies by small-angle X-ray scattering. Biophys. Rev. 8, 107–120. https://doi.org/10.1007/s12551-016-0194-x (2016).
Article CAS PubMed PubMed Central Google Scholar
Hura, G. L. et al. Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS). Nat. Methods 6, 606–612. https://doi.org/10.1038/nmeth.1353 (2009).
Article CAS PubMed PubMed Central Google Scholar
Murayama, Y. et al. Tracking and visualizing the circadian ticking of the cyanobacterial clock protein KaiC in solution. EMBO J. 30, 68–78. https://doi.org/10.1038/emboj.2010.298 (2011).
Article CAS PubMed Google Scholar
Lattman, E. E., Grant, T. D. & Snell, E. H. Biological Small Angle Scattering: Theory and Practice 19 (Oxford University, 2013).
Google Scholar
Bonomi, M., Heller, G. T., Camilloni, C. & Vendruscolo, M. Principles of protein structural ensemble determination. Curr. Opin. Struct. Biol. 42, 106–116. https://doi.org/10.1016/j.sbi.2016.12.004 (2017).
Article CAS PubMed Google Scholar
Bowerman, S. et al. Determining atomistic SAXS models of tri-ubiquitin chains from Bayesian analysis of accelerated molecular dynamics simulations. J. Chem. Theory Comput. 13, 2418–2429. https://doi.org/10.1021/acs.jctc.7b00059 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bowerman, S., Curtis, J. E., Clayton, J., Brookes, E. H. & Wereszczynski, J. BEES: Bayesian ensemble estimation from SAS. Biophys. J. 117, 399–407. https://doi.org/10.1016/j.bpj.2019.06.024 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Bernadó, P., Mylonas, E., Petoukhov, M. V., Blackledge, M. & Svergun, D. I. Structural characterization of flexible proteins using small-angle X-ray scattering. J. Am. Chem. Soc. 129, 5656–5664. https://doi.org/10.1021/ja069124n (2007).
Article CAS PubMed Google Scholar
Pelikan, M., Hura, G. L. & Hammel, M. Structure and flexibility within proteins as identified through small angle X-ray scattering. Gen. Physiol. Biophys. 28, 174–189. https://doi.org/10.4149/gpb_2009_02_174 (2009).
Article CAS PubMed PubMed Central Google Scholar
Tria, G., Mertens, H. D. T., Kachala, M. & Svergun, D. I. Advanced ensemble modeling of flexible macromolecules using X-ray solution scattering. IUCrJ 2, 207–217. https://doi.org/10.1107/S205225251500202X (2015).
Article CAS PubMed PubMed Central Google Scholar
Berlin, K. et al. Recovering a representative conformational ensemble from underdetermined macromolecular structural data. J. Am. Chem. Soc. 135, 16595–16609. https://doi.org/10.1021/ja4083717 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bertini, I. et al. Conformational space of flexible biological macromolecules from average data. J. Am. Chem. Soc. 132, 13553–13558. https://doi.org/10.1021/ja1063923 (2010).
Article CAS PubMed Google Scholar
Różycki, B., Kim, Y. C. & Hummer, G. SAXS ensemble refinement of ESCRT-III CHMP3 conformational transitions. Structure 19, 109–116. https://doi.org/10.1016/j.str.2010.10.006 (2011).
Article CAS PubMed PubMed Central Google Scholar
Larsen, A. H. et al. Combining molecular dynamics simulations with small-angle X-ray and neutron scattering data to study multi-domain proteins in solution. PLoS Comput. Biol. 16, e1007870. https://doi.org/10.1371/journal.pcbi.1007870 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kassem, N. et al. Order and disorder: an integrative structure of the full-length human growth hormone receptor. Sci. Adv. 7, eabh3805. https://doi.org/10.1126/sciadv.abh3805 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Ahmed, M. C. et al. Refinement of α-synuclein ensembles against SAXS data: comparison of force fields and methods. Front. Mol. Biosci. 8, 654333. https://doi.org/10.3389/fmolb.2021.654333 (2021).
Article CAS PubMed PubMed Central Google Scholar
Antonov, L. D., Olsson, S., Boomsma, W. & Hamelryck, T. Bayesian inference of protein ensembles from SAXS data. Phys. Chem. Chem. Phys. 18, 5832–5838. https://doi.org/10.1039/c5cp04886a (2016).
Article CAS PubMed Google Scholar
Hermann, M. R. & Hub, J. S. SAXS-restrained ensemble simulations of intrinsically disordered proteins with commitment to the principle of maximum entropy. J. Chem. Theory Comput. 15, 5103–5115. https://doi.org/10.1021/acs.jctc.9b00338 (2019).
Article CAS PubMed Google Scholar
Ivanović, M. T., Hermann, M. R., Wójcik, M., Pérez, J. & Hub, J. S. Small-angle X-ray scattering curves of detergent micelles: effects of asymmetry, shape fluctuations, disorder, and atomic details. J. Phys. Chem. Lett. 11, 945–951. https://doi.org/10.1021/acs.jpclett.9b03154 (2020).
Article CAS PubMed Google Scholar
Paissoni, C., Jussupow, A. & Camilloni, C. Determination of protein structural ensembles by hybrid-resolution SAXS restrained molecular dynamics. J. Chem. Theory Comput. 16, 2825–2834. https://doi.org/10.1021/acs.jctc.9b01181 (2020).
Article CAS PubMed PubMed Central Google Scholar
Jussupow, A. et al. The dynamics of linear polyubiquitin. Sci. Adv. 6, eabc3786. https://doi.org/10.1126/sciadv.abc3786 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Shevchuk, R. & Hub, J. S. Bayesian refinement of protein structures and ensembles against SAXS data using molecular dynamics. PLoS Comput. Biol. 13, e1005800. https://doi.org/10.1371/journal.pcbi.1005800 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Tanaka, T., Hori, N. & Takada, S. How co-translational folding of multi-domain protein is affected by elongation schedule: molecular simulations. PLoS Comput. Biol. 11, e1004356. https://doi.org/10.1371/journal.pcbi.1004356 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Terakawa, T., Kenzaki, H. & Takada, S. p53 searches on DNA by rotation-uncoupled sliding at C-terminal tails and restricted hopping of core domains. J. Am. Chem. Soc. 134, 14555–14562. https://doi.org/10.1021/ja305369u (2012).
Article CAS PubMed Google Scholar
Monticelli, L. et al. The MARTINI coarse-grained force field: extension to proteins. J. Chem. Theory Comput. 4, 819–834. https://doi.org/10.1021/ct700324x (2008).
Article CAS PubMed Google Scholar
Shimizu, M. et al. Near-atomic structural model for bacterial DNA replication initiation complex and its functional insights. Proc. Natl. Acad. Sci. USA 113, E8021–E8030. https://doi.org/10.1073/pnas.1609649113 (2016).
Article CAS PubMed PubMed Central Google Scholar
Niina, T., Brandani, G. B., Tan, C. & Takada, S. Sequence-dependent nucleosome sliding in rotation-coupled and uncoupled modes revealed by molecular simulations. PLoS Comput. Biol. 13, e1005880. https://doi.org/10.1371/journal.pcbi.1005880 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Dong, G., Wearsch, P. A., Peaper, D. R., Cresswell, P. & Reinisch, K. M. Insights into MHC class I peptide loading from the structure of the tapasin-ERp57 thiol oxidoreductase heterodimer. Immunity 30, 21–32. https://doi.org/10.1016/j.immuni.2008.10.018 (2009).
Article CAS PubMed PubMed Central Google Scholar
Abraham, M. J. et al. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 19–25. https://doi.org/10.1016/j.softx.2015.06.001 (2015).
Article ADS Google Scholar
Bondi, A. A. van der Waals Volumes and Radii. J. Phys. Chem. 68, 441–451. https://doi.org/10.1021/j100785a001 (1964).
Article CAS Google Scholar
Grudinin, S., Garkavenko, M. & Kazennov, A. Pepsi-SAXS: an adaptive method for rapid and accurate computation of small-angle X-ray scattering profiles. Acta Crystallogr. D Struct. Biol. 73, 449–464. https://doi.org/10.1107/S2059798317005745 (2017).
Article CAS PubMed Google Scholar
Wassenaar, T. A., Pluhackova, K., Böckmann, R. A., Marrink, S. J. & Tieleman, D. P. Going backward: a flexible geometric approach to reverse transformation from coarse grained to atomistic models. J. Chem. Theory Comput. 10, 676–690. https://doi.org/10.1021/ct400617g (2014).
Article CAS PubMed Google Scholar
Mistry, J. et al. Pfam: The protein families database in 2021. Nucl. Acids Res. 49, D412–D419. https://doi.org/10.1093/nar/gkaa913 (2020).
Article CAS PubMed Central Google Scholar
The PyMOL Molecular Graphics System, Version 1.8, Schrödinger, LLC.
Greger, M., Kollar, M. & Vollhardt, D. Isosbestic points: how a narrow crossing region of curves determines their leading parameter dependence. Phys. Rev. B 87, 195140. https://doi.org/10.1103/PhysRevB.87.195140 (2013).
Article ADS CAS Google Scholar
Svergun, D. I. et al. Protein hydration in solution: experimental observation by X-ray and neutron scattering. Proc. Natl. Acad. Sci. USA 95, 2267–2272. https://doi.org/10.1073/pnas.95.5.2267 (1998).
Article ADS CAS PubMed PubMed Central Google Scholar
Grudinin, S. Pepsi-SANS. https://team.inria.fr/nano-d/software/pepsi-sans/.
Yunoki, Y. et al. Overall structure of fully assembled cyanobacterial KaiABC circadian clock complex by an integrated experimental-computational approach. Commun. Biol. 5, 184. https://doi.org/10.1038/s42003-022-03143-z (2022).
Article CAS PubMed PubMed Central Google Scholar
Matsumoto, A. et al. Structural studies of overlapping dinucleosomes in solution. Biophys. J. 118, 2209–2219. https://doi.org/10.1016/j.bpj.2019.12.010 (2020).
Article ADS CAS PubMed Google Scholar
Han, B., Liu, Y., Ginzinger, S. W. & Wishart, D. S. SHIFTX2: significantly improved protein chemical shift prediction. J. Biomol. NMR 50, 43–57. https://doi.org/10.1007/s10858-011-9478-4 (2011).
Article CAS PubMed PubMed Central Google Scholar
Shen, Y. & Bax, A. SPARTA+: a modest improvement in empirical NMR chemical shift prediction by means of an artificial neural network. J. Biomol. NMR 48, 13–22. https://doi.org/10.1007/s10858-010-9433-9 (2010).
Article CAS PubMed PubMed Central Google Scholar
Okamoto, K. & Sako, Y. Recent advances in FRET for the study of protein interactions and dynamics. Curr. Opin. Struct. Biol. 46, 16–23. https://doi.org/10.1016/j.sbi.2017.03.010 (2017).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by MEXT/JSPS KAKENHI Grant Numbers (JP20K22629 to M. Shimizu; JP19K16088 and 21K15051 to K. M.; JP19KK0071, and JP20K06579 to R. I.; JP17K07816 to N. S.; JP18H05229 and JP18H05534 to M. Sugiyama), and the Sasakawa Scientific Research Grant from The Japan Science Society assigned to A. O. The study was also partially supported by a project for the construction of the basis for advanced materials science and analytical study by the innovative use of quantum beams and nuclear sciences at the Institute for Integrated Radiation and Nuclear Science, Kyoto University (KURNS) and a grants for research promotion in KURNS to M. Shimizu and Y. Y. The study was partially supported by the Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research (BINDS)) from AMED (JP22ama121001j0001) to M. Sugiyama.

Author information

Authors and Affiliations

Institute for Integrated Radiation and Nuclear Science, Kyoto University, Kumatori, Sennan-gun, Osaka, 590-0494, Japan
Masahiro Shimizu, Aya Okuda, Ken Morishima, Rintaro Inoue, Nobuhiro Sato, Yasuhiro Yunoki, Reiko Urade & Masaaki Sugiyama

Authors

Masahiro Shimizu
View author publications
You can also search for this author in PubMed Google Scholar
Aya Okuda
View author publications
You can also search for this author in PubMed Google Scholar
Ken Morishima
View author publications
You can also search for this author in PubMed Google Scholar
Rintaro Inoue
View author publications
You can also search for this author in PubMed Google Scholar
Nobuhiro Sato
View author publications
You can also search for this author in PubMed Google Scholar
Yasuhiro Yunoki
View author publications
You can also search for this author in PubMed Google Scholar
Reiko Urade
View author publications
You can also search for this author in PubMed Google Scholar
Masaaki Sugiyama
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.Sh. and M.Su. designed the modeling method. M.Sh. performed MD simulations and analysed the simulation data. All authors wrote the paper.

Corresponding authors

Correspondence to Masahiro Shimizu or Masaaki Sugiyama.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shimizu, M., Okuda, A., Morishima, K. et al. Extracting time series matching a small-angle X-ray scattering profile from trajectories of molecular dynamics simulations. Sci Rep 12, 9970 (2022). https://doi.org/10.1038/s41598-022-13982-9

Download citation

Received: 07 January 2022
Accepted: 31 May 2022
Published: 15 June 2022
DOI: https://doi.org/10.1038/s41598-022-13982-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.