Predicting the future of excitation energy transfer in light-harvesting complex with artificial intelligence-based quantum dynamics

Ullah, Arif; Dral, Pavlo O.

doi:10.1038/s41467-022-29621-w

Download PDF

Article
Open access
Published: 11 April 2022

Predicting the future of excitation energy transfer in light-harvesting complex with artificial intelligence-based quantum dynamics

Nature Communications volume 13, Article number: 1930 (2022) Cite this article

5264 Accesses
24 Citations
21 Altmetric
Metrics details

Subjects

Abstract

Exploring excitation energy transfer (EET) in light-harvesting complexes (LHCs) is essential for understanding the natural processes and design of highly-efficient photovoltaic devices. LHCs are open systems, where quantum effects may play a crucial role for almost perfect utilization of solar energy. Simulation of energy transfer with inclusion of quantum effects can be done within the framework of dissipative quantum dynamics (QD), which are computationally expensive. Thus, artificial intelligence (AI) offers itself as a tool for reducing the computational cost. Here we suggest AI-QD approach using AI to directly predict QD as a function of time and other parameters such as temperature, reorganization energy, etc., completely circumventing the need of recursive step-wise dynamics propagation in contrast to the traditional QD and alternative, recursive AI-based QD approaches. Our trajectory-learning AI-QD approach is able to predict the correct asymptotic behavior of QD at infinite time. We demonstrate AI-QD on seven-sites Fenna–Matthews–Olson (FMO) complex.

Simulating photosynthetic energy transport on a photonic network

Article Open access 11 March 2024

Vibronic mixing enables ultrafast energy flow in light-harvesting complex II

Article Open access 19 March 2020

Quantum coherences reveal excited-state dynamics in biophysical systems

Article 24 June 2019

Introduction

From the birth of life, solar energy has been the driving force of life. Via the mechanism of photosynthesis, living organisms capture sunlight with the highly sophisticated pigments in their antenna systems and transfer sunlight energy to the reaction center (RC) in the form of electron-hole pairs (excitons), where it is stored as biochemical energy¹. The transfer of solar energy from antenna to RC, which is also known as excitation energy transfer (EET), in the form of excitons is considered to be highly efficient with close to unit efficiency². Understanding this high efficiency of the natural harvesting systems is very important because this understanding can be potentially applied in designing very efficient organic solar cells and storage devices³. Experiments showed that the long-lasting coherence in the efficient natural light-harvesting complexes (LHCs) is preserved by the surrounding protein environments (scaffold), and this coherence may be responsible for this high efficiency^4,5. The most well-investigated LHC is Fenna–Matthews–Olsen (FMO) complex, which is found in green sulfur bacteria⁶. The small size and simplicity of the FMO complex also make it a testbed of simulation approaches. The FMO complex is a trimer of identical subunits, where each subunit consists of bacteriochlorophyll (BChl) molecules (system) attached to their protein environments⁷.

Enormous amount of research work has been done on light-harvesting processes^{8,9,10,11,12,13}. Taking FMO as an example, it is easy to see that the system (BChl molecules) is not isolated from the environment (the protein) and thus, the correct simulation of FMO should treat it as an open system rather than isolated one. In addition, many experiments suggest^14,15, that quantum effects, particularly coherence, might play an important role in the light-harvesting processes and may even be responsible for achieving the high-end efficiency. Temporal and spatial simulation of EET with the inclusion of quantum effects can be done within many frameworks such as classical mapping-based approaches^16,17,18, perturbative methods^19,20,21, and dissipative quantum dynamics (QD)^{22,23,24,25,26,27} adopted here.

QD simulations can be performed using the hierarchical equations of motion (HEOM)²⁸ and its many improvements and extensions^{8,23,29,30,31}, the quasiadiabatic propagator path integral (QuAPI)³² and its variant iterative QuAPI (iQuAPI)²⁷, the trajectory-based stochastic equation of motion (SEOM) approach^{25,33,34,35,36,37,38,39}, the multi-layer multi-configuration time-dependent Hartree (ML-MCTDH)²⁶ and the local thermalising Lindblad master equation (LTLME)²². The development of various quantum dissipative dynamics methods stirs from the fact that each of these methods has some limitations and hence there is no single universal method that works in all cases. For instance, HEOM is numerically exact but comes with a very high computational cost at low temperatures, the SEOM has no explicit dependence on the temperature but has very bad convergence at long-time propagation, in the QuAPI approach all correlation effects are included over a finite time and correlation effects beyond this time are neglected. Most importantly, all these traditional QD approaches require step-wise propagation of trajectories and the next step depends on the previous steps, thus, QD simulation is an iterative, recursive process. Both calculations at each time step and recursive nature of QD makes it rather computationally expensive.

Alleviating the computational cost of QD became a target of a series of studies applying artificial intelligence (AI)^{40,41,42,43,44,45,46}, inspired by advances in application of AI employing machine learning (ML) algorithms in computational chemistry and chemical physics^47,48. AI was also applied to investigate EET in a dimer system⁴⁴ and the FMO complex⁴⁰. Saving of computational cost by AI in above studies is impressive, however, one of the studies⁴⁰ only focused on predicting energy transfer times and transfer efficiencies rather than temporal and spatial evolution, while other related studies^44,45,46 adopted basically the same recursive nature of QD trajectory propagation.

The recursive nature of the previous AI-based QD makes it prone to error accumulation. In recursive simulations, previously predicted values are used as an input to predict the next value. Thus, the prediction error at each time-step will accumulate, which results in deterioration of accuracy. In addition, the recursive nature of predictions does not allow us to make a prediction for any arbitrary time without predicting values before that. Finally, a short-time trajectory is needed as the seed to be generated with traditional approaches such as HEOM and then provided as an input to AI model to make prediction for the next time step and ultimately propagate the long-time dynamics. Thus, even when having AI model, we still need to spend valuable computational time to generate the short-time trajectory with the traditional approaches.

Here, we suggest an AI-QD approach to directly predict QD with AI as a function of time and other parameters such as temperature, reorganization energy, etc., completely circumventing the need of recursive step-wise dynamics propagation in contrast to the traditional QD and alternative, recursive AI-based QD approaches. Our AI-QD approach is able to predict QD at infinite time with correct asymptotic behavior and can be viewed as trajectory learning, which does not need any short-time trajectory as an input, eradicates the need of traditional approaches to generate the seed, and alleviates the problem of error accumulation. We demonstrate the applicability of AI-QD on seven-sites Fenna–Matthews–Olson (FMO) complex and show how AI-QD can be used for massive, infinite-time QD simulations and provide insights into the desired range of parameters and more efficient paths followed by the transfer of excitation energy.

Results

Reference quantum dynamics of the FMO complex

We employ the Frenkel exciton Hamiltonian⁴⁹ to study EET dynamics in the FMO complex:

$${{{{{{{\bf{H}}}}}}}}={{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\rm{s}}}}}}}}}+{{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\rm{env}}}}}}}}}+{{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\mbox{s-env}}}}}}}}}+{{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\rm{reorg}}}}}}}}},$$

(1)

with all Hamiltonian terms given below

$${{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\rm{s}}}}}}}}}=\mathop{\sum }\limits_{i}^{n}\left|i\right\rangle {\epsilon }_{i}\left\langle i\right|+\mathop{\sum }\limits_{i,j = 1,i\ne j}^{n}\left|i\right\rangle {J}_{ij}\left\langle j\right|,$$

(2)

$${{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\rm{env}}}}}}}}}=\mathop{\sum }\limits_{i=1}^{n}\mathop{\sum}\limits_{k=1}\left(\frac{1}{2}{{{{{{{{\bf{P}}}}}}}}}_{k,i}^{2}+\frac{1}{2}{\omega }_{k,i}^{2}{{{{{{{{\bf{Q}}}}}}}}}_{k,i}^{2}\right),$$

(3)

$${{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\mbox{s-env}}}}}}}}}=-\mathop{\sum }\limits_{i=1}^{n}\mathop{\sum}\limits_{k=1}\left|i\right\rangle {c}_{k,i}{{{{{{{{\bf{Q}}}}}}}}}_{k,i}\left\langle i\right|,$$

(4)

$${{{{{{{{\bf{H}}}}}}}}}_{{{{{{{{\rm{reorg}}}}}}}}}=\mathop{\sum }\limits_{i=1}^{n}\left|i\right\rangle {\lambda }_{i}\left\langle i\right|,$$

(5)

where H_s, H_env, H_s-env, and H_reorg denote system (BChl molecules) Hamiltonian, Hamiltonian of protein-environment, system-environment interaction Hamiltonian and the reorganization term, respectively. In Eq. (1), n is the number of sites (BChl molecules), ϵ_i is the energy of the ith site and J_ij is the inter-site coupling between sites i and j. P_k,i, Q_k,i, and ω_k,i are, respectively, momentum, coordinate, and frequency of environment mode k associated with site i. In H_s-env, each site is connected to its own environment. The c_k,i is the strength of coupling between site i and mode k of its environment. The reorganization term H_reorg can be seen as a counter term that emerges from the interaction of the sites with the environment^8,49,50. It is added to stop further renormalization of the site energy ϵ_i by the environment. In the reorganization term H_reorg, λ_i is the reorganization energy corresponding to site i⁵¹,

$${\lambda }_{i}=\frac{1}{\pi }\int\nolimits_{0}^{\infty }\frac{{J}_{i}(\omega )}{\omega }d\omega ,$$

(6)

where J_i(ω) is spectral density of the environment corresponding to site i. As shown by Nalbach and Thorwart⁵², the effects of the discrete molecular modes on the population dynamics are largely irrelevant. As a result, it is acceptable to use continuous environment spectral density such as Drude–Lorentz spectral density

$${J}_{{{\mbox{env}}}}(\omega )=2\lambda \frac{\omega \gamma }{{\omega }^{2}+{\gamma }^{2}},$$

(7)

where γ and λ denote the characteristic frequency (bath relaxation rate) and the reorganization energy, respectively.

In general terms, the EET dynamics in the FMO complex can be described by Liouville–von Neumann equation

$$\frac{d}{dt}{{{{{{{\boldsymbol{\rho }}}}}}}}(t)=\frac{i}{\hslash }\left[{{{{{{{\bf{H}}}}}}}},{{{{{{{\boldsymbol{\rho }}}}}}}}(t)\right],$$

(8)

where ρ is the density matrix. Because of the many-body effects, direct propagation of Eq. (8) is not straightforward. Different approaches are developed to simplify and propagate Eq. (8) and interested readers are advised to look into the corresponding references^25,30,32,53.

We use the local thermalising Lindblad master equation (LTLME)²² to propagate the reference QD trajectories for the reduced density matrix of the system (see Supplementary Methods), where we adopt Adolphs and Renger’s Hamiltonian for seven sites per subunit⁵⁴ (see “Methods”). The LTLME is a coherent and complete positive trace-preserving approach, but may not be as accurate as HEOM or SEOM approaches because of approximations used in LTLME derivation^22,55, but here it is not the concern of our proof-of-concept paper.

Parameters-based non-recursive training framework

In our parameters-based non-recursive AI-QD, we train ML model as a function of a parameter space ${{{{{{{\mathcal{D}}}}}}}}$ (used as the input to ML model) which depends on the system of interest and on the data from a limited number of QD trajectories. For the FMO complex, our parameter space ${{{{{{{\mathcal{D}}}}}}}}$ consists of information of sites: λ, γ, and T. In addition, time also becomes a part of the input of our AI-QD model. In order to treat infinite time, instead of time, we introduce time-function $f(t)\in {{{{{{{\mathcal{D}}}}}}}}$, which normalizes time and for t → ∞ becomes f(t) = 1. Such normalization, however, can effectively only discern data within rather short time-region, thus, instead of a single time-function, we introduce the set of redundant time-functions $\left\{{f}_{k}(t)\right\}$ for different regions in very long-time propagation (see “Methods”). The remaining input of our model is information about the initial excitation $m=\{{m}_{1},{m}_{2}\}=\{0,1\}\in {{{{{{{\mathcal{D}}}}}}}}$ (with 0 corresponding to initial excitation on site-1 and 1 corresponding to site-6) and labels $n=\{{n}_{1},{n}_{2},{n}_{3},\ldots, {n}_{7}\}=\{1,2,3,\ldots ,7\}\in {{{{{{{\mathcal{D}}}}}}}}$ corresponding to the seven rows in the reduced density matrix. We train convolutional neural network (CNN) taking all above input elements $\{m,n,\,\gamma ,\,\lambda ,\,T,\,\{f_k(t)\}\}\in {{{{{{{\mathcal{D}}}}}}}}$ on rows of the reduced density matrix which include exciton population ρ_nn(t) and coherence (off-diagonal) terms ρ_nq, n≠q (target values to learn or output of the trained model) (see Fig. 1 and “Methods” for details, such as CNN architecture and normalization of input elements).

**Fig. 1: Preparation of training data using parameters in AI-QD training framework.**

Our training trajectories generated with the reference LTLME-QD approach are chosen by farthest-point sampling from the three-dimensional space of the following parameters: reorganization energy λ = {λ₁, λ₂, λ₃, …, λ_i}, the characteristic frequency γ = {γ₁, γ₂, γ₃, …, γ_j} and temperature T = {T₁, T₂, T₃, …, T_l} (see “Methods”).

We should also decide up to what time-length t_M we should run reference LTLME-QD trajectories. Based on the prior knowledge that populations plateau in asymptotic limit, for each trajectory we choose a different time-length t_M using a vanishing gradient scheme, where t_M is chosen such that the gradient of population G is close to zero (see “Methods”). Using the vanishing gradient scheme to find different t_M for each trajectory allows us to sample more data from the training trajectories, which are hard-to-learn, while avoiding redundant sampling from trajectories, which are easy-to-learn. This also removes arbitrariness in choosing fixed t_M parameter as was done in previous studies using the recursive AI-QD scheme^44,46.

Application to EET dynamics in FMO complex

As an application of our approach, we predict EET dynamics in the FMO complex with seven sites per subunit for parameters of the test set trajectories none of which used in training. Site-1 (BChl molecule 1) and site-6 (BChl molecule 6) are most likely to get initially excited as they are close to the photosynthetic antenna complex called chlorosome⁶, we thus present results for both cases. For predictions, we just provide the parameters of the test trajectories (characteristic frequency, reorganization energy, temperature) as an input and predict the evolution of EET. Figure 2 shows the evolution of excitation energy in all seven sites for both cases. In Fig. 2, we show EET for both short and long time periods, demonstrating that AI-QD is able to capture the coherent EET (aka quantum beating or modulation of amplitudes) of short-time dynamics and also can predict the asymptotic limit. Figure 3 shows the prominent off-diagonal terms (aka coherence) of the reduced density matrix for Fig. 2. Table 1 shows mean absolute error (MAE) and root mean square error (RMSE) averaged over 600 trajectories. As AI-QD is non-recursive (non-iterative), without any trajectory propagation, we can directly predict the asymptotic behavior. Our AI-QD performs well in all cases (from weak coherence to strong coherence, from Markovian to non-Markovian, from adiabatic to nonadiabatic situations) as can be observed for selected trajectories shown in Supplementary Fig. 1 with corresponding errors reported in Supplementary Table 1. From Supplementary Table 1, we observe that our AI-QD approach is comparatively more accurate in strongly coherent cases (large value of γ and small values of λ and T) which can be seen as a consequence of the vanishing gradient scheme which may favor these challenging cases due to a larger number of training points sampled from such trajectories. AI-QD approach can even extrapolate to a good degree as its error for the test trajectories propagated with parameters outside the training parameter space is of a similar order of magnitude to the test trajectories propagated with parameters inside the training parameter space (interpolation) as shown in Supplementary Fig. 2 and Supplementary Table 2.

**Fig. 2: Population of the seven sites in the FMO complex as a function of time.**

**Fig. 3: Electronic coherence as a function of time.**

Table 1 Mean absolute error (MAE) and root mean square error (RMSE) averaged over 600 test trajectories propagated up to 1 ns.

Full size table

It was shown^8,56,57 that the transfer of excitation energy in the seven-sites FMO complex follows mainly two paths, i.e., site-1 → site-2 → site-3 ↔ site-4 and site-6 → site-5, site-7, site-4 → site-3, here the ↔ shows that the excitation energy equilibrates between site-3 and site-4 after site-3 is populated (see Fig. 3). Among the seven sites, the sites 1 and 6 are close to the baseplate protein, while the sites 3 and 4 are near to the target RC complex^54,58. It has been proposed that the quantum coherence allows the FMO complex to quickly sample several routes (paths) in search of site-3⁵. In Fig. 4, we show the population of site-3 at t = 0.5 ps (500 fs) as a function of γ, λ, and T. From Fig. 4a, we observe that at room temperature T = 300, the ETT to site-3 or, in other words, to RC complex gets slow as the characteristic frequency γ increases. In contrast, the ETT to site-3 increases with the increase in reorganization energy λ as shown in Fig. 4b. Similar trend can be observed with the increase in temperature T as can be seen in Fig. 4c.

**Fig. 4: The evolution of site-3 population at t = 0.5 ps.**

In order to find the optimum parameters for the fastest transfer of excitation energy, we have calculated population of site-3 at 0.5 ps for a massive set of ca. 0.57 million possible combinations (site-1 + site-6) of the γ, λ, T with the search space γ = 25, 30, 35, …, 245, λ = 10, 15, 20, …, 345 and T = 25, 30, 35, …, 345. We report the fastest EET of 0.761 to site-3 for path-2 with γ = 30, λ = 310, T = 25, while for path-1 for the same parameters EET is 0.626. From Figs. 2, 4 and from the optimum parameters, we notice that following path-1, i.e., site-1 → site-2 → site-3 ↔ site-4, the EET shows more coherence and is slow compared to excitation transfer following path-2, i.e., site-6 → site-5, site-7, site-4 → site-3. From Eq. (9) (“Methods”), energy of the site-1 (12,410 cm⁻¹) is lower than the baseplate, which has been reported to be 12,500 cm⁻¹^59,60. This allows a quick transfer of the excitation energy to site-1 from the baseplate. However, the energy of site-2 (12,530 cm⁻¹) is higher than site-1 and also than site-3 (12,210 cm⁻¹), which on the one hand stops backward transfer from site-3, but on the other hand creates a local minimum on site-1. Despite the local minimum on site-1, the excitation energy is not trapped because of the quantum coherent wave-like motion between site-1 and site-2. Following path-2, the energy of site-6 (12,630 cm⁻¹) is higher than the energy of baseplate. To stop backward transfer of excitation energy from site-6 to baseplate, site-6 should quickly transfer excitation energy to other sites such as site-5, site-7, and site-4. This quick transfer from site-6 to site-5, site-7, and site-4 is only possible by the strong coupling of site-6 to site-5 and site-7, which in return are strongly coupled to site-4.

Discussion

In this work, we have presented a non-recursive (non-iterative) AI-QD approach for blazingly fast prediction of quantum dynamics, as predictions can be made for any time step up to asymptotic limit completely circumventing the need of recursive trajectory propagation. This can be used, as we demonstrated here, for massive quantum dynamics simulations, for example, in search for the best conditions required for efficient energy transfer in designed photovoltaic devices. Just to put things into perspective, our AI-QD approach can predict the entire 2.5 ps trajectory within ca. 2 min on a single core of Intel(R) Core(TM) i7-10700 CPUs @ 2.90 GHz, independent of the reference method used for generating training trajectories, while the same propagation with the traditional recursive approaches such as HEOM would take hours, and the cost would exponentially increase for low temperatures. The high cost of accurate approaches such as HEOM was also a reason why we used a much faster LTLME for this proof-of-concept study to extensively test our approach (propagation of an entire trajectory takes only 3 min with LTLME on a single CPU of the above computer architecture). It is worth emphasizing that AI-QD is embarrassingly parallel and the calculations can be further significantly sped up by using multiple CPUs or GPUs, because predictions with AI-QD for different time steps are independent of each other and different segments of trajectories can be distributed for independent calculations on many threads.

We demonstrated the feasibility of AI-QD approach on an example of the FMO complex, but this approach is general enough to be used for any other complex after retraining. It remains to be seen how well the AI-QD approach can be extended to describe several LHCs at the same time—a topic of our ongoing research. One could use the LHC Hamiltonian elements as a representation of LHC complexes and an early encouraging study⁴² has shown that by using Hamiltonian elements as input of an ML model, one can successfully describe scalar properties (energy transfer times and transfer efficiencies) for different Hamiltonians. However, open question remains how successful would be such an approach to learn dynamics and in addition, how to circumvent different dimensionalities of Hamiltonians of different complexes.

Methods

Training data

In the seven-sites FMO complex (apo-FMO), where seven BChl molecules (seven sites) exist per subunit, the inter-subunit interaction is very small and each subunit can be considered relatively isolated⁶¹. Here we adopt Adolphs and Renger’s Hamiltonian for seven sites per subunit⁵⁴

$${{{{{{\bf{H}}}}}}}_{{{{{{\rm{s}}}}}}}=\left[\begin{array}{ccccccc}12410&-87.7&5.5&-5.9&6.7&-13.7&-9.9\\ -87.7&12530&30.8&8.2&0.7&11.8&4.3\\ 5.5&30.8&12210&-53.5&-2.2&-9.6&6.0\\ -5.9&8.2&-53.5&12320&-70.7&-17.0&-63.6\\ 6.7&0.7&-2.2&-70.7&12480&81.1&-1.3\\ -13.7&11.8&-9.6&-17.0&81.1&12630&39.7\\ -9.9&4.3&6.0&-63.3&-1.3&39.7&12440\end{array}\right],$$

(9)

where energies are given in cm⁻¹. Each site is coupled to its own environment characterized by the Drude–Lorentz spectral density given by Eq. (7). Not long ago, an eighth BChl molecule (site-8) has been discovered¹¹, however, as has been mentioned by Jia et al.⁶², the role of the eighth BChl molecule (site-8) in the transfer of excitation energy in the FMO complex is negligible.

Trajectories for the reduced density matrix have been generated with the local thermalising Lindblad master equation (LTLME)²² (see Supplementary Methods) implemented in quantum_HEOM package⁶³ with QuTip⁶⁴ in the back-end with all the possible combinations of the following parameters: λ = {10, 40, 70, 100, 130, 160, 190, 220, 250, 280, 310} cm⁻¹, γ = {25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300} cm⁻¹ and T = {30, 50, 70, 90, 110, 130, 150, 170, 190, 210, 230, 250, 270, 290, 310} K. We consider that all these combinations of parameters make a part of a parameter space ${{{{{{{\mathcal{D}}}}}}}}$. The time-step used for propagation is 5 fs and the trajectory is propagated up to t_M = 1 ns (10⁶ fs). With the possibility of initial excitation on site-1 and site-6, we generate 1980 trajectories for each excitation case.

Data preparation

With all the possible combinations of the parameters λ, γ, T (belonging to ${{{{{{{\mathcal{D}}}}}}}}$), we have 3960 total number of trajectories N_traj (1980 (site-1) + 1980 (site-6), all these trajectories correspond to their respective combination of parameters in parameter space ${{{{{{{\mathcal{D}}}}}}}}$). Using farthest-point sampling⁶⁵ in the three-dimensional space of λ, γ, and T, we choose 1000 trajectories as our training space TS (500 (site-1) + 500 (site-6), ca. 25% of space ${{{{{{{\mathcal{D}}}}}}}}$)), 200 trajectories as the validation set VS (ca. 5% of space ${{{{{{{\mathcal{D}}}}}}}}$)) and the rest of trajectories, we keep as the test set STP (set of test points, ca. 70% of space ${{{{{{{\mathcal{D}}}}}}}}$). For each trajectory, we choose a different time-length t_M using a vanishing gradient scheme. In this scheme, we take the gradient G of the population of each site (ρ_nn, n = 1, 2, 3, …, 7) for 10 consecutive time-steps and if all of them remain less than the threshold value of G_th = 1 × 10⁻¹⁰, we choose our t_M. We find t_M for all seven sites and then choose the maximum value among them, thus we keep a single value of asymptotic limit (t_M) for all seven-sites. By analyzing the gradients, we find the region of the trajectory, where the change in population of the site is very small. By knowing that, we keep the time-length of our trajectory t_M up to that region, because beyond t_M the change in population is very small, and ML is able to predict it. As the asymptotic limit for each trajectory is different, we have different values of t_M for each trajectory. In our training, we have included t → ∞, corresponding to the asymptotic behavior at long-time. Using the strategy of different t_M for each trajectory allows us to include more sampling in our training set from hard-to-learn trajectories, while avoiding redundant sampling from easy-to-learn trajectories. For training, sampling is done with different training time-steps Δt_train in different regions of the trajectory. We sample our training points from 0 ps–1 ps, 1 ps–1.5 ps, 1.5 ps–2.5 ps, 2.5 ps–5 ps, 5 ps–25 ps, 25 ps–50 ps, 50 ps–250 ps, 250 ps–t_M regions with Δt_train = 5, 10, 25, 50, 100, 200, 500, 1000 fs, respectively. The number of training points depends on the number of trajectories N_traj chosen for training, training time-step Δt_train and time-length of trajectories t_M, which in turn depends on G_th.

Training architecture

We use convolutional neural network (CNN) architecture, because the importance of convolutional layers is much explored for image analysis, where these layers extract important features such as edges, textures, objects, and scenes. When it comes to time-series data, we are using convolutional layers in the hope to extract some important features from the data (such as the time influence). After learning those features, when we provide a test trajectory, the trained ML model will look for those features in that test trajectory⁶⁶. Though we have used the CNN model, other neural network architectures such as long short-term memory (LSTM) is also an option. LSTM is considered to be more suitable for extracting long-time temporal dependencies in contrast to convolutional neural networks (CNNs) which are more local. However, CNNs are easy to train and in many studies, they have outperformed LSTM for future forecasting^67,68.

We use 1000 trajectories as our training set TS and 200 trajectories as the validation set VS. After preparation of the input following Fig. 1, we build a CNN architecture and optimize it with hyperopt library⁶⁹. The optimization was carried out only on 300 training trajectories from the training set TS. After optimization, our training architecture consists of two one-dimensional (1D) hidden convolutional layers, one maximum pooling layer, one flatten layer, three fully connected hidden dense layers and one output dense layer. The convolutional layers extract time-dependent correlations from a moving window, while maximum pooling layer pulls out the important information and decreases the size of the feature map which leads to reducing the computational cost. The flatten layer converts the output from the maximum pooling layer into 1D format as the fully connected dense layers, which are the traditional neural networks, can only work with 1D data. We train our CNN architecture using Keras software package⁷⁰ with the TensorFlow in the backend⁷¹. Activation function, number of filters, kernel size and number of neurons for the respective convolutional and dense layers are given in Table 2. In our study, we train a single CNN model and with ca. 3.2 million training points and 900 epochs, training takes ca. 42 h on 32 Intel(R) Xeon(R) Gold 6226R CPUs @ 2.90 GHz. The optimized learning rate is 1 × 10⁻³ with adoptive mean optimizer and the batch size is 512. Using mean squared error function as a loss, we report 1.86 × 10⁻⁷ as the validation loss. The mean absolute error (MAE) and root mean square error (RMSE) averaged over 600 randomly chosen trajectories from the set of test trajectories STP (which were not part of the training process) are given in Table 1.

Table 2 Summary of the optimized neural network architecture with layers, output shape (OS), number of parameters (NP), activation function (AF), number of filters (NF), kernel size (KS), and number of neurons (NN).

Full size table

Input normalization and redundant time-functions

As we have multiple input elements, we need to normalize them all. In normalized input, we have $\lambda =\{{\lambda }_{1},{\lambda }_{2},{\lambda }_{3},\ldots, {\lambda }_{j}\}/{\lambda }_{\max }$, $\gamma =\{{\gamma }_{1},{\gamma }_{2},{\gamma }_{3},\ldots, {\gamma }_{k}\}/{\gamma }_{\max }$, and $T=\{{T}_{1},{T}_{2},{T}_{3},\ldots, {T}_{l}\}/{T}_{\max }$, where ${\lambda }_{\max }$, ${\gamma }_{\max }$, and ${T}_{\max }$ represent the maximum values of λ, γ, and T, respectively. We divide n = {n₁, n₂, n₃, …, n₇} = {1, 2, 3, …, 7} (labels corresponding to the seven rows in the reduced density matrix) by 10 to normalize their values, i.e., the input elements corresponding to the rows in the reduced density matrix are {0.1, 0.2, 0.3, …, 0.7}. Labels for sites with possible initial excitation are m = {0, 1}, which, respectively, represent initial excitation on site-1 and site-6. The input time is represented by a set of redundant time-functions $\left\{{f}_{i}(t)\right\}$, each of which is logistic function f(t) normalizing time. We use a set of 100 logistic functions ${f}_{k}(t)=1/(1+15\cdot \exp (-(t+{c}_{k})))$, where c_k = 5k−1.0 and k ∈ {0, 1, 2, …, 99}, i.e., each logistic function has the same shape and designed to cover the corresponding ≈ 5 ps region and is shifted with respect to the next logistic function by 5 ps, as shown in Supplementary Fig. 3. The infinity limit is given by all redundant time-functions set to one.

Data availability

Data can be re-generated using the script provided at https://github.com/Arif-PhyChem/AIQD_FMO.

Code availability

The code is available at https://github.com/Arif-PhyChem/AIQD_FMO.

References

Fassioli, F., Dinshaw, R., Arpin, P. C. & Scholes, G. D. Photosynthetic light harvesting: excitons and coherence. J. R. Soc. Interface 11, 20130901 (2014).
Article PubMed PubMed Central Google Scholar
Blankenship, R. E. Molecular Mechanisms of Photosynthesis (John Wiley & Sons, 2021).
Bhatia, S. Advanced Renewable Energy Systems (Part 1 and 2) (CRC Press, 2014).
Olson, J. M. in The FMO Protein Discoveries in Photosynthesis 421–427 (Springer, 2005).
Engel, G. S. et al. Evidence for wavelike energy transfer through quantum coherence in photosynthetic systems. Nature 446, 782–786 (2007).
Article ADS CAS PubMed Google Scholar
Karafyllidis, I. G. Quantum transport in the FMO photosynthetic light-harvesting complex. J. Biol. Phys. 43, 239–245 (2017).
Article CAS PubMed PubMed Central Google Scholar
Tronrud, D. E., Wen, J., Gay, L. & Blankenship, R. E. The structural basis for the difference in absorbance spectra for the FMO antenna protein from various green sulfur bacteria. Photosynthesis Res. 100, 79–87 (2009).
Article CAS Google Scholar
Ishizaki, A. & Fleming, G. R. Theoretical examination of quantum coherence in a photosynthetic system at physiological temperature. Proc. Natl Acad. Sci. USA 106, 17255–17260 (2009).
Article ADS PubMed PubMed Central Google Scholar
Collini, E. et al. Coherently wired light-harvesting in photosynthetic marine algae at ambient temperature. Nature 463, 644–647 (2010).
Article ADS CAS PubMed Google Scholar
Milder, M. T., Brüggemann, B., van Grondelle, R. & Herek, J. L. Revisiting the optical properties of the FMO protein. Photosynthesis Res. 104, 257–274 (2010).
Article CAS Google Scholar
Schmidt am Busch, M., Müh, F., El-Amine Madjet, M. & Renger, T. The eighth bacteriochlorophyll completes the excitation energy funnel in the FMO protein. J. Phys. Chem. Lett. 2, 93–98 (2011).
Article CAS PubMed Google Scholar
Olbrich, C., Strümpfer, J., Schulten, K. & Kleinekathöfer, U. Theory and simulation of the environmental effects on FMO electronic transitions. J. Phys. Chem. Lett. 2, 1771–1776 (2011).
Article CAS Google Scholar
Chenu, A. & Scholes, G. D. Coherence in energy transfer and photosynthesis. Annu. Rev. Phys. Chem. 66, 69–96 (2015).
Article ADS CAS PubMed Google Scholar
Cheng, Y. & Silbey, R. J. Coherence in the B800 ring of purple bacteria LH2. Phys. Rev. Lett. 96, 028103 (2006).
Article ADS CAS PubMed Google Scholar
Lee, H., Cheng, Y.-C. & Fleming, G. R. Coherence dynamics in photosynthesis: protein protection of excitonic coherence. Science 316, 1462–1465 (2007).
Article ADS CAS PubMed Google Scholar
Cotton, S. J. & Miller, W. H. The symmetrical quasi-classical model for electronically non-adiabatic processes applied to energy transfer dynamics in site-exciton models of light-harvesting complexes. J. Chem. Theory Comput. 12, 983–991 (2016).
Article CAS PubMed Google Scholar
Mannouch, J. R. & Richardson, J. O. A partially linearized spin-mapping approach for nonadiabatic dynamics. I. Derivation of the theory. J. Chem. Phys. 153, 194109 (2020).
Article ADS CAS PubMed Google Scholar
Liu, J., He, X. & Wu, B. Unified formulation of phase space mapping approaches for nonadiabatic quantum dynamics. Acc. Chem. Res. 54, 4215–4228 (2021).
Article CAS PubMed Google Scholar
Hwang-Fu, Y.-H., Chen, W. & Cheng, Y.-C. A coherent modified Redfield theory for excitation energy transfer in molecular aggregates. Chem. Phys. 447, 46–53 (2015).
Article CAS Google Scholar
Jang, S., Cheng, Y.-C., Reichman, D. R. & Eaves, J. D. Theory of coherent resonance energy transfer. J. Chem. Phys. 129, 101104 (2008).
Article ADS PubMed Google Scholar
Wu, J., Liu, F., Shen, Y., Cao, J. & Silbey, R. J. Efficient energy transfer in light-harvesting systems. I: Optimal temperature, reorganization energy and spatial–temporal correlations. N. J. Phys. 12, 105012 (2010).
Article Google Scholar
Mohseni, M., Rebentrost, P., Lloyd, S. & Aspuru-Guzik, A. Environment-assisted quantum walks in photosynthetic energy transfer. J. Chem. Phys. 129, 174106 (2008).
Article ADS PubMed Google Scholar
Wilkins, D. M. & Dattani, N. S. Why quantum coherence is not important in the Fenna–Matthews–Olsen complex. J. Chem. Theory Comput. 11, 3411–3419 (2015).
Article CAS PubMed Google Scholar
Strümpfer, J. & Schulten, K. Open quantum dynamics calculations with the hierarchy equations of motion on parallel computers. J. Chem. Theory Comput. 8, 2808–2816 (2012).
Article PubMed PubMed Central Google Scholar
Imai, H., Ohtsuki, Y. & Kono, H. Application of stochastic Liouville–Von Neumann equation to electronic energy transfer in FMO complex. Chem. Phys. 446, 134–141 (2015).
Article CAS Google Scholar
Schulze, J., Shibl, M. F., Al-Marri, M. J. & Kühn, O. Multi-layer multi-configuration time-dependent Hartree (ML-MCTDH) approach to the correlated exciton-vibrational dynamics in the FMO complex. J. Chem. Phys. 144, 185101 (2016).
Article ADS PubMed Google Scholar
Richter, M. & Fingerhut, B. P. Coarse-grained representation of the quasi adiabatic propagator path integral for the treatment of non-markovian long-time bath memory. J. Chem. Phys. 146, 214101 (2017).
Article ADS PubMed PubMed Central Google Scholar
Tanimura, Y. & Kubo, R. Time evolution of a quantum system in contact with a nearly Gaussian-Markoffian noise bath. J. Phys. Soc. Jpn. 58, 101–114 (1989).
Article ADS MathSciNet Google Scholar
Xu, R.-X., Cui, P., Li, X.-Q., Mo, Y. & Yan, Y. Exact quantum master equation via the calculus on path integrals. J. Chem. Phys. 122, 041103 (2005).
Article ADS Google Scholar
Tanimura, Y. Stochastic Liouville, Langevin, Fokker–Planck, and master equation approaches to quantum dissipative systems. J. Phys. Soc. Jpn. 75, 082001 (2006).
Article ADS Google Scholar
Zhang, H.-D. et al. Hierarchical equations of motion method based on fano spectrum decomposition for low temperature environments. J. Chem. Phys. 152, 064107 (2020).
Article ADS CAS PubMed Google Scholar
Makarov, D. E. & Makri, N. Path integrals for dissipative systems by tensor multiplication. Condensed phase quantum dynamics for arbitrarily long time. Chem. Phys. Lett. 221, 482–491 (1994).
Article ADS CAS Google Scholar
Stockburger, J. T. & Grabert, H. Non-Markovian quantum state diffusion. Chem. Phys. 268, 249–256 (2001).
Article CAS Google Scholar
Shao, J. Decoupling quantum dissipation interaction via stochastic fields. J. Chem. Phys. 120, 5053–5056 (2004).
Article ADS CAS PubMed Google Scholar
Ke, Y. & Zhao, Y. An extension of stochastic hierarchy equations of motion for the equilibrium correlation functions. J. Chem. Phys. 146, 214105 (2017).
Article ADS PubMed PubMed Central Google Scholar
McCaul, G., Lorenz, C. & Kantorovich, L. Partition-free approach to open quantum systems in harmonic environments: an exact stochastic Liouville equation. Phys. Rev. B 95, 125124 (2017).
Article ADS Google Scholar
Han, L., Chernyak, V., Yan, Y.-A., Zheng, X. & Yan, Y. Stochastic representation of non-Markovian fermionic quantum dissipation. Phys. Rev. Lett. 123, 050601 (2019).
Article ADS MathSciNet CAS PubMed Google Scholar
Han, L. et al. Stochastic equation of motion approach to fermionic dissipative dynamics. I. Formalism. J. Chem. Phys. 152, 204105 (2020).
Article ADS CAS PubMed Google Scholar
Ullah, A. et al. Stochastic equation of motion approach to fermionic dissipative dynamics. II. Numerical implementation. J. Chem. Phys. 152, 204106 (2020).
Article ADS CAS PubMed Google Scholar
Häse, F., Kreisbeck, C. & Aspuru-Guzik, A. Machine learning for quantum dynamics: deep learning of excitation energy transfer properties. Chem. Sci. 8, 8419–8426 (2017).
Article PubMed PubMed Central Google Scholar
Hartmann, M. J. & Carleo, G. Neural-network approach to dissipative quantum many-body dynamics. Phys. Rev. Lett. 122, 250502 (2019).
Article ADS CAS PubMed Google Scholar
Häse, F., Roch, L. M., Friederich, P. & Aspuru-Guzik, A. Designing and understanding light-harvesting devices with machine learning. Nat. Commun. 11, 4587 (2020).
Article ADS PubMed PubMed Central Google Scholar
Secor, M., Soudackov, A. V. & Hammes-Schiffer, S. Artificial neural networks as propagators in quantum dynamics. J. Phys. Chem. Lett. 12, 10654–10662 (2021).
Article CAS PubMed Google Scholar
Herrera Rodríguez, L. E. & Kananenka, A. A. Convolutional neural networks for long time dissipative quantum dynamics. J. Phys. Chem. Lett. 12, 2476–2483 (2021).
Article PubMed Google Scholar
Lin, K., Peng, J., Gu, F. L. & Lan, Z. Simulation of open quantum dynamics with bootstrap-based long short-term memory recurrent neural network. J. Phys. Chem. Lett. 12, 10225 (2021).
Article CAS PubMed Google Scholar
Ullah, A. & Dral, P. O. Speeding up quantum dissipative dynamics of open systems with kernel methods. N. J. Phys. 23, 113019 (2021).
Article CAS Google Scholar
Keith, J. A. et al. Combining machine learning and computational chemistry for predictive insights into chemical systems. Chem. Rev. 121, 9816 – 9872 (2021).
Article PubMed PubMed Central Google Scholar
Shi, Y., Prieto, P. L., Zepel, T., Grunert, S. & Hein, J. E. Automated experimentation powers data science in chemistry. Acc. Chem. Res. 54, 546–555 (2021).
Article CAS PubMed Google Scholar
Ishizaki, A. & Fleming, G. R. Unified treatment of quantum coherent and incoherent hopping dynamics in electronic energy transfer: reduced hierarchy equation approach. J. Chem. Phys. 130, 234111 (2009).
Article ADS PubMed Google Scholar
Mühlbacher, L. & Kleinekathöfer, U. Preparational effects on the excitation energy transfer in the FMO complex. J. Phys. Chem. B 116, 3900–3906 (2012).
Article PubMed Google Scholar
Zhong, X. & Zhao, Y. Charge carrier dynamics in phonon-induced fluctuation systems from time-dependent wavepacket diffusion approach. J. Chem. Phys. 135, 134110 (2011).
Article ADS PubMed Google Scholar
Nalbach, P. & Thorwart, M. The role of discrete molecular modes in the coherent exciton dynamics in FMO. J. Phys. B: At., Mol. Optical Phys. 45, 154009 (2012).
Article ADS Google Scholar
Stock, G. & Thoss, M. Semiclassical description of nonadiabatic quantum dynamics. Phys. Rev. Lett. 78, 578 (1997).
Article ADS MathSciNet CAS MATH Google Scholar
Adolphs, J. & Renger, T. How proteins trigger excitation energy transfer in the FMO complex of green sulfur bacteria. Biophysical J. 91, 2778–2797 (2006).
Article ADS CAS Google Scholar
Worster, S. B., Stross, C., Vaughan, F., Linden, N. & Manby, F. Structure and efficiency in bacterial photosynthetic light-harvesting. J. Phys. Chem. Lett. 10, 7383–7390 (2019).
Article Google Scholar
Brixner, T. et al. Two-dimensional spectroscopy of electronic couplings in photosynthesis. Nature 434, 625–628 (2005).
Article ADS CAS PubMed Google Scholar
Cho, M., Vaswani, H. M., Brixner, T., Stenger, J. & Fleming, G. R. Exciton analysis in 2D electronic spectroscopy. J. Phys. Chem. B 109, 10542–10556 (2005).
Article CAS PubMed Google Scholar
Wen, J., Zhang, H., Gross, M. L. & Blankenship, R. E. Membrane orientation of the FMO antenna protein from Chlorobaculum tepidum as determined by mass spectrometry-based footprinting. Proc. Natl Acad. Sci. USA 106, 6134–6139 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Francke, C. & Amesz, J. Isolation and pigment composition of the antenna system of four species of green sulfur bacteria. Photosynthesis Res. 52, 137–146 (1997).
Article Google Scholar
Frigaard, N.-U. et al. Isolation and characterization of carotenosomes from a bacteriochlorophyll c-less mutant of chlorobium tepidum. Photosynthesis Res. 86, 101–111 (2005).
Article CAS Google Scholar
Ke, Y. & Zhao, Y. Hierarchy of forward-backward stochastic Schrödinger equation. J. Chem. Phys. 145, 024101 (2016).
Article ADS PubMed Google Scholar
Jia, X., Mei, Y., Zhang, J. Z. & Mo, Y. Hybrid QM/MM study of FMO complex with polarized protein-specific charge. Sci. Rep. 5, 1–10 (2015).
Article CAS Google Scholar
Abbott, J. W. quantum_HEOM. GitHub repository. https://github.com/jwa7/quantum_HEOM (2019).
Johansson, J. R., Nation, P. D. & Nori, F. QuTiP: an open-source Python framework for the dynamics of open quantum systems. Computer Phys. Commun. 183, 1760–1772 (2012).
Article ADS CAS Google Scholar
Dral, P. O. MLatom: a program package for quantum chemical research assisted by machine learning. J. Computat. Chem. 40, 2339–2347 (2019).
Article CAS Google Scholar
Ismail Fawaz, H., Forestier, G., Weber, J., Idoumghar, L. & Muller, P.-A. Deep learning for time series classification: a review. Data Min. Knowl. Discov. 33, 917–963 (2019).
Article MathSciNet MATH Google Scholar
Tran, D., Bourdev, L., Fergus, R., Torresani, L. & Paluri, M. Learning spatiotemporal features with 3D convolutional networks. in 2015 IEEE International Conference on Computer Vision (ICCV), 4489–4497 (IEEE Computer Society, 2015).
Lea, C., Flynn, M. D., Vidal, R., Reiter, A. & Hager, G. D. Temporal convolutional networks for action segmentation and detection. in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1003–1012 (IEEE Computer Society, 2017).
Bergstra, J., Komer, B., Eliasmith, C., Yamins, D. L. K. & Cox, D. D. Hyperopt: a python library for model selection and hyperparameter optimization. Computat. Sci. Discov. 8, 014008 (2015).
Article ADS Google Scholar
Keras, a deep learning API. https://keras.io (2014).
Abadi, M. et al. Tensorflow: large-scale machine learning on heterogeneous distributed systems. http://tensorflow.org/ (2016).

Download references

Acknowledgements

P.O.D. acknowledges funding by the National Natural Science Foundation of China (No. 22003051), the Fundamental Research Funds for the Central Universities (No. 20720210092), and via the Lab project of the State Key Laboratory of Physical Chemistry of Solid Surfaces.

Author information

Authors and Affiliations

State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen, 361005, Fujian, China
Arif Ullah & Pavlo O. Dral

Authors

Arif Ullah
View author publications
You can also search for this author in PubMed Google Scholar
Pavlo O. Dral
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.U. conceived the idea of investigating excitation energy transfer in the Fenna–Matthews–Olsen complex. P.O.D. conceived the idea of learning trajectories as a function of time and parameters. Both authors developed the method. A.U. did all the implementations, calculations, analysis of data, and wrote the original version of the manuscript. Both authors revised the manuscript.

Corresponding authors

Correspondence to Arif Ullah or Pavlo O. Dral.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ullah, A., Dral, P.O. Predicting the future of excitation energy transfer in light-harvesting complex with artificial intelligence-based quantum dynamics. Nat Commun 13, 1930 (2022). https://doi.org/10.1038/s41467-022-29621-w

Download citation

Received: 22 November 2021
Accepted: 16 March 2022
Published: 11 April 2022
DOI: https://doi.org/10.1038/s41467-022-29621-w

This article is cited by

Environment-assisted quantum discord in the chromophores network of light-harvesting complexes
- Moisés Chávez-Huerta
- Fernando Rojas
Quantum Information Processing (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.