Impedance-based forecasting of lithium-ion battery performance amid uneven usage

Jones, Penelope K.; Stimming, Ulrich; Lee, Alpha A.

doi:10.1038/s41467-022-32422-w

Download PDF

Article
Open access
Published: 16 August 2022

Impedance-based forecasting of lithium-ion battery performance amid uneven usage

Nature Communications volume 13, Article number: 4806 (2022) Cite this article

18k Accesses
53 Citations
124 Altmetric
Metrics details

Subjects

Abstract

Accurate forecasting of lithium-ion battery performance is essential for easing consumer concerns about the safety and reliability of electric vehicles. Most research on battery health prognostics focuses on the research and development setting where cells are subjected to the same usage patterns. However, in practical operation, there is great variability in use across cells and cycles, thus making forecasting challenging. To address this challenge, here we propose a combination of electrochemical impedance spectroscopy measurements with probabilistic machine learning methods. Making use of a dataset of 88 commercial lithium-ion coin cells generated via multistage charging and discharging (with currents randomly changed between cycles), we show that future discharge capacities can be predicted with calibrated uncertainties, given the future cycling protocol and a single electrochemical impedance spectroscopy measurement made immediately before charging, and without any knowledge of usage history. The results are robust to cell manufacturer, the distribution of cycling protocols, and temperature. The research outcome also suggests that battery health is better quantified by a multidimensional vector rather than a scalar state of health.

Identifying degradation patterns of lithium ion batteries from impedance spectroscopy using machine learning

Article Open access 06 April 2020

Yunwei Zhang, Qiaochu Tang, … Alpha A. Lee

Data-driven capacity estimation of commercial lithium-ion batteries from voltage relaxation

Article Open access 27 April 2022

Jiangong Zhu, Yixiu Wang, … Helmut Ehrenberg

Machine learning pipeline for battery state-of-health estimation

Article 05 April 2021

Darius Roman, Saurabh Saxena, … David Flynn

Introduction

Electrification of the transportation industry is now taking place at an increasingly rapid pace, enabling significant strides towards a carbon neutral future. Fundamental to this transition has been the development of the lithium-ion battery, which powers the majority of electric vehicles (EVs) on the road today. Notwithstanding the environmental benefits of this transition, reliance on the lithium-ion battery poses significant challenges, with consumer concerns including range anxiety, fear of battery failure and charging time. Easing these concerns demands the ability to accurately forecast battery performance, and specifically when usage conditions are variable.

The key challenge is the heterogeneity of the battery. Each user uses their car differently, and even across a single battery pack not all cells are necessarily charged or discharged with identical current^1,2,3. These differences mean that each cell’s internal state, including the extent of lithium plating or electrode cracking, can vary significantly both at an intra-pack and inter-pack level^4,5.

To quantify the extent of degradation within cells, and to identify cells that have reached their End of Life (in EVs, this is typically defined as the point at which the discharge capacity has reduced to 80% of the nominal capacity^6,7), the scalar State of Health (SOH) metric is typically adopted, measured using previous cycle discharge capacity or internal resistance^{8,9,10,11,12,13}. The problem with this approach is that batteries with the same numerical SOH do not necessarily exhibit identical levels of each degradation process (for example, lithium plating or electrode cracking), yet the impact of future cell usage on the cell’s future performance and degradation pathway depends significantly on the type of degradation that has already occurred^14,15,16. Accurate forecasting of battery performance demands a non-invasive approach to acquire information about the cell state at a microscopic level.

Both short^{11,17,18,19,20} and long^21,22,23 timescale forecasting of battery performance are of interest in battery prognostics. Over a short timescale, predicting how the battery would respond to a particular charging and discharging protocol can be used to develop optimal charging protocols¹⁷. Short-term forecasting also encompasses SOH estimation^11,18,19,20: here, the aim is to predict the battery’s discharge capacity or internal resistance under a specific, standardised cycling protocol. Over a long timescale, the focus is on predicting the remaining useful life²¹, end of life²², or the ‘knee-point’ in the battery’s life trajectory at which degradation accelerates²³.

Approaches to both types of forecasting can be subdivided into empirical, physics-based, and data-driven models, with some models being a hybrid of these. Empirical approaches have been used to model long-term capacity fade with power laws but assume fixed operation over battery life and do not account for intrinsic differences in cell state at start of life. These approaches assume that all cells of the same chemistry will fade in the same way if operated in the same way, which is not observed in practice²⁴. In physics-based approaches, the battery is either modelled mechanistically using first principles analysis of internal physical and electrochemical processes, or using equivalent circuit modelling, which models the cell as a circuit comprising resistors and capacitors that are representative of the underlying electrochemical processes^25,26. Mechanistic models aim to capture how the battery voltage responds to an externally applied current (or vice versa), which can be used to predict optimal charging protocols¹⁷. However, the parameters of such models need to be updated for each individual cell and typically suffer from non-identifiability – several sets of model parameters could explain the observed data equally well, but would make drastically different predictions on test cells or on the same cell later in its life. For circuit-based models, the parameters of the circuit can be fitted to either current-voltage data^20,27, or to electrochemical impedance spectra^28,29. The circuit parameters can then be used to forecast capacity degradation under standardised use conditions²⁰ or to simulate the effect of different usage conditions on battery pack performance³⁰. However, it is challenging to capture every degradation mode in an analytical model. Further, a new set of model parameters must be learnt for each cell from cycle to cycle, making it challenging to infer a general cell-to-cell model.

Purely data-driven approaches to forecasting use raw data as input to a machine learning algorithm to forecast long term capacity fade, resistance increase and remaining useful life^31,32. Feature-based data-driven approaches applied machine learning on features extracted from the charging or discharging curve to predict discharge capacity¹⁹, remaining useful life²², and abrupt capacity decays^23,33. Innovations in extracting features from charge/discharge curves³⁴ and machine learning approaches for modelling time-series data^35,36 have enabled significant improvements in the accuracy of predictions. Further studies showed that using features of the discharge curve across a small number of initial cycles, it is possible to train machine learning models that can generalise to different cell chemistries³⁷. Going beyond charging and discharging curves, approaches such as electrochemical impedance spectroscopy (EIS)²¹, early cycle Coulombic efficiency³⁸, current interruption³⁹ and acoustic time-of-flight analysis^18,40 have been used for degradation forecasting. These approaches provide a fuller description of battery state – for example, EIS captures the response of the cell over a broad frequency range, with different frequencies correlating to distinct physical, chemical and mechanical changes in the active material^26,41,42,43. Data-driven methods typically utilise data generated in the laboratory setting, where cells are charged and discharged in the same way over the entirety of their lifetimes, thus the impact of variable cell usage on future performance can be ignored (see Fig. 1). However, extrapolating the models developed for laboratory setting to field data or other realistic usage profiles such as the Worldwide Harmonized Light Vehicles Test Cycles (WLTC)^13,44,45, where cells are cycled in vastly different ways over their lifetimes, has proved a major challenge⁷.

**Fig. 1: Schematic comparison of the proposed approach to previous research works.**

In this work, we seek to identify whether there exists a sufficiently informative marker of cell health that can be used to forecast short-term and longer term future performance, amid uneven historical and future cell usage. Figure 1 provides an illustration of our approach, and how it differs from previous approaches. We find that upon acquisition of an EIS spectrum just before charging, both next cycle and longer term cell capacity can be predicted with a test error of less than 10%. When testing on cells subjected to similar cycling conditions to those used to train the model, our model achieves comparable accuracy to state-of-the-art forecasting models (8.2% test error versus 8.8% test error), except that our model enables forecasting with no access to any historical data, whereas previous state-of-the-art models require historical data from the cell’s cycling trajectory. In addition, when extrapolating to different operating temperatures, our model significantly outperforms the state-of-the-art model, achieving a 57% reduction in test error (from 34.2% to 14.6%).

We observe that our model is data-efficient, requiring just eight cells to attain a test error of less than 10%. Crucially, our approach is robust to dataset shift, attaining a test error of less than 7% on a dataset with a different distribution of cycling patterns to the training set. This is important for deployment in the field where driving patterns may be different from those used to train the model. We additionally demonstrate that, if available, using additional features based on historical capacity–voltage data can serve to augment the state representation and reduce average test error by up to 25%. Our approach is robust with respect to cell manufacturer, average usage pattern and operating temperature.

Further, our work fills a gap in publicly available data by contributing a large corpus of cycling data on cells under dynamic working conditions⁴⁶. Our work focuses on a set of idealised usage distributions rather than realistic driving profile in order to demonstrate the extent of generalisability of the model. Our work departs from the NASA randomised usage dataset⁴⁷, which randomly cycles cells for 50 cycles before measuring the next cycle discharge capacity after charging via a ‘reference’ protocol. Although several models for forecasting degradation under randomised conditions have been built based on this data^12,19,48, the effect of a single protocol on next cycle discharge capacity cannot be disentangled, and there is a need for a reference charge/discharge protocol every few cycles which does not concord with typical field usage.

Results

Data generation

For this study, we generate two separate datasets corresponding to commercial LiR coin cells purchased from two different manufacturers, which allows us to test whether our approach is robust with respect to cell manufacturer.

The first dataset corresponds to 40 Powerstream LiR 2032 coin cells (nominal capacity 1C = 35 mAh). We subject 24 cells to a sequence of randomly selected charge and discharge currents at 23 ± 2 °C for 110–120 full charge/discharge cycles. Each cycle consists of an initial diagnosis of battery state, involving acquisition of the galvanostatic EIS spectrum, followed by usage, involving a charging and discharging stage. Charging and discharging consist of a two stage and one stage Constant Current (CC) protocol, respectively; the currents are randomly selected at each cycle in the ranges 70–140 mA (2–4 C), 35–105 mA (1–3 C), and 35–140 mA (1–4 C) respectively. To test the model’s robustness to domain shift, we additionally cycle the remaining 16 cells under the same conditions as above, except now fixing the discharge current for all cells and cycles at 52.5mA (1.5 C) instead of randomly changing the discharge current at each cycle. The space of protocols considered is illustrated in Fig. 2 and an example of the capacity trajectories of three cells is provided in Supplementary Fig. 1 for illustration of the difference from typical monotonic capacity fade experiments. A complete description of cycling protocols is provided in the Methods and the full set of operating conditions that each cell is subjected to is detailed in Supplementary Table 1.

**Fig. 2: Proposed charge-discharge protocol.**

Having used the first dataset to confirm the approach can successfully forecast discharge capacity several cycles ahead, we later significantly expand our analysis to explore the model’s robustness to cell manufacturer, changes to usage pattern and operating temperature. To achieve this, we cycle an additional 48 cells from a second manufacturer, RS Pro (nominal capacity 40 mAh), under a much wider range of usage patterns. In this case, each cell is again subjected to 100 cycles of two-stage CC charging, and one-stage CC discharging, with the three rates randomly selected at the start of each cycle. However, we now make the problem more challenging by having a different distribution of currents for each cell, to replicate the scenario in which different battery users have different average usage patterns to each other, but still exhibit random cycle-to-cycle behaviour. Of these cells, sixteen are also cycled at a higher operating temperature of 35 °C.

Capacity forecasting using EIS

We first consider the setting in which we want to predict the next cycle discharge capacity, for a cell whose usage history (including for example, cycle or calendar age, or historical capacity–voltage data) is completely unknown, if we apply a particular charging and discharging profile. We frame the problem as a regression task, and train a probabilistic machine learning model to learn the mapping Q_n = f(s_n, a_n), with uncertainty estimates, where s_n is the battery state at the start of the nth cycle, a_n is the future action (the nth cycle charge/discharge protocol), and Q_n is the discharge capacity measured at the end of the cycle. The battery state vector s_n is formed from the concatenation of the real (\({Z}_{{{{{{{{\rm{re}}}}}}}}}\)) and imaginary (\({Z}_{{{{{{{{\rm{im}}}}}}}}}\)) components of the impedance measured at 57 frequencies, ω₁, . . . , ω₅₇, in the range 0.02Hz-20kHz; \({{{{{{{{\bf{s}}}}}}}}}_{n}=[{Z}_{{{{{{{{\rm{re}}}}}}}}}({\omega }_{1}),{Z}_{{{{{{{{\rm{im}}}}}}}}}({\omega }_{1}),...,{Z}_{{{{{{{{\rm{re}}}}}}}}}({\omega }_{57}),{Z}_{{{{{{{{\rm{im}}}}}}}}}({\omega }_{57})]\). The action vector is formed from the concatenation of the nth cycle charge and discharge currents.

Figure 3 illustrates the accuracy of our model. Using both state and action as input, the next cycle discharge capacity is predicted with an average error of 8.2%. Importantly, both state and action (Fig. 3a) are found to be necessary to predict future cell performance: if state (Fig. 3b) or action (Fig. 3c) alone are used as inputs, the test error approximately doubles to 20.7% and 15.4% respectively. This demonstrates the importance of both the cell’s internal health and the externally selected usage in determining realised cell performance.

**Fig. 3: Predicting next cycle discharge capacity.**

For applications such as optimised charging and repurposing triaging, it is important that a model of battery life trajectory can forecast not only the immediate next cycle discharge capacity, but also capacity several cycles into the future^49,50. With this in mind, we next investigate how the predictive accuracy of the model changes as we push the model to predict capacity further into the future. In each case, the input comprises the concatenation of the state representation at the start of the nth cycle, s_n, with the ‘action’ vector a_n...n+j comprising all charging and discharging currents that will be applied between cycle n and cycle n + j.

Figure 4 shows how the coefficient of determination R² changes with j. As expected, the accuracy of the model generally decreases as the forecasting interval increases. However, the model still attains R² = 0.75 when projecting 40 cycles into the future.

Data efficiency and robustness to domain shift

We next test the robustness of our method by investigating data efficiency and model generalisability. To test data efficiency, we measure how performance changes as the number of cells used to train the model increases. As seen in Fig. 5, there is a marked reduction in test error from 23.8% to 8.2% as the number of cells increases from two to 22. Nevertheless, the model is demonstrably data-efficient, with just eight cells needed to obtain a test error of less than 10%.

An important test of model generalisability is to study model accuracy when the domain distribution changes, i.e. when the model is being deployed in settings that are different from the training data¹². This is important for deployment in the field as the approach needs to be robust to driving patterns that might be different from the training data⁸. We test model robustness by cycling an additional 16 cells from the same manufacturer, but now adjusting the cycling protocol by fixing the discharge current to 1.5C for each cell throughout its life. We use a model trained using only cells that were subjected to random discharge currents over their lifetime, to predict next-cycle discharge capacity of cells subjected to fixed discharging. To illustrate the difference in training and test datasets, the distribution of discharge capacities is shown for each in Fig. 6a.

The predictive accuracy of the model on the fixed discharge dataset is illustrated in Fig. 6b. Promisingly, the model attains a test error of just 6.3% on this domain-shifted dataset, which corresponds to R² = 0.76.

Our model also outputs predictive uncertainty, which indicates how certain the model is about the quality of its predictions. It is especially important in the domain-shifted setting that the model ‘knows what it does not know’ and estimates high predictive uncertainty about data points that it is likely to obtain a high error on. We can test the model’s ability to estimate its uncertainty by observing how the average test error changes as the number of data points is reduced to include only the data points that the model is most confident about. If a model can successfully estimate its level of certainty, the average test error should reduce as the proportion of data is reduced to include only the most confidently predicted points. Figure 6c shows a 32% reduction in root-mean-squared error (RMSE) as the proportion of data is reduced from 100% to the most confident 25%, demonstrating that our model has learnt which predictions it should be confident about.

Comparison of state representations

Having demonstrated the ability of the EIS spectrum to capture battery state, we now benchmark this representation of battery health against other approaches utilised in the literature, including the state-of-the-art feature-based method^22,51, and consider whether there are additional features to the EIS spectrum that can serve to augment battery state. Simple measures that have been used to forecast or estimate battery SOH include using the previous cycle discharge capacity, or the capacity throughput since cycling commenced. More advanced approaches include extracting features of the historical capacity–voltage discharge curves, as shown in Fig. 1. The state-of-the-art approach to extracting such features was implemented by Severson et al.²² and inspired the approaches to feature extraction used recently by Attia et al. and Paulson et al.^37,51. We benchmark how our EIS-based approach performs relative to those state-of-the-art features.

Further, we assess whether incorporation of physical interpretations, in the form of equivalent circuit models (ECM), improves predictions. We use the widely implemented Randles circuit model, comprising a series resistance, connected with a resistance in parallel with a capacitance and a Warburg impedance element, as well as the more complex Extended Randles circuit, which adds an additional resistor-capacitor parallel combination in series to the Randles circuit. The ECM is fitted to the spectrum (at an associated computational cost) and we use the extracted parameters as the state representation instead of raw EIS data.

In total, we consider the following features in our benchmark:

Previous cycle discharge capacity Q_n−1.
Capacity throughput (CT) since cycling commenced, as defined by the sum of cell charge and discharge capacities from cycles 0 to n − 1.
State of Health (SOH), as defined by Q_n−1/Q₀.
State-of-the-art features of the capacity–voltage discharge curve (CVF): Following Severson et al.²², we form a state representation at the start of cycle n by extracting features from the capacity–voltage discharge curve after cycle n − 1. We fit each curve to a spline function, linearly interpolating to measure capacity at 1000 evenly spaced voltages from \({V}_{\min }\) to \({V}_{\max }\). This 1000-dimensional capacity vector Q_n−1 is normalised by subtracting the equivalent vector from cycle 0, Q₀. The following features are then used as inputs: \({V}_{\max }\), \({V}_{\min }\), \(\log ({{{{{{{\rm{var}}}}}}}}({{{{{{{{\bf{Q}}}}}}}}}_{n-1}-{{{{{{{{\bf{Q}}}}}}}}}_{0}))\), \(\log ({{{{{{{\rm{IQR}}}}}}}}({{{{{{{{\bf{Q}}}}}}}}}_{n-1}-{{{{{{{{\bf{Q}}}}}}}}}_{0}))\). Additionally, we fit the capacity to a sigmoid \(Q(\tilde{V})=\frac{{p}_{0}}{1.0+\exp ({p}_{1}(\tilde{V}-{p}_{2}))}\) where \(\tilde{V}\) is the normalised voltage and use the parameters p₀, p₁, p₂ as features.
Equivalent circuit model parameters (ECM-R and ECM-ER): We fit equivalent circuit models using the Randles circuit (ECM-R) and Extended Randles circuit (ECM-ER) to the EIS spectra and concatenate the obtained parameters together.

We note that in contrast to EIS features, the formation of a state representation using the first four aforementioned features demands access to historical current-voltage data, over at least the entirety of the previous discharge and for some features, over the entire cell lifetime. However, they benefit from the advantage of not requiring equipment to measure the EIS spectrum, which comes with an associated financial and temporal cost. Forming a state representation using the ECM parameters (extracted from the EIS spectrum) has an associated computational cost and can be considered a form of dimensionality reduction of the raw EIS data. An additional problem faced by ECMs in general is non-uniqueness, in that multiple different combinations of ECM parameters can generally explain a particular EIS spectrum equally well⁵².

Table 1 shows how the state representation impacts test error and model goodness of fit. In all cases, the model is trained to predict the next cycle discharge capacity, given the next cycle protocol and the chosen state representation. Interrogating the relative importance of features, we first consider the baseline of using EIS only (without including the protocol) and using the protocol only (without including EIS). Perhaps unsurprisingly, battery degradation is a function of both the current state and future charge/discharge protocol. As such, using both EIS and the protocol significantly outperforms using EIS only or using the protocol only.

Table 1 Comparison of state representations

Full size table

We then explore the impact of physics-based representation of the EIS spectrum, using the Randles (ECM-R) and extended Randles (ECM-ER) equivalent circuit models. Comparing EIS + Protocol with ECM-R + Protocol and ECM-ER + Protocol reveals that these physics-based models lose information, and using a machine learning approach to directly learn from raw data might be advantageous.

We next consider the different approaches that have been reported in the literature, Q_n−1, SOH, CT, and CVF, with CVF being the state-of-the-art in the battery informatics literature. In all cases, EIS + Protocol outperforms those other features with Protocol, although CVF is competitive.

Interestingly, information from capacity–voltage curve data (CVF) is complementary to EIS - combining EIS with these features leads to a significant increase in accuracy (EIS + CVF + Protocol). This is perhaps unsurprising, as EIS probes the impedance of the single ‘static’ cell discharged state (with high information content per instant state), whilst capacity–voltage curves probe how the cell state evolves continuously over the path from charged to discharged (with low information content per instant state).

Finally, the best model performance is attained by combining all of the above features to form the state representation. In this case the average test error is just 6.2%.

Robustness to different cell manufacturers

We now extend our analysis to explore how robust our approach is to changing the cell manufacturer, adjusting the operating temperature and adjusting the average use pattern. We repeat our experiment on a new batch of 32 commercial LiR coin cells (of nominal capacity 1 C = 40 mAh) from RS Pro, a different manufacturer, except we now make the problem significantly more challenging by subjecting different subgroups of cells to one of four different usage distributions. These usage distributions are shown in Supplementary Table 1.

We measure the accuracy of the model in two ways: firstly, we consider the case where the model is exposed to cells that have been subjected to the same distribution of protocols as the test set (random splitting), and second, the more challenging case where the model is only trained on the cells which are subjected to three of the cycling protocol distributions and tested on the remaining eight cells subjected to a different cycling protocol. This is a much harder task as the average usage on the test cells is very different to the average usage on the training cells—it is a test of whether the model can extrapolate to different average use not just different cycle-to-cycle use.

The results for different state representations are shown in Table 2 for both the case where the train/test split is random, and where the split is stratified into different usage patterns. Comparable observations are made for cells purchased from the second manufacturer: namely, the most accurate predictions are made when the state representation is formed using features of the EIS spectrum alongside those formed from the discharge curve (CVF). As expected, the model performs significantly better when it has been trained on data from some cells that have been exposed to a similar distribution of cycling patterns as those that the model is tested on. However, the model remains performant in the scaffold split scenario, and in this setting the test error reduces by 30% when the state representation is formed using the EIS spectrum alongside the features of the discharge curve, instead of solely using features of the discharge curve. These additional results further demonstrate that if available, both the EIS spectrum and discharge curve can act as informative markers of the battery’s internal state, but that they are complementary to each other.

Table 2 Robustness of approach to cell manufacturer

Full size table

We next verify that the model is robust with respect to changing external operating temperature. We cycle an additional 16 cells at 35 °C and test the model trained on data from cells cycled at room temperature. Table 3 shows that our model can extrapolate to cells operated at these higher temperatures, but that the EIS spectrum plays a particularly important role in characterising the battery state when the cell is not operated at the same temperature. The model obtains a test error of 34.2% when only the discharge curve features are used to characterise state, which reduces to 14.0% when both the EIS spectrum and discharge curve features are used. This further demonstrates the additional information that EIS signals contain relative to charging-discharging curves, and supports the hypothesis that EIS implicitly tracks temperature⁵³.

Table 3 Robustness to operating temperature

Full size table

Discussion

In this paper, we showed that the electrochemical impedance spectrum accurately characterises the internal state of a cell, and a machine learning model can be trained to accurately forecast both immediate and longer term cell performance with predictive uncertainty, even amid uneven and unknown historical cell usage. Our model achieves comparable accuracy (8.2% test error) to the state-of-the-art forecasting approach (8.8% test error) when testing on cells subjected to the same distribution of operating conditions as the cells used to train the model. However, as outlined in Fig. 1, the state-of-the-art approach demands access to historical cycling data whereas our model enables forecasting with no historical data. Additionally, our model significantly outperforms the state-of-the-art model when extrapolating to a higher operating temperature, with a 57% reduction in test error (from 34.2% to 14.6%).

Our method is data-efficient, achieving a next-cycle test error of 9.9% with training data from just eight cells, and is robust to shifts in dataset distributions. Additionally, we find that there is scope to boost model performance by 25% if historical cycling data is available; such data can be used to derive features that augment the cell state representation. We demonstrate that our approach can be utilised across different cell chemistries, and the model is robust to different operating temperatures.

Our approach differentiates from the prior art in two important ways : First, we employ an information-rich electrical signal—EIS—which captures the response of the cell across different timescales without any knowledge of the cycling history. This is in contrast to most existing methods which employ features from the charging–discharging curve—a significantly more coarse-grained signal—as input to machine learning models. Our results suggest significant improvements in battery management systems abound by incorporating circuitries that measure electrochemical impedance, albeit at a financial and temporal cost.

Second, we focus on uneven cycling, where the charging and discharging rates vary from cycle to cycle. This departs from previous studies on machine learning for battery degradation which focused on constant charge/discharge conditions, which are typical in battery testing. Our results problematise the concept of a single scalar State of Health, as the state of the battery is dependent on the extent of the myriad different degradation mechanisms, which in turn depends on the sequence of historic charge/discharge protocols. Rather, we suggest that a cell can be described by a multidimensional state vector, captured using informative high-dimensional measurements like EIS, and a machine learning approach can be used to predict future capacities given the state vector and future charge/discharge protocols. Furthermore, although in this work we only consider forecasting starting from an initially discharged state, we hypothesise that it should be possible in future work to forecast discharge capacity starting from any state of charge based on the EIS measurement, since EIS spectrum implicitly tracks state of charge^54,55,56.

We note that the general framework that we have laid out for predicting future battery performance given current cell state and future actions has scope to be applied in a broad range of battery diagnostic and control settings. For example, predicting the effect of a proposed charging protocol on next cycle discharge capacity as well as long term degradation is important for optimising rapid charging applications⁵¹, where a balance must be achieved between charging time and rate of cell degradation⁵⁷. Our work can additionally be extended to consider more complicated dynamic usage protocols, such as WLTC.

Methods

Battery cycling

For this study we cycle 88 commercial LiR coin cells purchased from two different manufacturers, Powerstream and RS Pro, in a temperature regulated laboratory at 23 ± 2 °C. A Biologic BCS-805 potentiostat is used for cycling, and photographs of the experimental setup are provided in Supplementary Fig. 2.

Across all datasets, cells are subjected to a sequence of randomly selected charge and discharge currents for 110–120 full charge/discharge cycles. Cycling commences when the cell is in the fully discharged state, and each cycle comprises the following steps: (a) resting for 20 min at the open circuit voltage, (b) acquisition of the galvanostatic EIS spectrum in the fully discharged state, (c) two stage CC charging, (d) resting for 20 min at the open circuit voltage, (e) acquisition of the galvanostatic EIS spectrum in the fully charged state, (f) one stage CC discharging. The galvanostatic EIS spectrum is always measured by collecting impedance measurements at 57 frequencies uniformly distributed in the log domain in the range 0.02Hz-20kHz using a sinusoidal current with amplitude of 5 mA. Cells are cycled in a temperature-controlled lab room at 23 ± 2 °C.

To generate the first dataset, we cycle 24 Powerstream LiR 2032 coin cells (nominal capacity 1 C = 35 mAh). For these cells, charging consists of a two-stage CC protocol; currents are randomly selected in the ranges 70–140mA (2C–4C) and 35mA-105mA (1C-3C) in stages 1 and 2 respectively. A time limit of 15 min is set for each charging stage such that the total charging time is constrained to be 30 min or less. Charging will stop before the 30 min time limit if the safety threshold voltage of 4.3 V is reached. During discharging, a single constant discharge current, randomly selected in the range 35mA-140mA (1C–4C), is applied, until the voltage drops to 3.0 V.

An additional 16 cells (also manufactured by Powerstream and of nominal capacity 35 mAh) are cycled under the same conditions, except now we fix the discharge current at 52.5 mA (1.5C) for all cells and cycles, instead of randomly changing the discharge current at each cycle.

We then generate a second dataset that enables exploration of the model’s robustness to cell manufacturer, changes to usage pattern and operating temperature. We cycle 48 cells from a second manufacturer, RS Pro (nominal capacity 40 mAh), under a much wider range of usage patterns. The general six-step cycling protocol remains the same as described above, with each cell again being subjected to 100 cycles of two-stage CC charging, and one-stage CC discharging, with the three rates randomly selected at the start of each cycle. However, the distribution of currents now changes for each cell. Of these cells, sixteen are also cycled at a higher operating temperature of 35 ± 2 °C, in a temperature-controlled heating chamber. A description of the full set of operating conditions that each cell is subjected to is detailed in Supplementary Table 1.

Machine learning model

All problems in this study are framed as regression tasks. We train a probabilistic machine learning model to learn the mapping Q_j = f(s_n, a_n...j), with uncertainty estimates, where s_n is the battery state at the start of the nth cycle, a_n is the set of future cycling protocols applied over cycles n to j, and Q_j is the discharge capacity at the end of the jth cycle. The battery state vector s_n is generally formed from the concatenation of the real (\({Z}_{{{{{{{{\rm{re}}}}}}}}}\)) and imaginary (\({Z}_{{{{{{{{\rm{im}}}}}}}}}\)) components of the galvanostatic EIS spectrum measured in the fully discharged state at the start of the cycle at 57 frequencies, ω₁, . . . , ω₅₇, in the range 0.02Hz-20kHz; \({{{{{{{{\bf{s}}}}}}}}}_{n}=[{Z}_{{{{{{{{\rm{re}}}}}}}}}({\omega }_{1}),\, {Z}_{{{{{{{{\rm{im}}}}}}}}}({\omega }_{1}),...,\, {Z}_{{{{{{{{\rm{re}}}}}}}}}({\omega }_{57}),\, {Z}_{{{{{{{{\rm{im}}}}}}}}}({\omega }_{57})]\). For the task of predicting next cycle discharge capacity, the action vector a_n is formed from the concatenation of the nth cycle charge and discharge currents. When predicting discharge capacity several cycles, j, ahead of time, the future protocol is now formed from all charging and discharging currents that will be applied between cycle n and cycle n + j.

For the machine learning model, we use an ensemble of 10 XGBoost models⁵⁸, each with 500 estimators and a maximum depth of 100. The mean and standard deviation of the predictions made by each model in the ensemble are used to quantify the predicted output and the predictive uncertainty. To test model performance we use the median R² score and median percentage error. To obtain test metrics from a dataset comprising N cells, we randomly leave two test cells out, train on the remaining N−2 cells and repeat this process N/2 times, leaving different cells out each time.

Data availability

The data generated in this study are provided in the Zenobo database at https://doi.org/10.5281/zenodo.6645536⁵⁹.

Code availability

The code required to reproduce this manuscript is available at https://github.com/PenelopeJones/battery-forecasting⁶⁰.

References

Gogoana, R., Pinson, M. B., Bazant, M. Z. & Sarma, S. E. Internal resistance matching for parallel-connected lithium-ion cells and impacts on battery pack cycle life. J. Power Sources 252, 8–13 (2014).
Article ADS CAS Google Scholar
Brand, M. J., Hofmann, M. H., Steinhardt, M., Schuster, S. F. & Jossen, A. Current distribution within parallel-connected battery cells. J. Power Sources 334, 202–212 (2016).
Article ADS CAS Google Scholar
Bruen, T. & Marco, J. Modelling and experimental evaluation of parallel connected lithium ion cells for an electric vehicle battery system. J. Power Sources 310, 91–101 (2016).
Article ADS CAS Google Scholar
An, F., Chen, L., Huang, J., Zhang, J. & Li, P. Rate dependence of cell-to-cell variations of lithium-ion cells. Sci. Rep. 6, 35051 (2021).
Article ADS CAS Google Scholar
Schindler, M., Sturm, J., Ludwig, S., Schmitt, J. & Jossen, A. Evolution of initial cell-to-cell variations during a three-year production cycle. eTransportation 8, 100102 (2021).
Article Google Scholar
No, author. Research and development of high-power and high-energy electrochemical storage devices. https://www.osti.gov/biblio/1160224 (2014).
Sulzer, V. et al. The challenge and opportunity of battery lifetime prediction from field data. Joule 5, 1934–1955 (2021).
Article Google Scholar
Goebel, K., Saha, B., Saxena, A., Celaya, J. R. & Christophersen, J. P. Prognostics in battery health management. IEEE Instrum. Meas. Mag. 11, 33–40 (2008).
Article Google Scholar
Asakura, K., Shimomura, M. & Shodai, T. Study of life evaluation methods for li-ion batteries for backup applications. J. Power Sources 119, 902–905 (2003).
Article ADS CAS Google Scholar
Liu, D., Pang, J., Zhou, J., Peng, Y. & Pecht, M. Prognostics for state of health estimation of lithium-ion batteries based on combination gaussian process functional regression. Microelectron. Reliab. 53, 832–839 (2013).
Article CAS Google Scholar
Berecibar, M. et al. Critical review of state of health estimation methods of li-ion batteries for real applications. Renew. Sustain. Energy Rev. 56, 572–587 (2016).
Article CAS Google Scholar
Richardson, R. R., Osborne, M. A. & Howey, D. A. Gaussian process regression for forecasting battery state of health. J. Power Sources 357, 209–219 (2017).
Article ADS CAS Google Scholar
Hu, X., Xu, L., Lin, X. & Pecht, M. Battery lifetime prognostics. Joule 4, 310–346 (2020).
Article CAS Google Scholar
Broussely, M. et al. Main aging mechanisms in Li ion batteries. J. Power Sources 146, 90–96 (2005).
Article ADS CAS Google Scholar
Koleti, U. R., Dinh, T. Q. & Marco, J. A new on-line method for lithium plating detection in lithium-ion batteries. J. Power Sources 451, 227798 (2020).
Article CAS Google Scholar
Koleti, U. R., Bui, T. N. M., Dinh, T. Q. & Marco, J. The development of optimal charging protocols for lithium-ion batteries to reduce lithium plating. J. Energy Storage 39, 102573 (2021).
Article Google Scholar
Lucia, S. et al. Towards adaptive health-aware charging of li-ion batteries: a real-time predictive control approach using first-principles models. In: American Control Conference, (ed. Sun, J.) 4717–4722 (IEEE, 2017).
Davies, G. et al. State of charge and state of health estimation using electrochemical acoustic time of flight analysis. J. Electrochem. Soc. 164, A2746 (2017).
Article CAS Google Scholar
Yang, D., Zhang, X., Pan, R., Wang, Y. & Chen, Z. A novel gaussian process regression model for state-of-health estimation of lithium-ion battery using charging curve. J. Power Sources 384, 387–395 (2018).
Article ADS CAS Google Scholar
Pan, H., Lü, Z., Wang, H., Wei, H. & Chen, L. Novel battery state-of-health online estimation method using multiple health indicators and an extreme learning machine. Energy 160, 466–477 (2018).
Article Google Scholar
Zhang, Y. et al. Identifying degradation patterns of lithium ion batteries from impedance spectroscopy using machine learning. Nat. Commun. 11, 1–6 (2020).
ADS CAS Google Scholar
Severson, K. A. et al. Data-driven prediction of battery cycle life before capacity degradation. Nat. Energy 4, 383–391 (2019).
Article ADS Google Scholar
Fermín-Cueto, P. et al. Identification and machine learning prediction of knee-point and knee-onset in capacity degradation curves of lithium-ion cells. Energy AI 1, 100006 (2020).
Article Google Scholar
Bloom, I. et al. An accelerated calendar and cycle life study of li-ion cells. J. Power Sources 101, 238–247 (2001).
Article ADS CAS Google Scholar
Gaberšček, M. Understanding li-based battery materials via electrochemical impedance spectroscopy. Nat. Commun. 12, 1–4 (2021).
Article CAS Google Scholar
Meddings, N. et al. Application of electrochemical impedance spectroscopy to commercial li-ion cells: a review. J. Power Sources 480, 228742 (2020).
Article CAS Google Scholar
Zou, Y., Hu, X., Ma, H. & Li, S. E. Combined state of charge and state of health estimation over lithium-ion battery cell cycle lifespan for electric vehicles. J. Power Sources 273, 793–803 (2015).
Article ADS CAS Google Scholar
Vyroubal, P. & Kazda, T. Equivalent circuit model parameters extraction for lithium ion batteries using electrochemical impedance spectroscopy. J. Energy Storage 15, 23–31 (2018).
Article Google Scholar
Westerhoff, U., Kurbach, K., Lienesch, F. & Kurrat, M. Analysis of lithium-ion battery models based on electrochemical impedance spectroscopy. Energy Technol. 4, 1620–1630 (2016).
Article CAS Google Scholar
Samadani, E., Mastali, M., Farhad, S., Fraser, R. A. & Fowler, M. Li-ion battery performance and degradation in electric vehicles under different usage scenarios. Int. J. Energy Res. 40, 379–392 (2016).
Article CAS Google Scholar
Strange, C. & dos Reis, G. Prediction of future capacity and internal resistance of li-ion cells from one cycle of input data. Energy AI 5, 100097 (2021).
Article Google Scholar
Liu, K., Shang, Y., Ouyang, Q. & Widanage, W. D. A data-driven approach with uncertainty quantification for predicting future capacities and remaining useful life of lithium-ion battery. IEEE Trans. Ind. Electron. 68, 3170–3180 (2021).
Article ADS Google Scholar
Li, W. et al. One-shot battery degradation trajectory prediction with deep learning. J. Power Sources 2021, 230024 (2021).
Roman, D., Saxena, S., Robu, V., Pecht, M. & Flynn, D. Machine learning pipeline for battery state-of-health estimation. Nat. Mach. Intell. 3, 447–456 (2021).
Article Google Scholar
Zhao, R., Kollmeyer, P. J., Lorenz, R. D. & Jahns, T. M. A compact unified methodology via a recurrent neural network for accurate modeling of lithium-ion battery voltage and state-of-charge. In: 2017 IEEE Energy Conversion Congress and Exposition. (ed. Knight, A.) 5234–5241 (IEEE, 2017).
Li, W. et al. Online capacity estimation of lithium-ion batteries with deep long short-term memory networks. J. Power Sources 482, 228863 (2021).
Article CAS Google Scholar
Paulson, N. H. et al. Feature engineering for machine learning enabled early prediction of battery lifetime. J. Power Sources 527, 231127 (2022).
Article CAS Google Scholar
Burns, J. et al. Predicting and extending the lifetime of li-ion batteries. J. Electrochem. Soc. 160, A1451 (2013).
Article CAS Google Scholar
Geng, Z., Thiringer, T. & Lacey, M. J. Intermittent current interruption method for commercial lithium-ion batteries aging characterization. IEEE Trans. Transp. Electrif. 8, 2985–2995 (2022).
Article Google Scholar
Bommier, C. et al. In operando acoustic detection of lithium metal plating in commercial licoo2/graphite pouch cells. Cell Rep. Phys. Sci. 1, 100035 (2020).
Article Google Scholar
Vetter, J. et al. Ageing mechanisms in lithium-ion batteries. J. Power Sources 147, 269–281 (2005).
Article ADS CAS Google Scholar
Ecker, M. et al. Development of a lifetime prediction model for lithium-ion batteries based on extended accelerated aging test data. J. Power Sources 215, 248–257 (2012).
Article ADS CAS Google Scholar
Zhang, Y., Wang, C.-Y. & Tang, X. Cycling degradation of an automotive LiFePO4 lithium-ion battery. J. Power Sources 196, 1513–1520 (2011).
Article ADS CAS Google Scholar
Nuhic, A., Terzimehic, T., Soczka-Guth, T., Buchholz, M. & Dietmayer, K. Health diagnosis and remaining useful life prognostics of lithium-ion batteries using data-driven methods. J. Power Sources 239, 680–688 (2013).
Article CAS Google Scholar
de Hoog, J. et al. Combined cycling and calendar capacity fade modeling of a nickel-manganese-cobalt oxide cell with real-life profile validation. Appl. Energy 200, 47–61 (2017).
Article CAS Google Scholar
Hu, X., Xu, L., Lin, X. & Pecht, M. Battery lifetime prognostics. Joule 4, 310–346 (2020).
Article CAS Google Scholar
Bole, B., Kulkarni, C. S. & Daigle, M.Adaptation of an electrochemistry-based Li-ion battery model to account for deterioration observed under randomized use. (eds. He, D. & Byington, C.) (Annual Conference of the PHM Society, 2014).
Richardson, R. R., Birkl, C. R., Osborne, M. A. & Howey, D. A. Gaussian process regression for in situ capacity estimation of lithium-ion batteries. IEEE Trans. Ind. Inform. 15, 127–138 (2019).
Article Google Scholar
Perez, H. E., Hu, X., Dey, S. & Moura, S. J. Optimal charging of li-ion batteries with coupled electro-thermal-aging dynamics. IEEE Trans. Vehicular Technol. 66, 7761–7770 (2017).
Article Google Scholar
Neubauer, J. & Pesaran, A. The ability of battery second use strategies to impact plug-in electric vehicle prices and serve utility energy storage applications. J. Power Sources 196, 10351–10358 (2011).
Article ADS CAS Google Scholar
Attia, P. M. et al. Closed-loop optimization of fast-charging protocols for batteries with machine learning. Nature 578, 397–402 (2020).
Article ADS CAS PubMed Google Scholar
Harrington, D. A. & van den Driessche, P. Mechanism and equivalent circuits in electrochemical impedance spectroscopy. Electrochim. Acta 56, 8005–8013 (2011).
Article CAS Google Scholar
Raijmakers, L., Danilov, D., van Lammeren, J., Lammers, M. & Notten, P. Sensorless battery temperature measurements based on electrochemical impedance spectroscopy. J. Power Sources 247, 539–544 (2014).
Article ADS CAS Google Scholar
Rodrigues, S., Munichandraiah, N. & Shukla, A. Ac impedance and state-of-charge analysis of a sealed lithium-ion rechargeable battery. J. Solid State Electrochem. 3, 397–405 (1999).
Article CAS Google Scholar
Rodrigues, S., Munichandraiah, N. & Shukla, A. A review of state-of-charge indication of batteries by means of ac impedance measurements. J. Power Sources 87, 12–20 (2000).
Article ADS CAS Google Scholar
Xu, J., Mi, C. C., Cao, B. & Cao, J. A new method to estimate the state of charge of lithium-ion batteries based on the battery impedance model. J. Power Sources 233, 277–284 (2013).
Article CAS Google Scholar
Keil, P. & Jossen, A. Charging protocols for lithium-ion batteries and their impact on cycle life—an experimental study with different 18650 high-power cells. J. Energy Storage 6, 125–141 (2016).
Article Google Scholar
Chen, T. & Guestrin, C. XGBoost: a scalable tree boosting system. In: Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. (eds. Krishnapuram, B. & Shah, M.) (Association for Computing Machinery, 2016).
Jones, P., Stimming, U. & Lee, A. Impedance-based forecasting of battery performance amid uneven usage. https://doi.org/10.5281/zenodo.6645536 (2021).
Jones, P. K. & Lee, A. A. Impedance based forecasting of battery performance under uneven future use. https://github.com/PenelopeJones/battery-forecasting (2022).

Download references

Acknowledgements

P.K.J. and A.A.L. acknowledge the support of the Winton Programme for the Physics of Sustainability. P.K.J. acknowledges support from the Ernest Oppenheimer Fund and the Alan Turing Institute (EPSRC EP/W001381/1). A.A.L. acknowledges support from the Royal Society. The authors thank Dr Yunwei Zhang for helpful discussions.

Author information

Authors and Affiliations

Department of Physics, University of Cambridge, Cambridge, UK
Penelope K. Jones & Alpha A. Lee
The Alan Turing Institute, London, UK
Penelope K. Jones
Chemistry, School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, UK
Ulrich Stimming

Authors

Penelope K. Jones
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Stimming
View author publications
You can also search for this author in PubMed Google Scholar
Alpha A. Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.K.J. and A.A.L conceived the study. P.K.J. carried out the experiments, analysed the experimental data and developed the machine learning methodology. P.K.J. and A.A.L. wrote the paper. P.K.J., U.S. and A.A.L. discussed the results and commented on the manuscript.

Corresponding author

Correspondence to Alpha A. Lee.

Ethics declarations

Competing interests

P.K.J. and A.A.L. are co-founders and equity owners of Byterat Ltd, a company focused on the development of software for batteries. The authors declare no additional competing interests.

Peer review

Peer review information

Nature Communications thanks Shahab Shokrzadeh, Md Sazzad Hosen and the other anonymous reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jones, P.K., Stimming, U. & Lee, A.A. Impedance-based forecasting of lithium-ion battery performance amid uneven usage. Nat Commun 13, 4806 (2022). https://doi.org/10.1038/s41467-022-32422-w

Download citation

Received: 15 November 2021
Accepted: 28 July 2022
Published: 16 August 2022
DOI: https://doi.org/10.1038/s41467-022-32422-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.