Deep learning to estimate lithium-ion battery state of health without additional degradation experiments

Lu, Jiahuan; Xiong, Rui; Tian, Jinpeng; Wang, Chenxu; Sun, Fengchun

doi:10.1038/s41467-023-38458-w

Download PDF

Article
Open access
Published: 13 May 2023

Deep learning to estimate lithium-ion battery state of health without additional degradation experiments

Nature Communications volume 14, Article number: 2760 (2023) Cite this article

18k Accesses
23 Citations
13 Altmetric
Metrics details

Subjects

Abstract

State of health is a critical state which evaluates the degradation level of batteries. However, it cannot be measured directly but requires estimation. While accurate state of health estimation has progressed markedly, the time- and resource-consuming degradation experiments to generate target battery labels hinder the development of state of health estimation methods. In this article, we design a deep-learning framework to enable the estimation of battery state of health in the absence of target battery labels. This framework integrates a swarm of deep neural networks equipped with domain adaptation to produce accurate estimation. We employ 65 commercial batteries from 5 different manufacturers to generate 71,588 samples for cross-validation. The validation results indicate that the proposed framework can ensure absolute errors of less than 3% for 89.4% of samples (less than 5% for 98.9% of samples), with a maximum absolute error of less than 8.87% in the absence of target labels. This work emphasizes the power of deep learning in precluding degradation experiments and highlights the promise of rapid development of battery management algorithms for new-generation batteries using only previous experimental data.

Transfer learning based generalized framework for state of health estimation of Li-ion cells

Article Open access 01 August 2022

Subhasmita Sahoo, Krishnan S. Hariharan, … Sangheon Lee

Deep learning approach towards accurate state of charge estimation for lithium-ion batteries using self-supervised transformer model

Article Open access 01 October 2021

M. A. Hannan, D. N. T. How, … F. Blaabjerg

Predicting the state of charge and health of batteries using data-driven machine learning

Article 02 March 2020

Man-Fai Ng, Jin Zhao, … Zhi Wei Seh

Introduction

Lithium-ion batteries (LIBs) offer high energy density, fast response, and environmental friendliness¹, and have unprecedentedly spurred the penetration of renewable energy^2,3,4. The global market of LIBs displays staggering figures in 2020, up to 142.8 GWh on the side of electric vehicles, and it is expected to exceed 91.8 billion dollars⁵ in the next few years. While LIBs are being popularized at a phenomenal rate, their prolonged applications are facing tough challenges. As is the case with machines, LIB components such as electrodes and separators experience varying levels of degradation. These negative spillovers lead to capacity and power fade^6,7 and thereby imperil the assets⁸. To ensure safe and efficient battery management, obtaining an accurate battery state of health (SOH) is of vital importance.

Battery SOH has been defined in various forms. It can be defined by the service time⁹ or by the increase in the internal resistance¹⁰. Although these variables are easily measurable, battery degradation is also accompanied by capacity loss, whose accurate determination impacts other battery management tasks such as driving range estimation and life prediction. Thus, SOH, defined as the ratio between the present capacity and the initial capacity, is drawing broad attention^11,12. However, the capacity measurement requires completely charging or discharging the batteries with specific protocols¹³, which is not practical for batteries in use. This motivates the SOH estimation from daily operating data.

Existing SOH estimation studies are generally devoted to extracting features correlated with SOH degradation and mapping them to the SOH. These methods require lifelong battery degradation data with measured SOH labels of the target LIBs (so-called target-labeled data). On this basis, many features have been crafted based on our understanding of battery degradation, such as the electrical features^14,15, electrochemical features¹⁶, acoustic features¹⁷, mechanical features^18,19, and thermal features²⁰. Such arduous data collection and feature engineering steps impede the development of SOH estimation methods. Recently, deep neural networks (DNNs)^21,22 have demonstrated the automatic extraction of black-box features from raw operating data, showing impressive SOH estimation performance. However, experimental collection of the target-labeled data is time-consuming and resource-intensive²³, and creating massive datasets for different types of LIBs is rarely sustainable.

A paradigm of generalization for DNNs is transfer learning, which is one promising solution to lighten the burden of data collection. Transfer learning transfers knowledge learned by a DNN in a training dataset (the source domain) to a different dataset (the target domain) using a small number of target-labeled data²⁴. For example, a DNN for charging curve prediction trained using one type of battery can adapt to other types of batteries by fine-tuning using a small number of new samples²⁵. A growing body of literature^26,27 applies retraining or fine-tuning techniques for SOH estimation of various types of batteries. Generally, these works require at least 25–30% of labeled lifelong data in the target domain. Therefore, target domain degradation experiments are still needed. These approaches relying on conventional experiments cannot keep pace with the battery upgrading and have established a barrier to the development of battery technologies.

Approaches without the need for additional target-labeled data to estimate SOH are attractive^28,29,30. This can help the rapid development of battery management systems (BMSs) for new-generation batteries using only existing experimental data, saving considerable time and resources. It is also expected to motivate the utilization of large-scale field data without labels. Indeed, issues for target label-agnostic cases have long been noticed. Cross-domain learning in the absence of target labels has been proven to be equivalent to a dual training task, i.e., learning to predict the source labels while closing the gap between the source and target domains^31,32. Research in the field of visual recognition³³ has shown that even though there are no target-labeled data for training, a DNN jointly trained for classification and domain invariance can conduct classification precisely across visually distinct domains. This confirms that DNNs can accomplish the tasks in target label-agnostic cases. However, LIB SOH estimation, which has a massive demand for target labels, has yet to benefit from it.

In this work, we propose a deep learning-based framework to estimate battery SOH without relying on target labels for training. The proposed framework integrates the estimates of a swarm of DNNs into a reliable SOH estimate rather than relying on a single DNN. Individual DNNs are trained to learn cross-domain knowledge according to source labels and domain invariance of degradation features. DNNs with good performance in the swarm are selected for reliable estimation. We further reveal the influence of the sample distribution in the source domain on SOH estimation and propose to improve estimation performance by trimming the sample distribution in the source domain. We adopt two self-developed and three public battery degradation datasets for cross-validation. The validation covers 80 cases, encompassing 71,588 samples collected from 65 cells. We demonstrate that the proposed framework can achieve an absolute error of less than 3% for 89.4% of samples (less than 5% for 98.9% of samples), with a maximum absolute error of less than 8.87%. To provide references for the selection of hyper-parameters, we also investigate the influence of the crucial hyper-parameters on the estimation performance. These results highlight the potential of deep learning in supplanting the time-consuming battery degradation experiments, and further rapid development of BMS for new-generation batteries using existing experimental data.

Results

Framework overview

We develop a SOH estimation framework composed of a swarm of DNNs (Fig. 1). This framework is designed for reliable estimation by selectively integrating the estimates from multiple DNNs. The proposed framework is introduced in terms of the training procedure (Fig. 1a), estimation procedure (Fig. 1c), and its component units (Fig. 1b). Their definitions and processes are introduced in the Methods section in detail.

**Fig. 1: Overview of the proposed SOH estimation framework.**

The training procedure of the proposed framework integrates independent sub-trainings of N DNNs (see Fig. 1a). Without loss of generality, the battery charging data are employed as the input of the DNNs since the battery charging process is generally controllable and occurs regularly. Specifically, charging capacity sequences within a voltage sampling window (so-called partial charging curves) are taken as the input of each DNN, as demonstrated by previous studies^{25,34,35,36,37}. Before sub-trainings, partial charging curves from both source and target domains are normalized by their nominal capacity. When the training starts, all the sub-trainings are enabled and share an identical training set that is composed of labeled source domain samples and unlabeled target domain samples. We also designed a trimming round to form a new source domain with a balanced SOH distribution by discarding some samples. The training procedure of the proposed framework is terminated after all the sub-trainings are finished.

Each DNN in the proposed framework has identical hyper-parameters (Fig. 1b) but is initialized with different random seeds based on the He initializer³⁸. As the input of DNN, partial charging curves from both source and target domains are first gridded with a voltage interval of 10 mV to reduce the data burden. Next, these samples are fed into stacked one-dimensional (1D) CNN layers to extract their feature vectors. After that, feature vectors of the source domain are flattened and fed into a terminal fully connected (TFC) layer to generate their SOH estimates. These estimates are used together with the source domain labels to calculate the source domain loss. On the other hand, feature vectors of target-domain samples are flattened to a middle fully connected (MFC) layer for reconstructing their feature vectors. These reconstructed feature vectors play two roles. The first is to quantify the domain gap together with the source domain feature vectors. The second is to provide estimates of target domain samples (treated as the pre-estimates of each trained DNN) in the estimation procedure, where the reconstructed feature vectors are further fed into the same TFC as the source domain for regression. By simultaneously minimizing the SOH estimation loss of source domain samples and the gap between the TFC inputs of the two domains, each sub-training transfers the source domain knowledge to the target domain.

The estimation procedure of the proposed framework, unlike the training procedure, is to select a swarm of the trained DNNs to participate in the estimation (Fig. 1c). First, all the DNNs are activated to estimate the SOHs in the target domain, as mentioned above. The trained DNNs are expected to differ widely in estimation performance owing to the training uncertainty and can thus be treated as pre-estimators. To produce a reliable final estimate, we eliminate some unfavorable DNNs by setting quartile thresholds for the mean and standard deviation of the estimation results. The estimations from the selected DNNs are averaged to produce eventual SOH estimates for the target domain samples. Detailed discussions can be found in the Rationalization of predictive performance section.

Data generation

SOH estimation in target label-agnostic cases spans different applications, manufacturers, and chemistries. To reflect such situations, we employ 10,757 samples collected from 65 commercial LIB cells produced by five different manufacturers for validation. The eventual datasets cover five kinds of widely-used cathode active materials, including lithium cobalt oxide (LiCoO₂, LCO)³⁹, a blend of LCO and lithium nickel cobalt oxide (LiCoNiO₂, LCO/NCO)⁷, nickel manganese cobalt (Li(NiMnCo)O₂, NMC)⁴⁰, nickel cobalt aluminum (LiNiCoAlO₂, NCA), and lithium iron phosphate (LiFePO₄, LFP). Note that the exact composition of the cathode active materials cannot be further provided here as this information is not available in the existing literature. The specifications of these five types of LIBs and their experimental costs are compared in Table 1 (see Supplementary Note 1 for the estimation of the experimental cost). The degradation data of PANASONIC and GOTION LIBs are experimentally generated in our lab (see Data availability statement), and others are from three public datasets^39,41,42,43. Details regarding the datasets can be found in Table S1. Note that C-rate is a measure of the battery’s charge or discharge current relative to its nominal capacity, and is used here to describe the experimental current.

Table 1 Main specifications of the selected LIBs in this work

Full size table

In Fig. 2, we plot the charging curves of the selected LIBs to disclose their different degradation behaviors. Significant gaps exist between the charging curves of any two types of LIBs, even though Datasets #2 and #4 consist of batteries with similar electrode active materials. This is because the degradation behavior of LIBs is subject to various factors, such as manufacturing factors and application scenarios. Affected by dissimilarity among experiments, degradation rate, and data processing, the LIBs we employed cover different SOH distributions. This simulates the real-world discrepancy in sample distribution between the source and target domains. Therefore, given a new type of LIBs, traditionally, we carry out time- and resource-consuming experiments to develop tailor-made models for SOH estimation. To address this issue, we propose a domain adaptation-enabled framework for cross-dataset battery SOH estimation without knowing any SOH labels of target batteries.

**Fig. 2: Lifelong charging curve family and histogram of the SOH of the selected five types of LIBs produced by five manufacturers.**

Cross-dataset battery SOH estimation in the absence of target labels

In this experiment, we examine the performance of the proposed framework on cross-dataset SOH estimation in the absence of target labels. Detailed hyper-parameter settings can be found in Fig. S1. To this end, the employed five types of LIBs are pairwise combined, resulting in a total of 20 combinations. A general case is that there are lifelong labeled source domain data (SOH ranges from 100 to 75%) and unlabeled target domain data (SOH ranges from 100% to an unknown level) for training. That is to say, except that initial cycles have a SOH of 100%, we know nothing about the SOH distribution of samples from the target domain. Thus, our estimation faces the challenge of domain imbalance⁴⁴. To imitate general cases, we postulate that the SOH of batteries in the target domain are distributed from 100 to 95, 90, 85, and 80% (denoted as ~95, ~90, ~85, and ~80%), respectively. As a result, each combination contains four cases with four different lower SOH bounds (see Table S2 for the detailed validation scheme). A 500-mV voltage window is utilized to extract the partial charging data, and it covers 3.5 to 4 V for Datasets #1–4 and 3.1 to 3.6 V for Dataset #5. The impact of different voltage windows on SOH is investigated in Supplementary Note 2.

We first use all source domain samples to train the framework without trimming and the estimation of SOH error distribution is shown in Fig. 3a. Detailed results can be found in Figs. S3, 4. Overall, the mean absolute error (MAE) of the proposed framework in all cases is within 5.01%, which proves that the estimation in the absence of target labels is effective. Interestingly, changing source domains induces variation in the estimation accuracy for a specific target domain. For example, using Dataset #5 as the source domain, we can achieve accurate estimation for Datasets #1–4. However, using Dataset #2 as the source domain leads to different estimation performance: the overall errors in the ~85 and ~80% cases are generally higher than those in the ~95 and ~90% cases. This result draws our attention to the source domain. Tracing back to the SOH distribution in the source domains, we find that the SOH distribution in Dataset #2 is significantly skewed towards high SOH, and the skewness is −0.4. In contrast, the distribution of other datasets is relatively symmetric, and the skewnesses of Datasets #1 and #3–5 are −0.19 and −0.12, 0.12, and −0.05, respectively. This explains why using Dataset #2 as the source domain brings significant advantages in the ~95 and ~90% cases. For further validation, we trim the source domain SOH distribution to be symmetric by making the skewness tend towards zero before training. The SOH distributions in source domains after the trim are shown at the top of Fig. 3a. Dataset #5 undergoes only slight trimming as its original distribution is almost symmetric (the skewness before the trim is −0.05). In contrast, Dataset #2 is significantly trimmed, where many samples with high SOH are discarded. Next, we employ the trimmed source domains to train the framework and evaluate the performance of SOH estimation. On the whole, the extent of change in accuracy is positively correlated with that in the trim. Using Dataset #5 as the source domain leads to good performance with little change as before. Using Dataset #2, after undergoing the most notable change, brings similar trends as the other datasets: the framework performs significantly better in the ~95 and ~90% cases than in the ~85 and ~80% cases. These results highlight the impact of source domain SOH distribution on SOH estimation in the absence of target labels and the effectiveness of the trim. We then gather all the verification cases to statistically evaluate the improvement of SOH estimation. Figure 3b shows the comparative results before and after the trim to describe the error distribution as a function of true SOH. Trimming the source domains reduces the maximum absolute error from 10.09 to 8.87%. More importantly, the percentage of high absolute errors (>5%) is dramatically reduced. Also, most estimates are at a low absolute error level (≤3%). To quantify this, we plot the cumulative distribution of the absolute error using bins with a width of 1% absolute error in Fig. 3c. 89.4% of the samples have an absolute error of less than 3% and up to 98.9% of the samples have an error of less than 5% after the trim, which is significantly superior to the case without trim. In conclusion, the proposed framework can achieve accurate cross-dataset SOH estimation in the absence of target labels and can be improved after trimming the sample distribution in the source domain.

**Fig. 3: Performance of cross-dataset battery SOH estimation in the absence of target labels.**

Comparison with existing methods

To verify the advancement of the proposed framework, we gather all validation cases and compare our accuracy with that of four popular methods, including Gaussian process regression³⁵ (GPR), random forest⁴⁵ (RF), support vector regression⁴⁶ (SVR), and CNN⁴⁷. Their hyper-parameter settings can be found in Table S3. The comparative results of the absolute error distribution are described in Fig. 4, and detailed results can be found in Figs. S5–8.

**Fig. 4: Comparison of absolute error distribution of the SOH estimation.**

We first show the performance of the four existing methods when the target domain labels are available. Having enough target labels for learning the target domain, the existing methods show high accuracy with MAEs of less than 1%. However, the target labels in practice come at the cost of numerous workforce and energy. Developing a battery degradation dataset requires 644–8473 hours of degradation experiments (according to the estimation in Supplementary Note 1). In the absence of the target labels, existing methods fail to provide reliable estimation with their MAEs over 5.01%, and the maximum absolute error reaches over 17.91%. By contrast, the proposed framework achieves accurate SOH estimation without target labels, reducing the MAE and maximum absolute error by more than half. The MAE and maximum absolute error are within 1.43 and 8.87%, respectively. More importantly, given a swarm size of 300, our method leverages ~0.7 hours for training, avoiding degradation experiments of thousands of hours (see Supplementary Note 3 for the computational cost comparison). This excellent performance can be attributed to the swarm-driven and domain adaptation strategies. To demonstrate this, ablation experiments are performed to verify the role of these strategies. Benchmark 1 and Benchmark 2 are created by disabling the swarm-driven and domain adaptation strategies of the proposed framework, respectively. Benchmark 3 is designed by disabling both strategies. The detailed results can be found in Figs. S9–11. Besides, Benchmark 1 is designed with a comparable number of hyper-parameters to the proposed framework. As expected, without the help of either of the two strategies, the estimation performance approximately reduces to the level of existing methods.

Rationalization of predictive performance

The excellent performance of our framework can be attributed to domain adaptation and swarm-driven strategies. The swarm-driven strategy is first analyzed. We take a pair of instances in the ~85% case (i.e., the cases of transfer from Dataset #1 to #5 and from Dataset #5 to #1) to investigate its influence. The distributions of pre-estimation root mean square errors (RMSEs) of DNN swarms in these cases are reported in Fig. 5a, b. Overall, the pre-estimation RMSEs of DNN swarms before selection, which are affected by uncertain training, have a wide distribution. One can note that some DNNs have RMSEs of up to 10%. Thus, relying only on a single DNN may yield unreliable SOH estimates like this. We also observe that most DNNs in the swarm are positively skewed within an RMSE of less than 8%. Many of them have RMSEs of less than 3%, indicating that a considerable part of the swarm is trustworthy. This motivates the proposed framework to selectively integrate estimations of a swarm of DNNs for reliable SOH estimation. We find that, for a given batch of target domain samples, the estimation results of the DNNs in the swarm are diverse, but their distributions show regularity with their accuracy (see Supplementary Note 4 for the analysis). This provides an opportunity for the proposed framework to select a group of well-performing DNNs. To develop a selection criterion, we choose the mean and standard deviation to assess the estimations of each DNN in Fig. 5c, d. It is seen that the RMSEs of the pre-estimates for DNNs are significantly correlated with their means and standard deviations. The Pearson correlation coefficients between the means and the RMSEs are −0.9937 in Fig. 5c and −0.9933 in Fig. 5d. Those between the standard deviations and the RMSEs are mostly greater than 0.5 (0.6258 in Fig. 5c and 0.6074 in Fig. 5d). The Pearson correlation coefficients over all cases can be found in Fig. S13. These results reveal that the RMSE of the pre-estimates of each DNN is negatively correlated with the mean and is positively correlated with the standard deviation. In other words, DNNs whose pre-estimates have a higher mean and lower standard deviation are more likely to have higher accuracy. This explains why excellent DNNs can be selected even in the absence of target labels by only using lower and upper quartiles for the means and standard deviations (theorized in the Methods section). We also report the distributions of pre-estimation RMSEs in Fig. 5a, b corresponding to the selected DNNs (circled in black line) in Fig. 5c, d. It is observed that after the selection, the RMSE bandwidth of the swarm estimation is reduced from over 10% to around 4%, and the mean RMSE decreases to about 2%. These results demonstrate the effectiveness and importance of the DNN swarm-driven strategy in the proposed framework.

**Fig. 5: SOH estimation performance of DNNs in the proposed framework.**

After evaluating the swarm-driven strategy, we investigate the other crucial strategy of the proposed framework, i.e., domain adaptation. The feature vectors, before being fed to the terminal fully connected layer, are the subject of domain adaptation. Nevertheless, we do not focus on them but define an explanation map to visualize their contributions to the SOH estimate. The explanation map is defined as the weighted sum of the unflattened weight vector W* of the TFC layer and these feature vectors along the channels (theorized in the DNN explanation section). We first dissect an individual DNN without domain adaptation at 10 equal-interval cycles to visualize the evolution of the explanation map as a function of degradation in Fig. 6a. This DNN is first trained for Dataset #1 and then applied to Dataset #5 with no available target labels. It is seen that the explanation maps for Datasets #1 and #5 are dramatically different. When applied to dataset #5 without domain adaptation, the DNN shows abnormally high feature values near the sampling points corresponding to the voltage plateau. As a result, it makes a severe overestimate for Dataset #5. Next, we dissect an individual DNN (with the median RMSE corresponding to Fig. 5a) of the proposed framework to show the explanation maps (see Fig. 6b). Note that a domain-adapted feature vector needs to be unflattened before being used to compute its explanation map. In contrast to Fig. 6a, the gaps in explanation maps between the two datasets are significantly mitigated by domain adaptation. Thanks to this, the DNN can make accurate estimations even in the absence of the dataset #5 labels. This is the answer regarding the necessity of domain adaptation in the proposed framework.

**Fig. 6: Dissection diagram of the proposed framework (case of transfer from Dataset #1 to #5).**

Estimation performance with various hyper-parameters

We further investigate the impact of some crucial hyper-parameters on the estimation performance of the proposed framework, including the size of the DNN swarm, activation functions, number of channels, and number of layers of the CNN. The size of the DNN swarm is set to 1, 50, 100, …, and 300, respectively, and the results are shown in Fig. 7a. We observe that increasing the swarm size can reduce the overall estimation error. A size of 50 is sufficient for accurate estimation by suppressing the MAE below 2%. Thus, one can balance the accuracy and computational cost by tuning the swarm size in practice. We then examine the influence of activation functions in Fig. 7b by comparing the estimation performance using ReLU, Tanh, Sigmoid, and LogSigmoid, respectively. Note that the Sigmoid before the DNN output is not considered in this comparison as it is used to scale the estimates into [0, 1]. The results show that the ReLU activation function shows the highest accuracy and is therefore preferred when applying the framework. Next, we study the impact of the number of CNN layers and channels on the estimation performance. We span the number of CNN layers from 1 to 4, and the number of channels for all layers is assumed to be identical and belongs to [32, 64, 128, 256]. The results are reported in Fig. 7c. It can be observed that increasing the number of channels does not always reduce the MAE except for the one-layer CNN. The number of channels less than 128 is sufficient to provide an accurate estimation. On the other hand, multiple CNN layers are conducive to high accuracy. One might need to find a suitable number of CNN layers to balance the estimation accuracy and computational cost.

Limitations and outlook

The present study can be improved in the future. First, as a data-driven approach, the proposed framework does not assume specific properties and dimensions of the input data. Hence, the proposed framework can be applied to a wider variety of battery materials, other SOH metrics, and input signals. Second, our preliminary trimming strategy can be developed with more advanced techniques to optimize estimation performance. Finally, the proposed framework does not assume specific application scenarios. It thus can be explored to apply to the big data containing a large amount of battery real-world operation history. The proposed framework is promising to help maximize the potential of big data, which generally lacks labels.

Discussion

Existing techniques for battery SOH estimation are highly dependent on the labeled degradation data of the target battery, resulting in an enormous expenditure of time and resources for data collection. In this work, we devise a target label-agnostic solution to battery SOH estimation based on deep learning. This framework selectively integrates the estimations of a swarm of DNNs into a reliable SOH estimate rather than relying on a single DNN. Each DNN is trained for source labels and domain invariance of degradation features simultaneously. A trim strategy is proposed to regulate the skewness of the source domain sample distribution to improve the accuracy.

As a case study, we take the partial charging curve as the input of the proposed framework. For validation, we combine two experimentally generated datasets and three public datasets for cross-validation, resulting in 80 cases covering 71,588 samples. We first demonstrate that the proposed framework can achieve absolute errors of less than 3% for 89.4% of samples (less than 5% for 98.9% of samples), with a maximum absolute error of less than 8.87% in the absence of target battery labels. Compared with the existing methods, the proposed framework reduces the MAE and maximum absolute error by more than half. These results illustrate the successful application for various domains. Furthermore, we dissect the DNN and visually explicate that the proposed architecture of DNNs can effectively minimize the domain gap. The analysis of the swarm of DNNs unveils a correlation between the mean, standard deviation, and errors of DNNs’ estimates and clarifies how our framework can select the DNNs for SOH estimation in target label-agnostic cases. Finally, we investigate the impact of the crucial hyper-parameters on the estimation performance, and provide references for hyper-parameter selection to apply the proposed framework better.

In summary, our work highlights the potential of deep learning in supplanting time- and resource-consuming battery degradation tests. We envisage that the proposed framework will motivate the utilization of large-scale historically collected but unlabeled LIB data (e.g., onboard data, cloud data). It can also enable the rapid development of BMS for new-generation batteries using only existing experimental data.

Methods

Data processing

Partial charging curves of LIBs are employed as the input of DNNs for SOH estimation. In a constant-current charging process, the voltage V(t) and current I(t) are stored by BMS at a time step t, and the partial charging curve q^ψ can then be captured by setting a voltage sampling window:

$$\left\{\begin{array}{c}{{{{{{\bf{q}}}}}}}^{\psi }=\left[\begin{array}{cccc}\frac{{Q}_{0}^{\psi }(V)}{Q} & \frac{{Q}_{1}^{\psi }(V)}{Q} & {...} & \frac{{Q}_{K}^{\psi }(V)}{Q}\end{array}\right],\,\, \psi \in \{S,T\}\hfill\\ {Q}_{i}^{\psi }(V)={\int }_{V(t)={V}_{\min }}^{V(t)={V}_{\min }+i\varDelta V}|I(t)|{{{{{\rm{d}}}}}}t,\, \, i\in \{0,\, 1,\, {...},\, K\}\end{array}\right.$$

(1)

where the superscript ψ indicates whether q^ψ belongs to the source domain S or the target domain T. Q denotes the initial capacity, which is used to normalize the partial charging curve of different types of LIBs. V_min is the lower voltage limit. The voltage sampling window is gridded by a given voltage step ΔV and ranges from V_min to V_min + KΔV.

To improve estimation performance, we generate a more balanced source domain by trimming the distribution of samples in the original source domain. Specifically, the original source domain samples are first grouped into n_bin bins with a uniform width (set to 2% in this work) according to their labels. Using the number of samples n_i,origin in the i-th bin as an upper bound, the number of samples in this bin n_i,trim that need a trim is then optimized by:

$$ \mathop{\min }\limits_{0\le {n}_{i,{{{{{\mathrm{trim}}}}}}}\le {n}_{i,{{{{{\mathrm{origin}}}}}}},{n}_{i,{{{{{\rm{trim}}}}}}} \in \, {{{{{\mbox{c}}}}}} /}\,\mathop{\sum }\limits_{k=1}^{2}{\alpha }_{k}{g}_{k}\\ \left\{\begin{array}{c}{n}_{i,{{{{{\mathrm{remain}}}}}}}={n}_{i,{{{{{\mathrm{origin}}}}}}}-{n}_{i,{{{{{\mathrm{trim}}}}}}}\hfill\\ {g}_{1}=\left | {\mu }_{3}\left(\mathop{\cup}\limits_{i=1}^{{n}_{{{{{{\mathrm{bin}}}}}}}}{\left\{{y}_{ij}^{S}\right\}}_{j=1,{..}.,{n}_{i,{{{{{\rm{remain}}}}}}}}\right)-{\mu }_{3}^{\ast } \right |,\, {g}_{2}=\frac{{\sum }_{i=1}^{{n}_{{{{{{\rm{bin}}}}}}}}{n}_{i,{{{{{\rm{trim}}}}}}}}{{\sum }_{i=1}^{{n}_{{{{{{\rm{bin}}}}}}}}{n}_{i,{{{{{\rm{origin}}}}}}}}\end{array} \right.$$

(2)

where n_remain represents the number of the remaining samples after trim. ${\{{y}_{ij}^{S}\}}_{j=1,\ldots,{n}_{i,{{{{{\rm{remain}}}}}}}}$ represents n_remain source domain samples randomly selected from the i-th bin. g_k denotes the component of the objective function, k∈{1,2}, and α_k is the weight corresponding to the g_k. In this work, α₁ and α₂ are set to 0.2 and 2, respectively. μ₃(∙) is the skewness operator defined as the third standardized moment:

$${\mu }_{3}(y)=\frac{{{{{{\rm{E}}}}}}{(y-\mu )}^{3}}{{\sigma }^{3}}$$

(3)

where E(∙) is the expectation operator, μ and σ are the mean and standard deviation of y, respectively. In this optimization, minimizing g₁ makes the skewness of the trimmed source domain approach its target value μ₃^* (set to zero in this work) to avoid asymmetry, while minimizing g₂ ensures that as few samples as possible are discarded. Two constraints on the optimization are defined to generate a new source domain with a comparable range and unimodal distribution:

$$\left\{\begin{array}{c}\frac{\mathop{\max }\limits_{i\in [1,{n}_{{{{{{\rm{bin}}}}}}}]}({n}_{i,{{{{{\rm{remain}}}}}}})\, -\mathop{\min }\limits_{i\in [1,{n}_{{{{{{\rm{bin}}}}}}}]}({n}_{i,{{{{{\rm{remain}}}}}}})}{{\sum }_{i=1}^{{n}_{{{{{{\rm{bin}}}}}}}}{n}_{i,{{{{{\rm{origin}}}}}}}}\le \varepsilon \hfill\\ \mathop{\sum }\limits_{i=2}^{{n}_{{{{{{\rm{bin}}}}}}}-1}{{{{\mathrm{sgn}}}}}({n}_{i+1,{{{{{\rm{remain}}}}}}}-{n}_{i,{{{{{\rm{remain}}}}}}})-{{{{\mathrm{sgn}}}}}({n}_{i,{{{{{\rm{remain}}}}}}}-{n}_{i-1,{{{{{\rm{remain}}}}}}})=1\end{array}\right.$$

(4)

where ε is the maximum difference in the number of samples among the bins, which is set to 4.5% in this work.

DNN architecture

The proposed framework is composed of a swarm of DNNs. For the DNN #x, x ∈ {1, 2, …, N}, the gridded input is first processed by serially stacked 1D CNN layers for feature vector extraction, which can be formulated as:

$$\left\{\begin{array}{c}{\,}^{l+1}{{{{{\bf{X}}}}}}_{m}^{\psi }(i)={\,}^{l+1}{{{{{\bf{b}}}}}}+\mathop{\sum }\limits_{c=1}^{{\,}^{l}C}\mathop{\sum }\limits_{j=1}^{{\,}^{l+1}k}{\,}^{l+1}{{{{{\bf{w}}}}}}_{c}\otimes {\,}^{l}{{{{{\bf{X}}}}}}_{c}^{\psi }\left({\,}^{l+1}s_{{{{{{\rm{tr}}}}}}}i+j\right)\\ m\in \{1,\, 2,\, {...},{\,}^{l+1}C\},\, i\in \{0,\, 1,\, {...},{\,}^{l+1}L\}\hfill\\ {\,}^{l+1}L=\frac{{\,}^{l}L-{\,}^{l+1}k}{{\,}^{l+1}s_{{{{{{\rm{tr}}}}}}}}+1,\, l\in \{1,\, 2,\, {...},{\Upsilon }-1\}\hfill\end{array}\right.$$

(5)

where ^l+1X^ψ and ^lX^ψ denote the output and input of the (l + 1)-th 1D CNN layer, respectively. ⊗ represents the valid cross-correlation operator. ^l+1w, ^l+1b, ^l+1k, and ^l+1s_tr are the weight, bias, kernel size, and stride of the (l + 1)-th 1D CNN layer, respectively. ^lC and ^l+1C are the numbers of input and output channels of the (l + 1)-th 1D CNN layer, respectively. ^l+1L and ^lL are the lengths of the ^l+1X^ψ and ^lX^ψ, respectively. (ϒ−1) denotes the total number of the 1D CNN layers.

A MFC layer is exclusively designed for the target domain after the shared CNN layers to reconstruct the extracted feature vectors. The output of the terminal 1D CNN layer is flattened by the channel and then input to the MFC layer, which can be described as:

$${}^{{{{{{\rm{MFC}}}}}}}{{{{{\bf{X}}}}}}^{T}={{{{{{\bf{W}}}}}}}_{{{{{{\rm{MFC}}}}}}}{\,}^{{\Upsilon }}{{{{{{\bf{X}}}}}}}^{T}+{{{{{{\bf{b}}}}}}}_{{{{{{\rm{MFC}}}}}}}$$

(6)

where ^ϒX^T and ^MFCX^T are the target domain input and output of the MFC layer, respectively. W_MFC and b_MFC are the weight and bias vectors of the MFC layer, respectively.

A shared TFC layer is designed at the terminal of the DNN for both source and target domains to regress SOH. The source domain output of the terminal 1D CNN layer is flattened and then fed to the TFC layer, while the target domain output of the MFC layer is directly provided to the TFC layer. This layer can be expressed as:

$${\vartheta }^{{\Psi }}=\left\{\begin{array}{c}{{{{{{\bf{W}}}}}}}_{{{{{{\rm{TFC}}}}}}}{\, }^{{\Upsilon }}{{{{{{\bf{X}}}}}}}^{S}+{b}_{{{{{{\rm{TFC}}}}}}},\, {{{{{\rm{if}}}}}}\,{\Psi }=S\hfill\\ {{{{{{\bf{W}}}}}}}_{{{{{{\rm{TFC}}}}}}}{\, }^{{{{{{\rm{MFC}}}}}}}{{{{{{\bf{X}}}}}}}^{T}+{b}_{{{{{{\rm{TFC}}}}}}},\, {{{{{\rm{otherwise}}}}}}\end{array}\right.$$

(7)

where ϑ^ψ denotes the SOH pre-estimate of each DNN. ^ϒX^S is the source domain output of the ϒ-th 1D CNN layer. W_TFC and b_TFC are the weight vector and the bias of the TFC layer, respectively.

The rectified linear unit (ReLU) activation function, which takes the maximum value between 0 and its input as output, is designed to follow each 1D CNN layer and fully connected layer. The sigmoid activation function is applied before outputting the estimate to ensure that the SOH pre-estimate is between 0 and 1.

DNN explanation

We define a vector F^ψ to visualize the domain adaptation of the proposed framework, which can be formulated as:

$${{{{{{\bf{F}}}}}}}^{\psi }=\mathop{\sum }\limits_{1}^{{}^{\zeta }C}{{{{{{\bf{W}}}}}}}^{\ast }\odot \, {}^{\zeta }{{{{{\bf{X}}}}}}^{\psi },\;\psi \in \{S,T\},\;\zeta \in \{{\Upsilon },{{{{{\rm{TFC}}}}}}\}$$

(8)

where ☉ denotes the element-wise multiplication of two vectors. W* is the unflattened weight vector of the TFC layer. ^ζX^ψ is the unflattened feature vector or the feature vector before flattening. The sigmoid activation function is also applied before the output of this equation. Thus, F^ψ is equivalent to a TFC layer output minus b_TFC and cancels the sampling point-wise weighted sum. Combining F^ψ at various SOH can produce an explanation map for visualizing the contribution of the feature vectors to the lifelong SOH estimation.

DNN training

First, N DNNs are independently trained, and in the present study, we set N = 300. Each DNN is parameterized by the labeled data from the source domain and the unlabeled data from the target domain. The widely-used Adam algorithm⁴⁸ is employed to optimize the parameters iteratively. The learning rate is set to 0.001. To realize cross-domain transfer learning, we define a loss function E containing three components, which can be formulated as:

$$\left\{\begin{array}{c}J=\mathop{\sum }\limits_{i=1}^{3}{\kappa }_{i}\, {f}_{i}\hfill\\ {f}_{1}=\frac{1}{{n}_{{{{{{\rm{s}}}}}}}}\mathop{\sum }\limits_{i=1}^{{n}_{{{{{{\rm{s}}}}}}}}{({\vartheta }_{i}^{S}-{\,}^{\ast }\vartheta _{i}^{S})}^{2}\hfill\\ {f}_{2}={\left|\frac{1}{{n}_{{{{{{\rm{s}}}}}}}}\mathop{\sum }\limits_{i=1}^{{n}_{{{{{{\rm{s}}}}}}}}\phi ({\,}^{{\Upsilon }}X_{i}^{S})-\frac{1}{{n}_{{{{{{\rm{t}}}}}}}}\mathop{\sum }\limits_{i=1}^{{n}_{{{{{{\rm{t}}}}}}}}\phi ({\,}^{{{{{{\rm{MFC}}}}}}}X_{i}^{T})\right|}_{H}\\ {f}_{3}=\frac{1}{Z}\mathop{\sum }\limits_{i=1}^{Z}{({\vartheta }_{0,i}^{T}-1)}^{2}\hfill\end{array}\right.$$

(9)

where f_i denotes the component of the loss function J, i ∈ {1, 2, 3}, and κ_i is the weight corresponding to the f_i. This work sets κ₁, κ₂, and κ₃ to 1, 0.1, and 1, respectively. ^*ϑ^S denotes the available label of the source domain sample. n_s and n_t are the number of samples from the source and target domains, respectively. ||·||_H represents the norm of the reproducing Hilbert space in terms of the embedding kernel ϕ(·), and the Gaussian kernel is employed as the ϕ(·). ϑ^T₀ denotes the pre-estimate of the target domain at the first cycle, and Z is the number of these pre-estimates. f₁ evaluates the mean squared error between each element of the source domain in the pre-estimate and the available labels, which is designed for training each DNN to learn SOH estimation from labeled source domain samples. f₂ evaluates the domain invariance, and the maximum mean discrepancy (MMD) is employed as a criterion to measure the distance between the high-dimensional degradation features of the source domain samples and those of the target domain samples after reconstruction. f₃ is also the measure of the mean squared error but only for the target domain samples at the first cycle. This is because the partial charging curves of the first cycle (i.e., in fresh status) are easily obtained (e.g., by LIB formation or factory test), and their labels can be treated as 1 to improve the learning of the target domain samples.

In each sub-training, samples from the source domain are divided into a training set and a validation set. Two-thirds of the source domain samples are used as the training set, and the rest are the validation set. Each sub-training is terminated when the RMSE of the validation and training sets is less than both 5%, or when the number of epochs reaches 2000. The minimum number of epochs is set to 500. All the samples are divided into mini-batches for training with a mini-batch size of 20. All DNNs are trained based on an NVIDIA Tesla V100 GPU in this work.

SOH estimation with trained DNNs

The proposed framework selectively integrates the pre-estimates of the DNNs to generate a reliable SOH estimate. We employ mean and standard deviation to evaluate the pre-estimates of each trained DNN. An efficient metric, quartile, is used to select DNNs according to these measures. The retaining DNNs are the final choices of the proposed framework for each estimate, and the indexes x of the final choices can be formulated as:

$${{{{{\bf{x}}}}}}=\{x|{{{{{\rm{E}}}}}}({\,}_{x}\vartheta ^{T})\ge {Q}_{3}({{{{{\rm{E}}}}}}({\,}_{x}\vartheta ^{T})),{{{{{\rm{Var}}}}}}({\,}_{x}\vartheta ^{T})\le {Q}_{1}({{{{{\rm{Var}}}}}}({\,}_{x}\vartheta ^{T}))\}$$

(10)

where Q₁ and Q₃ denote the lower quartile operator and the upper quartile operator, respectively. The expected pre-estimates of the retaining DNNs can be treated as the final SOH estimation for the target domain. In this study, we employ root mean square error RMSE, absolute error AE, and its average value MAE to evaluate the SOH estimation:

$$\left\{\begin{array}{c}{{{{{\mathrm{RMSE}}}}}}=\sqrt{\frac{{\sum }_{i=1}^{M}{({y}_{i}-{\hat{y}}_{i})}^{2}}{M}}\hfill\\ {{{{{\mathrm{A}}}}}}{{{{{{\mathrm{E}}}}}}}_{i}=|{y}_{i}-{\hat{y}}_{i}|\hfill \\ {{{{{\mathrm{MAE}}}}}}=\frac{{\sum }_{i=1}^{M}{{{{{\mathrm{A}}}}}}{{{{{{\mathrm{E}}}}}}}_{i}}{M}\hfill\end{array}\right.$$

(11)

where y_i and ŷ_i are the measured value and the final estimate of the SOH for sample i, respectively. M represents the total number of samples of interest.

Battery cycling and dataset generation

Batteries from datasets #3 and #5 are tested in thermal chambers at 20 and 45 °C, respectively. An ARBIN BT2000 battery test system is employed to cycle the batteries. In each cycle of the battery from Dataset #3, the charge strategy is to charge at a constant-current rate of 0.3 C until the voltage reaches 4.2 V and then hold at 4.2 V until the charging current drops below 0.03 A; the discharge strategy is to discharge at a constant-current rate of 2 C. In each cycle of the battery from Dataset #5, the charge strategy is to charge at a constant-current rate of 1 C until the voltage reaches 3.65 V and then hold at 3.65 V until the charging current drops below 1.35 A; the discharge strategy is to discharge at a constant-current rate of 1 C. The charging curves extracted from the constant-current charging phase of the cycles are integrated into the datasets.

Data availability

Datasets #3 and #5 generated in this study have been deposited in the Mendeley database under the accession code: https://data.mendeley.com/datasets/v8k6bsr6tf/1.

Code availability

Code for the modeling work is available from the corresponding authors upon request.

References

Costa, C. M. et al. Recycling and environmental issues of lithium-ion batteries: advances, challenges and opportunities. Energy Storage Mater. 37, 433–465 (2021).
Article Google Scholar
O’Neill, S. Development of lithium-ion batteries wins Nobel Prize. Engineering 6, 487–488 (2020).
Article Google Scholar
Zhang, L., Zhu, C., Yu, S., Ge, D. & Zhou, H. Status and challenges facing representative anode materials for rechargeable lithium batteries. J. Energy Chem. 66, 260–294 (2022).
Article CAS Google Scholar
Vykhodtsev, A. V., Jang, D., Wang, Q., Rosehart, W. & Zareipour, H. A review of modelling approaches to characterize lithium-ion battery energy storage systems in techno-economic analyses of power systems. Renew. Sust. Energ. Rev. 166, 112584 (2022).
Article CAS Google Scholar
Miao, Y., Liu, L., Zhang, Y., Tan, Q. & Li, J. An overview of global power lithium-ion batteries and associated critical metal recycling. J. Hazard Mater. 425, 127900 (2022).
Article CAS PubMed Google Scholar
Severson, K. A. et al. Data-driven prediction of battery cycle life before capacity degradation. Nat. Energy 4, 383–391 (2019).
Article ADS Google Scholar
Birkl, C. R., Roberts, M. R., McTurk, E., Bruce, P. G. & Howey, D. A. Degradation diagnostics for lithium ion cells. J. Power Sources 341, 373–386 (2017).
Article ADS CAS Google Scholar
Lu, J. et al. Battery degradation prediction against uncertain future conditions with recurrent neural network enabled deep learning. Energy Storage Mater. 50, 139–151 (2022).
Article Google Scholar
Dolci, G., Tua, C., Grosso, M. & Rigamonti, L. Life cycle assessment of consumption choices: a comparison between disposable and rechargeable household batteries. Int. J. Life Cycle Assess. 21, 1691–1705 (2016).
Article CAS Google Scholar
Kamali, M. A., Caliwag, A. C. & Lim, W. Novel SOH estimation of lithium-ion batteries for real-time embedded applications. IEEE Embed. Syst. Lett. 13, 206–209 (2021).
Basia, A., Simeu-Abazi, Z., Gascard, E. & Zwolinski, P. Review on State of Health estimation methodologies for lithium-ion batteries in the context of circular economy. CIRP J. Manuf. Sci. Technol. 32, 517–528 (2021).
Article Google Scholar
Hossain Lipu, M. S. et al. Intelligent algorithms and control strategies for battery management system in electric vehicles: progress, challenges and future outlook. J. Clean Prod. 292, 126044 (2021).
Xiong, R., Li, L. & Tian, J. Towards a smarter battery management system: a critical review on battery state of health monitoring methods. J. Power Sources 405, 18–29 (2018).
Article ADS CAS Google Scholar
Fly, A. & Chen, R. Rate dependency of incremental capacity analysis (dQ/dV) as a diagnostic tool for lithium-ion batteries. J. Energy Storage 29, 101329 (2020).
Article Google Scholar
Hu, X., Jiang, J., Cao, D. & Egardt, B. Battery health prognosis for electric vehicles using sample entropy and sparse Bayesian predictive modeling. IEEE Trans. Ind. Electron 63, 2645–2656 (2016).
Google Scholar
Khodadadi Sadabadi, K., Jin, X. & Rizzoni, G. Prediction of remaining useful life for a composite electrode lithium ion battery cell using an electrochemical model to estimate the state of health. J. Power Sources 481, 228861 (2021).
Article CAS Google Scholar
Knehr, K. W. et al. Understanding full-cell evolution and non-chemical electrode crosstalk of Li-ion batteries. Joule 2, 1146–1159 (2018).
Article CAS Google Scholar
Samad, N. A., Kim, Y., Siegel, J. B. & Stefanopoulou, A. G. Battery capacity fading estimation using a force-based incremental capacity analysis. J. Electrochem. Soc. 163, A1584–A1594 (2016).
Article CAS Google Scholar
Mohtat, P., Lee, S., Siegel, J. B. & Stefanopoulou, A. G. Comparison of expansion and voltage differential indicators for battery capacity fade. J. Power Sources 518, 230714 (2022).
Article CAS Google Scholar
Wu, Y. & Jossen, A. Entropy-induced temperature variation as a new indicator for state of health estimation of lithium-ion cells. Electrochim. Acta 276, 370–376 (2018).
Article CAS Google Scholar
Yang, N., Song, Z., Hofmann, H. & Sun, J. Robust State of Health estimation of lithium-ion batteries using convolutional neural network and random forest. J. Energy Storage 48, 103857 (2022).
Li, P. et al. State-of-health estimation and remaining useful life prediction for the lithium-ion battery based on a variant long short term memory neural network. J. Power Sources 459, 228069 (2020).
Article CAS Google Scholar
Lombardo, T. et al. Artificial intelligence applied to battery research: hype or reality? Chem. Rev. 122, 10899–10969 (2022).
Article CAS PubMed Google Scholar
Hoarfrost, A., Aptekmann, A., Farfañuk, G. & Bromberg, Y. Deep learning of a bacterial and archaeal universal language of life enables transfer learning and illuminates microbial dark matter. Nat. Commun. 13, 2606 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Tian, J., Xiong, R., Shen, W., Lu, J. & Yang, X. G. Deep neural network battery charging curve prediction using 30 points collected in 10 min. Joule 5, 1521–1534 (2021).
Shu, X. et al. A flexible state-of-health prediction scheme for lithium-ion battery packs with long short-term memory network and transfer learning. IEEE Trans. Transp. Electrif. 7, 2238–2248 (2021).
Tan, Y. & Zhao, G. Transfer learning with long short-term memory network for state-of-health prediction of lithium-ion batteries. IEEE Trans. Ind. Electron 67, 8723–8731 (2020).
Article Google Scholar
Ye, Z. & Yu, J. State-of-health estimation for lithium-ion batteries using domain adversarial transfer learning. IEEE Trans. Power Electron 37, 3528–3543 (2022).
Article ADS Google Scholar
Ye, Z., Yu, J. & Mao, L. Multisource domain adaption for health degradation monitoring of lithium-ion batteries. IEEE Trans. Transp. Electrif 7, 2279–2292 (2021).
Article Google Scholar
Han, T., Wang, Z. & Meng, H. End-to-end capacity estimation of Lithium-ion batteries with an enhanced long short-term memory network considering domain adaptation. J. Power Sources 520, 230823 (2022).
Borgwardt, K. M. et al. Integrating structured biological data by Kernel maximum mean discrepancy. Bioinformatics 22, e49–e57 (2006).
Kifer, D., Ben-David, S. & Gehrke, J. Detecting change in data streams. In Proc. 2004 VLDB Conference 180–191 (VLDB Endowment, 2004).
Tzeng, E., Hoffman, J., Zhang, N., Saenko, K. & Darrell, T. Deep domain confusion: maximizing for domain invariance. Preprint at arXiv https://doi.org/10.48550/arXiv.1412.3474 (2014).
Xiong, R. et al. Lithium-ion battery health prognosis based on a real battery management system used in electric vehicles. IEEE Trans. Veh. Technol. 68, 4110–4121 (2019).
Article Google Scholar
Richardson, R. R., Birkl, C. R., Osborne, M. A. & Howey, D. A. Gaussian process regression for in situ capacity estimation of lithium-ion batteries. IEEE Trans. Ind. Inf. 15, 127–138 (2019).
Article Google Scholar
Zheng, Y. et al. A novel capacity estimation method based on charging curve sections for lithium-ion batteries in electric vehicles. Energy 185, 361–371 (2019).
Naha, A. et al. An incremental voltage difference based technique for online state of health estimation of Li-ion batteries. Sci. Rep. 10, 9526 (2020).
He, K., Zhang, X., Ren, S. & Sun, J. Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In 2015 IEEE International Conference on Computer Vision (ICCV) 1026–1034 (IEEE, 2015).
He, W., Williard, N., Osterman, M. & Pecht, M. Prognostics of lithium-ion batteries based on Dempster-Shafer theory and the Bayesian Monte Carlo method. J. Power Sources 196, 10314–10321 (2011).
Article ADS CAS Google Scholar
Käbitz, S. et al. Cycle and calendar life study of a graphite|LiNi1/3Mn 1/3Co1/3O2 Li-ion high energy system. Part A: full cell characterization. J. Power Sources 239, 572–583 (2013).
Article Google Scholar
Li, W. et al. One-shot battery degradation trajectory prediction with deep learning. J. Power Sources 506, 230024 (2021).
Birkl, C. Oxford battery degradation dataset 1. University of Oxford (2017).
Xing, Y., Ma, E. W. M., Tsui, K. L. & Pecht, M. An ensemble model for predicting the remaining useful performance of lithium-ion batteries. Microelectron. Reliab. 53, 811–820 (2013).
Article CAS Google Scholar
Weiss, K. R. & Khoshgoftaar, T. M. Investigating transfer learners for robustness to domain class imbalance. In 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) 207–213 (IEEE, 2016).
Li, Y. et al. Random forest regression for online capacity estimation of lithium-ion batteries. Appl. Energy 232, 197–210 (2018).
Article CAS Google Scholar
Guo, Y., Huang, K., Yu, X. & Wang, Y. State-of-health estimation for lithium-ion batteries based on historical dependency of charging data and ensemble SVR. Electrochim. Acta 428, 140940 (2022).
Article CAS Google Scholar
Tian, J., Xiong, R., Shen, W., Lu, J. & Sun, F. Flexible battery state of health and state of charge estimation using partial charging data and deep learning. Energy Storage Mater. 51, 372–381 (2022).
Article Google Scholar
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at https://doi.org/10.48550/arXiv.1412.6980 (2014).

Download references

Acknowledgements

This work was funded by the National Key R&D Program of China under Grant 2021YFB2402002 (R.X.), the Beijing Natural Science Foundation under Grant L223013 (R.X.), and the China Postdoctoral Science Foundation under Grant BX2021035 and 2022M710379 (J.T.).

Author information

Authors and Affiliations

Department of Vehicle Engineering, School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
Jiahuan Lu, Rui Xiong, Jinpeng Tian, Chenxu Wang & Fengchun Sun

Authors

Jiahuan Lu
View author publications
You can also search for this author in PubMed Google Scholar
Rui Xiong
View author publications
You can also search for this author in PubMed Google Scholar
Jinpeng Tian
View author publications
You can also search for this author in PubMed Google Scholar
Chenxu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Fengchun Sun
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.X. conceived the idea of SOH estimation, led and supervised the project, participated in paper writing and revision, and provided guidance to all co-authors. F.S. supervised and led this project. J.L., J.T., and C.W. generated the data. J.L. conceived, wrote, and revised the manuscript. All the authors have revised the manuscript and agreed with its content.

Corresponding authors

Correspondence to Rui Xiong or Jinpeng Tian.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Chao Hu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lu, J., Xiong, R., Tian, J. et al. Deep learning to estimate lithium-ion battery state of health without additional degradation experiments. Nat Commun 14, 2760 (2023). https://doi.org/10.1038/s41467-023-38458-w

Download citation

Received: 16 September 2022
Accepted: 03 May 2023
Published: 13 May 2023
DOI: https://doi.org/10.1038/s41467-023-38458-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.