Abstract
Electrolytes are considered crucial for the performance of batteries, and therefore indispensable for future energy storage research. This paper presents data that describes the effect of the electrolyte composition on the ionic conductivity. In particular, the data focuses on electrolytes composed of ethylene carbonate (EC), propylene carbonate (PC), ethyl methyl carbonate (EMC), and lithium hexafluorophosphate (LiPF6). The mass ratio of EC to PC was varied, while keeping the mass ratio of (EC + PC) and EMC at fixed values of 3:7 and 1:1. The conducting salt concentration was also varied during the study. Conductivity data was obtained from electrochemical impedance spectroscopy (EIS) measurements at various temperatures. Based on the thus obtained temperature series, the activation energy for ionic conduction was determined during the analysis. The data is presented here in a machine-readable format and includes a Python package for analyzing temperature series of electrolyte conductivity according to the Arrhenius equation and EIS data. The data may be useful e.g. for the training of machine learning models or for reference prior to experiments.
Background & Summary
Electrolytes are crucial for the performance of batteries1 since they enable shuttling of the ions, provide electrical isolation of the electrodes and have a defining influence on the formation and stability of the solid electrolyte interface (SEI)2 and the cathode electrolyte interface (CEI)2,3,4. Achieving high performance electrolytes, typically requires the presence of various components like organic solvents, co-solvents, functional additives and conducting salts5. The concentration of each component and the ratio between the components have a strong impact on the conductivity of the electrolyte6,7,8. Ding et al. showed in several studies6,7,8,9, that the composition of the electrolyte, especially the PC content, affects the viscosity and glass transition temperature of the electrolyte. The amount of PC also hinders crystallization of EC6,10. This allows for the formulation of electrolytes with improved performance at low temperatures10,11.
The dataset12 presented herein provides a comprehensive basis for future optimization studies, as it contains a wide variation of formulations and temperatures, including the raw data. Furthermore, it can help to gain deeper insights regarding composition-property-performance relationships. Fractions of this dataset served as the basis for several machine learning models published elsewhere11,13,14. The automated high-throughput experimentation system13 available at the Helmholtz Institute Münster is used to formulate a variety of electrolyte solutions based on EC, EMC, PC and LiPF6. Ratios of (PC + EC):EMC of 3:7 and 1:1 are covered in the dataset12. The concentration of the conducting salt varies between 0.2 mol kg−1 and 2.1 mol kg−1, while the ratio of EC:PC ranges from 0.0 to 9.2.
The robotic system13 used for the acquisition of the data is able to dispense liquid and solid components into aluminium or polymer vials with high accuracy. Each formulation is identified by a batch number and measurements are identified by a unique ID stored and reported on the vial through a QR code. After sample-preparation, the automated setup performs the targeted measurement. Subsequently, the system returns a JSON formatted file for each formulation, which allows for downstream processing. Here, we present the data12 as a CSV file to summarize the results received from 504 individual JSON files. Manual analysis of the raw data is time intensive, which is why we have developed an automated Python-based data analysis package called Modular and Autonomous Data Analysis Platform (MADAP)15 with a command line interface (CLI) and a graphical user interface (GUI) that can process the aggregated CSV. This package is generalized and can be used on a variety of datasets as described below. The overall workflow of generating and analyzing data is shown in Fig. 1. All input parameters are tracked and saved in the output obtained from MADAP15 to allow full data provenance tracking16,17 of not just the experimental but also the data analysis steps in the research workflow18.
The dataset12 can be used to train machine learning models in order to predict promising electrolyte formulations to reach an optimum conductivity, as demonstrated by Rahmanian et al.11. Further, the research community may find the data useful in the design of their own experiments and in decisions concerning the use of hardware, software and human resources. The use of this dataset together with analysis tools like MADAP15 as a base for further lithium-ion battery research, enables the generation of further insights such as the activation energy of the ion conduction process. It is even possible to add other analysis procedures to MADAP15 to further expand the automation it provides.
Methods
High throughput experimentation (HTE) system
The robotic HTE system13, used to acquire the data12 presented here, is designed for high-throughput operation in a nitrogen atmosphere. The setup designed for the formulation of electrolyte solutions is able to prepare 96 formulations in 8 h by gravimetric dosing of solid and liquid materials into polymer or aluminium vials. Up to 10 mL of electrolyte can be formulated within one vial. The setup also provides functionalities to close the vials, mix, and heat their content using a heated shaker plate. Further, EIS measurements are performed automatically. To track the samples, each vial is automatically labelled using a QR code representing information like the date of preparation, an ID for the electrolyte mixture and information regarding the chemicals used. In preparation for EIS measurements, a volume of 750 μL of the electrolytes is automatically filled into single-use Eppendorf Ⓡ Safe-Lock Tubes with a capacity of 2 mL. The use of single-use equipment avoids cross contamination in this step of the process. Subsequently, electrodes are automatically immersed into the sample. These electrodes are designed to generate reproducible results independent of the shape of the vial or the depth of immersion19. For the measurement, the samples are arranged in groups of eight samples per rack, three of which are mounted on one larger rack. Four of these combined racks can be connected to the Metrohm Autolab potentiostat, which is used for the measurements13.
EIS measurement
After the assembly of the racks, they are manually transferred to a Memmert TTC256 temperature chamber for EIS measurements. The connection of the cells to the Metrohm Autolab potentiostat is also done by the operator. The temperature chamber is programmed such, to cover the temperature ranges between −30 °C and 60 °C in steps of 10 °C. Subsequent to an equilibration period of 2 h for each temperature, the EIS measurements are automatically performed with an applied AC voltage of 40 mV and frequencies between 20 kHz to 50 Hz. A multiplexer distributes the output of twelve channels to eight outputs each. Hence, 96 channels are available to connect to each of the 96 cells on a rack13. Each experiment is repeated several times to provide up to 8 sets of values to the dataset. Repetitions can be identified and distinguished based on the running number in the experimentID.
Data management in the experimental setup
The data recording during the experimental workflow is handled by a laboratory information management system. It records identifiers for the starting materials, test protocols and relevant experimental parameters. Furthermore, the system is able to merge these data with metadata comprising further details about the electrolytes used in a measurement. After conclusion of a measurement, the collected data including the metadata is saved to a JSON file, which can be used for analysis.
Analysis Software (MADAP)
For the data analysis, a variety of tools are present and available, e.g. ZView20, pyEIS21, impedance22, Aftermath23 and Origin24. We decided to bundle some of these tools into a compact, modular software package called MADAP15, thoroughly documented using sphinx25. This analysis tool provides all the necessary means to perform electrochemical data analysis based on experimental datasets, while providing full data provenance tracking, and plot publication quality results. It can perform a variety of automated electrochemical analyses, including EIS, linear and cyclic voltammetry and the analysis of temperature series according to the Arrhenius equation. In this paper, we focus on Arrhenius analysis and EIS measurements. MADAP15 is deployed in Python3 and is publicly accessible as a GitHub repository (https://github.com/fuzhanrahmanian/MADAP)15, a pip installable package (pip install madap), and an executable (https://github.com/fuzhanrahmanian/MADAP/releases/tag/v1.0.0) with a graphical user interface (GUI) created with PySimpleGui26,27, as shown in Fig. 2. The accessibility of MADAP15, by means of a CLI as well as a GUI, provides the broader scientific community with a variety of entry points for the data analysis. The generic nature of the procedure assures that the package can be expanded with further analysis methods without impacting the existing methodologies. Further, this enables its integration into autonomous research workflows28,29,30. The basic workflow of an analysis using MADAP15 comprises the three steps of data acquisition, pre-processing and the analysis itself. In the former, the user can import different data types (.txt, .json, .hdf5 or .h5, .xml, .pkl and .csv) and select the data to be analyzed based on ranges of indices for rows and columns or by specifying column labels. The pre-processing step can detect outliers based on given upper and lower limits of the relevant quantile using the Quantile-based flooring and capping algorithm31. The user may choose to specify custom limits or use the default values implemented in MADAP15. In version 1.0, the default values are chosen as 0.01 for the lower and 0.99 for the upper limit. Afterwards, the user can choose what type of analysis shall be performed, i.e. voltammetry, EIS or Arrhenius.
A showcase of the Graphical User Interface diagram of MADAP15.
Figure 3 depicts the code structure used in MADAP15. In the beginning of each analysis, all the procedures instantiate an abstract class called EChemProcedure, which enforces the presence of methods called analyze, plot, save_data and perform_all_actions. All procedures additionally inherit from the common Plots class, which equips them with the common plotting functionalities, providing outputs with scientific format32. The complete procedure is continuously logged to review potential errors.
The stylized Unified Modelling Language (UML) diagram that represents the code structure of MADAP15.
The linear fit required for the Arrhenius type analysis33 is implemented in MADAP15 using the functionalities for linear regression provided in the scikit-learn package34. The activation energy and the pre-exponential factor are derived from this fit. The regression loss, which is chosen as a quality metric, is calculated using the mean square error (MSE). Finally, plots and data files for the raw and fitted data as well as the model’s parameters are automatically generated and saved in a designated location in accordance to the FAIR (Findability, Accessibility, Interoperability, and Reusability) data principle35.
EIS analysis and fitting are performed by a partial adoption of the impedance package provided by Matthew D. et al.22. In this package, the model uses a non-linear square fit as supplied by the SciPy36 package. The EImpedance module of MADAP15 gives the user the possibility to provide a definition of an equivalent circuit via available elements and their corresponding values. In this case, the user should provide guesses for the value of each element in the selected circuit. Based on these guesses, MADAP15 generates a fit of the selected data internally and evaluates its quality. For the quality check, the root-mean-square error (RMSE) of the fit is determined and compared to the root mean square (RMS) of the experimental data. If the ratio of RMSE over RMS exceeds a threshold (δ), a re-evaluation will be triggered. In this case, the standard deviation of each estimated value of a circuit’s element is added to or subtracted from the respective value to improve generalization. The operation to be carried out is selected randomly for each value. These new values are then used as the input guesses for the subsequent fit. This procedure is iterated, until either the ratio of RMSE and RMS is below δ, i.e. Equation 1 is fulfilled, or 5 iterations are reached. The number of iterations as well as δ are determined heuristically to 5 and 0.01, although the user will have the possibility to change them and define custom numbers as required.
Alternatively, MADAP15 provides the option to iterate over 40 common hard-coded equivalent circuits, which are provided as part of the MADAP15 package, without further user input. In this case, the match with the lowest RMSE will be chosen. This metric will be used as the loss metric in the analysis. For every impedance spectrum, the fitted circuit parameters and their uncertainties, the loss metric, the determined resistance and the corresponding conductivity will be saved automatically. To provide information about the linearity and stability of the fit, the improved linear Kramers-Kronig (linKK) method37 as implemented in the impedance module22 is applied automatically to each spectrum. For visualization, a Nyquist and a Bode plot comprising the raw and fitted data as well as a residual plot for the linKK method will be generated and saved accordingly. Figure 4b shows the data and the fit of randomly selected spectra corresponding to different quantiles of the RMSE to convey an impression of the achieved quality of the fit. For each quantile, four spectra and their respective fits are shown. For evaluation of the reliability of the fit, benchmarking is done referencing to the manual analysis of the selected data using Metrohm Autolab software as a baseline. In comparison to this baseline, MADAP15 provides acceptable fits for the majority of the spectra. The same principle was applied for the Arrhenius analyses, depicted in Fig. 4a.
Fits randomly selected from Q1, Q2, Q3 based on the (a) R2 score of the Arrhenius fit and (b) RMSE of the eis fit determined by MADAP15 for 5035 electrolyte measurements with a frequency range between 50 and 20k.
Data Records
The dataset12 presented here comprises, among others, conductivity, real and imaginary part of the impedance as determined by EIS measurements and information regarding the formulation of a variety of electrolyte formulations for lithium-based batteries. The formulations relate to the masses of the solvent components EC, PC, and EMC and the conducting salt LiPF6.
We provide the dataset as a dataframe in a CSV file format, which can be dowloaded from https://doi.org/10.5281/zenodo.724493912 and may be used under the CC BY license. A summary of its structure is presented in Table 1. This table also shows the data type, the range of values covered for each quantity, the number of unique values and the physical unit. In this section, we elucidate more on the data and the interrelations within the dataframe.
The robotic system operated at the Helmholtz Institute Münster outputs the raw data in JSON format. Although, this format is machine-readable, we decided to provide the data in CSV format, which can easily be read into the user’s script as a table, e.g. using the Pandas38 library available for Python. Each line in the dataframe represents all the data available for a single measurement. Parameters, which are shared by several experiments, are repeated in each line, where they are applicable. In the following, we will elucidate more on each column of the dataframe.
experimentID
This column provides a unique identifier for each experiment, which enables traceability of the data. It codes the operator, the date of the experiment, the label of the electrolyte and a running number differentiating the repeats. The format of the experimentID is: [operator]_[date of the experiment]_[label of the electrolyte]_[running number].
temperature
The temperature, at which each measurement was performed, is reported in this column. Each row corresponds to a measurement at one temperature. The values range from −30 °C to 60 °C. For five formulations, the measurement at −30 °C is not reported in the dataset.
frequency
This column reports a string, which comprises a list of the frequencies used in the EIS measurements. The frequencies are reported in units of Hz and cover a range from 20 kHz to 50 Hz.
real_impedance
Values for the real part of the impedance, Z’, in the unit Ω are given in this column in the form of a string of a list of floats. The values in this column for all measurements range from −3.901 × 105 Ω to 11.430 × 107 Ω. The negative values result from artefacts in the measurements.
imaginary_impedance
The imaginary part of the impedance, Z”, is presented in this column. The values are given in Ω and range from −37.185 × 107 Ω to 103.002 Ω. The positive values result from artefacts in the measurements.
cell_constant,_standard_deviation
The cell constant and the respective standard deviation values are reported in cm−1 and determined from five reference measurements using 0.01 M KCl (aq) standard solution at a temperature of 20 °C13. In the dataset, they are reported in a common column as a tuple, in which the first value corresponds to the cell constant and the second value reports the standard deviation. The values for the cell constant range from 3.815 to 4.720, while the standard deviations span a range from 0.000 to 0.178.
PC
This column reports the mass of PC in g used during the preparation of the electrolyte formulation. The values are given as floats and range from 0.273 g to 5.306 g.
EC
The mass of EC used during the preparation of the electrolyte formulation is reported in this column. The values are given as floats in units of g and are spanning a range from 0.000 g to 4.320 g.
EMC
In this column, we report the mass of EMC used for the preparation of the electrolyte formulation. The values are given in g and comprise values between 0.480 g and 9.457 g.
LiPF_6
This column presents the mass in g of LiPF6 comprised in the formulations. The values reach from 0.301 g to 4.093 g.
metadata
In this column, additional information is reported, which cannot be reasonably presented in tabular form. The metadata are presented as a string of a dictionary. It reports the date and type of the experiment using the keys experimentDate and experimentType, respectively. Further, the version of the JSON format is associated with the key formatVersion. The number of the channel running the experiment, the amount of electrolyte used in the respective measurement, and the suspected measurement error are correlated with the keys channel, electrolyteAmount, and suspectedMeasurementError, respectively. The keys PC, EC, EMC, and LiPF6 are linked to further information regarding the respective electrolyte component which is represented in dictionary format. The keys Batch-No, CAS-No, and comment present the respective information as a string. The date of delivery and the date of opening of the container are given as strings in the format MM/YY and can be accessed using the keys dateOfDelivery and dateOfOpening. The molar mass of the substance is reported as a float with the key molarMass, while its unit is given as a string using the key molarMassUnit. The name key is associated with a string stating the long name of the chemical. The purity of the material is found using the key purity, while the SMILES string is given with the key SMILES. Both of these quantities are reported as strings. The amount of the respective substance used in the formulation is accessed with the key substanceAmount, while the respective unit is found using the key substanceAmountUnit. Finally, the supplier key returns the supplier, from which the material was obtained.
Moreover, the dataframe also contains data resulting from the analysis of the experimental data using the MADAP15 Python package. The MADAP15 analysis workflow is performed on a Lenovo Workstation with an AMD Ryzen Threadripper PRO 3975WX processor at 3500 MHz with 32 cores and 64 Logical Processors. The workstation is equipped with 128 GB of RAM and an RTX A6000 GPU running with Microsoft Windows 10 Pro. The single core performance of the CPU turned out to be a bottleneck during operation, since the used libraries are not optimized for multicore processing or GPU training. Hence, MADAP15 was configured to use all 32 cores for multithreaded operation for this scenario. In the following, we elucidate more on the analyzed results contained in the dataframe by going through the column names associated with analyzed data.
phase_shift
This column reports the phase shift (ϕ) or phase angle as obtained from the EIS analysis implemented in the MADAP15 package according to Eq. 2:
The data is given as a string of a list with values ranging from 0.131 to 89.882 given in°.
EIS_conductivity
The ionic conductivity obtained as the quotient of the cell constant and the resistance determined from the EIS analysis implemented in MADAP15 is reported in this column. The conductivity is given in units of S cm−1 and the values range from 0.000 S cm−1 to 0.019 S cm−1.
EIS_fittedParameters
In this column, we report the determined values of the circuit’s elements as well as their uncertainties as obtained from the analysis. These parameters are represented as a string of a list of tuples. The first element of each tuple illustrates the value of the respective element, and the second value shows the standard deviation error obtained from the output of the impedance package22. The order of the tuples corresponds to the order of a given circuit’s elements as presented in column EIS_circuit.
EIS_RMSE
This column reports the RMSE of the fit in the real and the imaginary dimension as obtained from EIS analysis. The values are given as floats.
EIS_numberRCelements
An optimal number of RC elements in an equivalent circuit determined using the linKK method can be verified by a metric, which subtracts the ratio between the sums of negative and positive resistor values from unity. The symbolic representation of this metric is conventionally chosen to be μ and its values are reported as floats in our dataframe. The number of RC elements considered as optimal is the one, which results in a value of μ below 0.8537.
EIS_fitEvaluation
This column reports a numeric value providing means to estimate the degree of over- or under-fitting. The values range from 0.576 to 0.850 and are reported as floats. The upper limit is fixed at 0.850 to avoid overfitting, as described by Schönleber et al.37.
EIS_resistance
From the EIS analysis, the resistance of the electrolyte towards ionic charge transfer is obtained. The values resulting from the analysis are reported in this column in units of Ω. A range from 241.781 Ω to 25.564 × 103 Ω is spanned by the data.
EIS_chiSquare
This statistical value determines the goodness of the fit derived from the linKK method and is calculated as the sum of squares of the real and imaginary residual error. The χ2 values are reported as floats.
EIS_circuit
The manual or auto-selected circuit used to fit the EIS data of the concerned measurement is reported in this column. In the representation, serial connections are displayed as element1-element2, while p(element1, element2) indicates a parallel electric connection. The elements in the circuit are represented by R for resistance and C for capacity. A constant phase element is indicated by CPE and a Warburg element is represented as W. An additional list of elements, which may be used by the user, can be found in the impedance package22. In this column, the fitted circuit for each conductivity experiment is represented by a string.
EIS_impedance
This column represents a list of impedance values obtained from the fitted model with frequency as input and the measured impedance as output. The data is reported as a string of a list.
EIS_residualReal
The residual errors of the real impedance obtained from the linKK method can be seen in this column. They are given as a string of a list.
EIS_residualImaginary
In this column, the residual error derived from the linKK method for imaginary impedance as a consistency factor is reported as a string of a list.
Arrhenius_activationEnergy
For calculating the activation energy from the conductivity experiment, a linear fit between the inverse temperatures in 1000/K and the natural logarithm of conductivities is applied. The activation energy can be calculated with the Arrhenius equation and is reported as a float in this column with the unit mJ mol−1.
Arrhenius_preExponential
The pre-exponential factor obtained from the linear fit according to the Arrhenius equation is reported in this column. The values of this factor are given as a float with the unit Scm−1.
Arrhenius_R2
In this column, the R2 score of the linear fit is shown as a unitless float.
Arrhenius_MSE
In this column, we report the mean square error of the linear fit as a unitless float.
Arrhenius_lnConductivity
A list of the natural logarithmic conductivities obtained from the linear fit is reported in this column as a string of a list of floating point numbers.
All the relevant data concerning the raw data, fitting parameters and results of the analysis are saved in the presented dataset. The data is therefore fully traceable and reusable. This is compliant with the FAIR35 data standard. The workflow is schematized in Fig. 5.
The column named Data Type in Table 1 shows the data type obtained after reading the dataframe from the CSV file using Pandas’38. read_csv method. The user should note the information provided in the column Description to see the structure of the string. For example, the real part of the impedance is read as a string type variable. However, it actually represents a list of floats and should be cast to this data type.
Technical Validation
The reliability of the experimental data is validated by repeating each measurement several times. Invalid data is not stored in the dataset12 reported here. Each measurement is examined by an expert in the field to ensure high quality of the data.
The data obtained from the analysis is verified using an appropriate metric for each analysis. For the Arrhenius type analysis, the quality of the fit is quantified by the mean squared error (MSE).
The impedance data reported in this work is pre-processed for analysis by excluding negative impedance values and outliers to enable a reliable analysis. The linKK method is used to verify the linearity of the spectrum and also reports the goodness of the fit by the statistical χ2 value corresponding to the residual errors of the impedance data. Consequently, the resulting fit of the equivalent circuit is verified by means of the RMSE. This workflow returns the parameters corresponding to the equivalent circuit as presented in the section Data Records.
For visualization, we generated quantiles based on R2 and RMSE for all the fits performed during the analyses. Figure 4 shows the results of four randomly selected analyses taken from each quantile to provide an overview of the distribution of the fitting quality. In Fig. 4a, fits corresponding to quantiles based on R2 are shown, while Fig. 4b presents fits for quantiles based on RMSE. The first row in each subfigure gives an impression of the lowest fit quality, while the best fits are shown in the last row of the subfigures. Additionally, the conductivity and the activation energy calculated by MADAP15 are depicted in Fig. 6.
Results from the analysis according to the Arrhenius equation. (a) represents the activation energies for two (EC + PC):EMC ratios of 0.4 and 1, derived by MADAP15 using a linear regression fit, (b) shows the conductivity value for the mentioned ratios at 10 descrete temperatures between −30.0 °C and 60.0 °C obtained from the analysis performed by MADAP15 using non-linear least square fit of SciPy36 module. For a part of this fit, the impedance module22 has been utilized.
Usage Notes
It is recommended to apply the MADAP15 package to use, extend or adapt the provided data analysis. For performing analysis using the MADAP15 package, a specific range of rows and columns of the dataframe can be selected. For example, to reproduce one of the result of this article for Arrhenius analysis, the published dataset was selected as input for the MADAP15 GUI and the row indices from 3967 to 3977 and column 2 for temperatures and column 13 for electrolyte conductivity selected for the evaluation. Both plotting types were chosen, and the RUN button was pressed. Further results can be derived similarly.
Note that, in case a definition of the formulation in terms of molar fractions is desired, the amounts of substances for each component of the electrolyte as reported in the dictionary given in the column labelled metadata can be used.
Code availability
The code of the MADAP15 package is publicly available on https://github.com/fuzhanrahmanian/MADAP and the documentation can be found in https://fuzhanrahmanian.github.io/MADAP/. A stand-alone windows executable can be downloaded from the GitHub repository as well. Furthermore, MADAP15 can be installed by running pip install madap.
The analysis results presented in this article are generated using MADAP15 version 1.0. Contributions are welcome, but should follow the common guidelines for group software development, which can be found in the CONTRIBUTION section of the MADAP15 the repository. The code is developed for the Python version 3.9 and above and should use the following packages and versions: attrs > = 21.4.0, matplotlib > = 3.5.339, numpy > = 1.22.440, pandas > = 1.4.238, pytest > = 7.1.2, scikit_learn > = 1.1.2, and impedance > = 1.4.122. For running the GUI, PySimpleGUI > = 4.60.326 is required additionally.
References
Verma, P., Maire, P. & Novák, P. A review of the features and analyses of the solid electrolyte interphase in Li-ion batteries. Electrochim. Acta 55, 6332–6341, https://doi.org/10.1016/j.electacta.2010.05.072 (2010).
Peled, E. & Menkin, S. SEI: Past, Present and Future. J. Electrochem. Soc. 164, A1703–A1719, https://doi.org/10.1149/2.1441707jes (2017).
Qian, Y. et al. Influence of electrolyte additives on the cathode electrolyte interphase (CEI) formation on LiNi1/3Mn1/3Co1/3O2 in half cells with Li metal counter electrode. J. Power Sources 329, 31–40, https://doi.org/10.1016/j.jpowsour.2016.08.023 (2016).
Gauthier, M. et al. Electrode–Electrolyte Interface in Li-Ion Batteries: Current Understanding and New Insights. J. Phys. Chem. Lett. 6, 4653–4672, https://doi.org/10.1021/acs.jpclett.5b01727 (2015).
Zhao, W. et al. Recent advances in the research of functional electrolyte additives for lithium-ion batteries. Curr. Opin. Electrochem. 6, 84–91, https://doi.org/10.1016/j.coelec.2017.10.012 (2017).
Ding, M., Li, Q., Li, X., Xu, W. & Xu, K. Effects of Solvent Composition on Liquid Range, Glass Transition, and Conductivity of Electrolytes of a (Li, Cs) PF6 Salt in EC-PC-EMC Solvents. J. Phys. Chem. C. 121, 11178–11183, https://doi.org/10.1021/acs.jpcc.7b03306 (2017).
Ding, M. et al. Change of Conductivity with Salt Content, Solvent Composition, and Temperature for Electrolytes of LiPF6 in Ethylene Carbonate-Ethyl Methyl Carbonate. J. Electrochem. Soc. 148, A1196, https://doi.org/10.1149/1.1403730 (2001).
Ding, M. S. & Jow, T. R. Conductivity and Viscosity of PC-DEC and PC-EC solutions of LiPF6. J. Electrochem. Soc. 150, A620, https://doi.org/10.1149/1.1566019 (2003).
Ding, M. S. Liquid Phase Boundaries, Dielectric Constant, and Viscosity of pc-dec and pc-ec Binary Carbonates. J. Electrochem. Soc. 150, A455–A462, https://doi.org/10.1149/1.1557968 (2003).
Hubble, D. et al. Liquid electrolyte development for low-temperature lithium-ion batteries. Energy Environ. Sci. 15, 550–578, https://doi.org/10.1039/D1EE01789F (2022).
Rahmanian, F. et al. One-shot active learning for globally optimal battery electrolyte conductivity. Batteries & Supercaps 5, https://doi.org/10.1002/batt.202200228 (2022).
Rahmanian, F. et al. Dataset of 5035 conductivity experiments for lithium-ion battery electrolyte formulations at various temperatures. Zenodo https://doi.org/10.5281/zenodo.7244939 (2022).
Narayanan Krishnamoorthy, A. et al. Data-Driven Analysis of High-Throughput Experiments on Liquid Battery Electrolyte Formulations: Unraveling the Impact of Composition on Conductivity. Chemistry-Methods 2, https://doi.org/10.1002/cmtd.202200008 (2022).
Flores, E. et al. Learning the laws of lithium-ion transport in electrolytes using symbolic regression. Digital Discovery 1, 440–447, https://doi.org/10.1039/D2DD00027J (2022).
Rahmanian, F. Modular and autonomous data analysis platform. Zenodo https://doi.org/10.5281/zenodo.7374383 (2022).
Soedarmadji, E., Stein, H. S., Suram, S. K., Guevarra, D. & Gregoire, J. M. Tracking materials science data lineage to manage millions of materials experiments and analyses. npj Comput. Mater. 5, 1–9, https://doi.org/10.1038/s41524-019-0216-x (2019).
Castelli, I. E. et al. Data Management Plans: the Importance of Data Management in the BIG-MAP Project. Batteries Supercaps 4, 1803–1812, https://doi.org/10.1002/batt.202100117 (2021).
Stein, H. S. et al. From materials discovery to system optimization by integrating combinatorial electrochemistry and data science. Curr. Opin. Electrochem. 35, 101053, https://doi.org/10.1016/j.coelec.2022.101053 (2022).
Wiemhöfer, H.-D., Grünebaum, M. & Hiller, M. M. Micro electrode liquid measurement cell. WIPO Utility Patent No. WO 2014/139494 A1 (2014).
AMETEK Inc. Zview. Scientific Instruments https://www.ameteksi.com/products/software/zview-software-en (2019).
Knudsen, K. B. Pyeis: A Python-Based Electrochemical Impedance Spectroscopy Analyzer and Simulator. ECS Meet. Abstr. MA2019-01, 1937, https://doi.org/10.1149/MA2019-01/39/1937 (2019).
Murbach, M. D., Gerwe, B., Dawson-Elli, N. & Lok-kun, T. impedance.py: A Python package for electrochemical impedance analysis. J. Open. Source Softw. 5, 2349, https://doi.org/10.21105/joss.02349 (2020).
Spinner, N. AfterMath EIS Data Import Procedure. Pine Research Instrumentation https://pineresearch.com/shop/kb/software/software-help-and-support/using-aftermath/aftermath-import-procedure (2016).
Deschenes, L. A. & Bout, D. A. V. Origin 6.0: Scientific Data Analysis and Graphing Software Origin Lab Corporation (formerly Microcal Software, Inc.). web site: www.originlab.com. commercial price: 595.academicprice: 446. J. Am. Chem. Soc. 122, 9567–9568, https://doi.org/10.1021/ja004761d (2000).
Sphinx Development Team. Sphinx 4.0.0+ documentation. Sphinx https://www.sphinx-doc.org/en/master/ (2020).
Podržaj, P. & Walker, N. Proceedings of the 8th International Conference on Informatics and Applications ICIA2019, Japan, 2019 (2019).
PySimpleGUI. PySimpleGUI: Python GUIs for humans. PySimpleGUI https://www.pysimplegui.org/ (2018).
Rahmanian, F. et al. Enabling modular autonomous feedback-loops in materials science through hierarchical experimental laboratory automation and orchestration. Adv. Mater. Interfaces 9, 2101987, https://doi.org/10.1002/admi.202101987 (2022).
Pizzi, G., Cepellotti, A., Sabatini, R., Marzari, N. & Kozinsky, B. AiiDA: automated interactive infrastructure and database for computational science. Computational Mater. Sci. 111, 218–230, https://doi.org/10.1016/j.commatsci.2015.09.013 (2016).
Allan, D., Caswell, T., Campbell, S. & Rakitin, M. Bluesky’s ahead: A multi-facility collaboration for an a la Carte software project for data acquisition and management. Synchrotron Radiat. N. 32, 19–22, https://doi.org/10.1080/08940886.2019.1608121 (2019).
Dekking, F. M., Kraaikamp, C., Lopuhaä, H. P. & Meester, L. E. A Modern Introduction to Probability and Statistics: Understanding why and how, vol. 488 (Springer, 2005).
Garrett, J. D. & Peng, H. SciencePlots (v1. 0.6). Zenodo https://doi.org/10.5281/zenodo.4106650 (2020).
Laidler, K. J. The development of the arrhenius equation. J. Chem. Educ. 61, 494–498, https://doi.org/10.1021/ed061p494 (1984).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830, https://doi.org/10.5555/1953048.2078195 (2011).
Wilkinson, M. D. et al. The FAIR guiding Principles for scientific data management and stewardship. Sci. Data 3, 1–9, https://doi.org/10.1038/sdata.2016.18 (2016).
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in python. Nat. Methods 17, 261–272, https://doi.org/10.1038/s41592-019-0686-2 (2020).
Schönleber, M., Klotz, D. & Ivers-Tiffée, E. A Method for Improving the Robustness of linear Kramers-Kronig Validity Tests. Electrochim. Acta 131, 20–27, https://doi.org/10.1016/j.electacta.2014.01.034 (2014).
W McKinney. Data Structures for Statistical Computing in Python. In S., van der Walt & J., Millman (eds.) Proceedings of the 9th Python in Science Conference, 56–61, https://doi.org/10.25080/Majora-92bf1922-00a (2010).
Hunter, J. D. Matplotlib: A 2d graphics environment. Comput. Sci. Eng. 9, 90–95, https://doi.org/10.1109/MCSE.2007.55 (2007).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362, https://doi.org/10.1038/s41586-020-2649-2 (2020).
Acknowledgements
This work contributes to the research performed at CELEST (Center for Electrochemical Energy Storage Ulm-Karlsruhe) and was funded by the German Research Foundation (DFG) under Project ID 390874152 (POLiS Cluster of Excellence). This project received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 957189.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
C.W. and P.Y. conducted the experiments and created the JSON files containing the raw data, F.R. designed the data analysis platform including the GUI and created the pip package for the conducted experiments and analyzed the results. M.V. & F.R. created the data frame containing raw and processed data. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Rahmanian, F., Vogler, M., Wölke, C. et al. Conductivity experiments for electrolyte formulations and their automated analysis. Sci Data 10, 43 (2023). https://doi.org/10.1038/s41597-023-01936-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-01936-3