Introduction

In the last decade, halide perovskites have emerged as a class of promising solar cell materials. During this time, record efficiencies have surpassed 25%1,2 and the research has gone from basic research to gradually also containing more technology-oriented device development. Several companies now claim that commercial perovskites solar cells may be just around the corner3,4,5. Those early-stage commercialization efforts increase the demand for rigorous performance evaluation and standardized testing protocols. Accurate measurements and dissemination of reliable performance metrics are in fact vital for providing a truthful reflection of the field’s progress and necessary for guiding the development of commercial products. In this regard, the publication of erroneous and overinflated performance metrics is damaging the exploitation of any technology as it causes misled research efforts, diminishes the trustworthiness of the research field, and hampers the development of commercial devices.

For emerging PV materials, where device efficiency is a key performance indicator and the incentive to demonstrate high numbers is strong, history has shown us reasons to be concerned6,7,8. There is, for example, data indicating that the performance of organic solar cells systematically has been overestimated in the literature6. Probably less due to fraudulent intent and more due to a lack of unified testing protocols and appropriate check-ups. There are in fact several pitfalls when evaluating solar cell performance9,10, e.g. lamp calibration, spectral mismatch, light inhomogeneities, leakage currents due to improper masking, etc. Maybe are there also too little incentives to go the extra mile to ensure correctly calibrated measurements, especially if it would result in slightly lower values which may impede publication. Early development and work not explicitly focusing on efficiency optimisation are expected to be especially susceptible to errors of this kind. There are no reasons to believe that the perovskite field would be spared from such problems. One could in fact argue that this is a good example of a field where this type of problem can be expected.

Even if not taking into account common pitfalls like the calibration of the solar simulator, proper cell masking, and the correct determination of the active cell area9, the perovskites have their own set of peculiarities. Perovskites are for example known to show hysteresis in the JV response11,12,13, an initial burn-in14, and for having stability problems15,16,17. All of this can distort the performance metrics obtained from JV scans.

Those problems have not gone unnoticed. The research community has responded appropriately and there is in fact an entire sub-genre dedicated to discussing, evaluating, and proposing measuring protocols for emerging PV materials, both for general solar cell evaluation9,10,18, and for perovskites specifically19,20,21,22,23. Publishers are also more frequently pushing for the dissemination of detailed measurement protocols, as well as for the underlying raw data. Both the Nature journals and Energy and Environmental Science have for example implemented checklists to be used for anyone wanting to publish perovskite device data24,25,26, to mention just two prominent examples.

Measurement protocols and their underlying assumptions are, however, not static but gradually evolve as new pitfalls and peculiarities of the investigated systems are discovered. In this paper, we take a closer look at one of those quality checks, namely the integrated external quantum efficiency, Jsc,EQE, and how it compares to the short circuit current extracted from JV-measurements, Jsc,JV.

The standard way to evaluate solar cell performance is by a JV measurement. Due to known problems, like calibration uncertainties, spectral mismatch, and hysteresis, a comparison with a second type of independent measurement highly improves the trustworthiness of the results. Certification at an independent test institute is considered the gold standard. That is, however, not attainable for routine experiments. To verify that the determined PCE from a JV measurement reflects the steady-state performance of the solar cell, reference measurements carried out under steady-state conditions are therefore recommended. Two such options are external quantum efficiency, EQE, commonly carried out at a steady bias, i.e., 0 V, and stabilised efficiency during maximum power point tracking (MPPT), where the latter aims to mimic operational conditions.

Observed mismatch between integrated EQE and JSC

If the EQE is measured as a function of wavelength, the response can be multiplied with the AM 1.5 spectrum and the unit charge, q, and integrated over the entire spectra. This gives an estimate of the short circuit current under illumination, Jsc,EQE, (Eq. 1), which can be compared to the short circuit current extracted from the JV-measurement, Jsc,JV.

$${J}_{{sc},{EQE}}={\int }_{0}^{{{\infty }}}{qEQE}\left(\lambda \right)S\left(\lambda \right)d\lambda,\, S={AM}1.5$$
(1)

The underlying assumption is that Jsc,EQE and Jsc,JV should be the same, or at least quite similar and that a large mismatch indicates that something is not right19. This check is a part of the checklist in Energy and Environmental Science25, and the reporting standards of the Emerging-PV Reports Initiative27. Meanwhile, when submitting to Nature journals, the community is required to provide details of characterization, or experimental and analytical design in the solar cells reporting summary, where they are being asked if a comparison of the two values has been made26. The equivalence of Jsc,EQE and Jsc,JV is considered as a robust metrics, at least for established PV technologies. It is for example the basis of one of the most common procedures of calibrating solar simulators, where its light intensity is tuned until the Jsc,JV for a reference silicon cell matches its Jsc,EQE9. A Jsc,EQE and Jsc,JV discrepancies have also been used to detect questionable data in the organic PV-community6.

For perovskite cells, concerns have been raised that this may be a less reliable test than assumed. In 2020, Saliba et al. pointed out that for externally certified cells reported in the literature, Jsc,JV appears to be systematically larger than Jsc,EQE28. The dataset was, however, small and the underlying papers do not always clearly state if the EQE was measured on the exact same devices as the corresponding certified JV data. This was an indication of a mismatch between the two metrics, but with insufficient data to turn the intuition into a solid conclusion.

Here we can now demonstrate that for perovskite cells there really is a statistically significant discrepancy between Jsc,EQE and Jsc,JV. This has been possible due to the Perovskite Database Project29,30, which is the result of a communal effort to collect all perovskite solar cell device data available in the peer-reviewed literature and make it comply with the FAIR data principles, i.e. Findable, Accessible, Interoperable, and Reusable31,32. In the Perovskite Database, there are at the time of writing data for over 42,000 devices, which represents essentially every device someone has thought it worth the trouble to properly describe in the peer-reviewed literature up until spring 2020.

Out of the >42,000 devices in the dataset, there are 5575 for which both Jsc,EQE and Jsc,JV are reported. In Fig. 1, the Jsc,EQE is plotted against the Jsc,JV for the entire dataset, including devices of all levels of performance, with all possible architectures, stack sequences, and perovskite compositions. This representation of the data shows that the two values are indeed rarely the same. Jsc,JV is larger than the Jsc,EQE for 83% of all data points.

Fig. 1: Relation between Jsc,JV and Jsc,EQE.
figure 1

a Integrated external quantum efficiency, Jsc,EQE, against short circuit current from JV-measurements, Jsc,JV, for all 5575 devices found in the Perovskite Database where both values are reported. The black diagonal line represents Jsc,JV = Jsc,EQE. b Distribution of Jsc,JV/Jsc,EQE for the entire dataset. The bin size is set to 0.004.

Possible causes for the observed mismatch

Potential reasons for the discrepancy between Jsc,JV and Jsc,EQE are manifold. First of all, measuring absolute EQE values is non-trivial. The calibration can fail. The implementation of the integration algorithm can affect the results if the distance between measurement points is too large33. A difference between the sun simulator and the reference AM 1.5 spectrum also has an effect, especially if the calibration diode’s absorption range is different from that of the test cell, which is the case if a silicon diode is used. This could be a problem when standard xenon lamps are used for IV measurements as they may have intensity spikes which fall well within the absorption range of silicon but are located outside the region of the absorption onset of typical perovskite compositions. If a lock-in amplifier is used, the frequency of the light modulation can affect the results as well33,34,35. Some of the observed discrepancy will certainly be due to errors like these. Even if there were numerous ways to mess up both JV and EQE measurements, we would expect such errors to be random and go in both directions. On average, we posit that random errors will average out when analyzing large datasets like this one.

To elucidate the exact physical mechanism, or mechanisms, behind this observation will require detailed experimental work beyond the scope of this report. We can, however, quantify the discrepancy. If Jsc,JV/Jsc,EQE is plotted as a box diagram (Fig. 1b), a normal distribution is assumed, and obvious outlier are excluded, i.e., only include Jsc,JV/Jsc,EQE = 1 ± 0.25, the distribution has a mean, m, of 1.041 with a standard deviation, σ, of 0.057 (including outliers give m = 1.050 and σ = 0.15). This translates to a 76% probability of measuring Jsc,JV/Jsc,EQE above 1. Data does not perfectly follow a normal distribution but is skewed towards higher values, reflected in that over 83% of all reported data points are above 1. The historical data thus indicate that for a random device one would expect Jsc,JV to be around 4% larger than Jsc,EQE if both device and measurements are sound. A few measurements of Jsc,JV/Jsc,EQE around 1.04 would not be a sign of a systematic discrepancy, especially given that it is not uncommon to explain away a 5% mismatch as an acceptable calibration error. When all data are considered together, the mismatch is, however, obvious.

Several possible hypotheses can be discarded based on the dataset. The cell efficiency, PCE, does for example have no effect on the average Jsc,JV/Jsc,EQE. For the worst cells, i.e. PCE < 5%, the data scatters widely, but for better cells the only change seen is a decrease in the spread of the Jsc,JV/Jsc,EQE values with increased PCE (Supplementary Fig. 1). There is no systematic shift of the Jsc,JV/Jsc,EQE values with respect to the open circuit voltage, Voc, or the fill factor, FF, either (Supplementary Fig. 23). When the short circuit current Jsc,JV changes, the same average value around 1.04 is seen except for the lowest and the very highest currents. For low Jsc,JV, the data is scarce and a lot of things can go wrong in a failed device. For the highest reported Jsc,JV, the data is also scarce and those are reasonably the values with the highest probability of being erroneously high, as seen by the fact that some of those values are approaching, or even exceeding, the Shockley–Queisser limit.

The perovskite cells are known for that there can be hysteresis during JV-sweeps. That could obstruct the accurate determination of cell performance11,12,13, and at first glance that could be a reasonable hypothesis for the Jsc,JV, Jsc,EQE mismatch. The amount of hysteresis in the JV-measurement does, however, not affect the average Jsc,JV/Jsc,EQE values (Supplementary Fig. 5). At least not to the extent the hysteresis is reflected in the JV-measurement, which may not always be the case as the JV-sweep parameters could be selected to minimize the measured hysteresis without reflecting any true steady-state conditions. The carrier extraction at the charge selective contacts does not have an effect either. The same average values for Jsc,JV/Jsc,EQE are observed for all common hole and electron transport layers (Supplementary Fig. 89). That is also true for both nip and pin device architectures (Supplementary Fig. 10). Only for stack layers with few reported cells do we observe a larger spread in values. That is statistically expected, but even there the average Jsc,JV/Jsc,EQE values are positive.

If the current mismatch would be a result of measurement or calibration errors, one would expect those to decrease with time as groups improve on their artesian handicraft and their experimental protocols, as knowledge spreads through the community, and as more rigorous measurement protocols are followed. To our surprise, this is not the case (Fig. 2).

Fig. 2: Development of the Jsc,JV/Jsc,EQE as a function of publication date.
figure 2

The bin size is half a year. The end of the boxes represents the 25 and the 75 percentiles. The whiskers are placed at an interquartile range of 1.5, which means that for a normal distributed data set, 99.3% of points should be within that range.

Given the perovskites stability problems, some test cells may have degraded between measurements which would give a lower Jsc,EQE as that measurement most often is done after the JV-measurement. The time consistency of the effect does, however, talk against that as a main exploration given the gradual increase in stability seen due to for example perovskite composition engineering.

The observed short circuit current mismatch is thus probably not due to measurement errors. It does not depend on device performance, the cell architecture, or the choice of charge-selective contacts. It has also been persistent over almost a decade of development. This makes it reasonable to assume that this effect either has something to do with the perovskite itself or that there is something in the measurement situation that makes the perovskite behave differently in typical JV and EQE measurements.

In terms of the perovskite, the composition and the band gap does not seem to have an impact. This is at least true for all compositions and band gap ranges where there is sufficient data for reliable statistics (Supplementary Fig. 7). For the highest band gap devices, i.e., above 2.4 eV, the effect seems to disappear, but in that band gap range there are too few data points to draw any firm conclusions. The largest spread in the data we have found is with respect to the perovskite deposition method, but a larger spread in the Jsc,JV/Jsc,EQE values is only seen in cases where the data points are few (Supplementary Fig. 12).

In terms of the measurement conditions, in a JV-sweep, the voltage is constantly changed while it is held constant during each EQE measurement. The former is hence prone to be affected by capacitive or “poling” effects caused by the voltage sweep, which get mobile ions to move around in the applied electric field causing both capacitive effects and charge accumulation at interfaces36,37. A JV measurement is also generally much faster than an EQE measurement, at least historically. Another set of possible explanations revolves around that EQE often is measured without a biased light, or with a biased light much less than one sun. The photophysics of the perovskite may thus be different in the two measurements. Dye Sensitized Solar does for example tend to perform best at lower light intensities due to the photocurrent being nonlinear with respect to light intensity38. That could, however, not be the explanation here as that would cause the Jsc,EQE to be larger than Jsc,JV6.

There are several things that can change in the perovskites under increased illumination. There are for example reports of light-induced phase separation39, light-dependent ion migration40,41, a large light intensity dependence of the dielectric constant42, as well as a strong correlation between light intensity and conductivity with up to 3-4 orders of magnitude difference between AM 1.5 and the dark43,44, which could cause larger resistive losses at low light conditions. The recombination pathways can also be different. The photoluminescence quantum yield has for example been found to be strongly dependent on excitation intensity, which is commonly interpreted as differences in the fraction of charge carriers recombining non-radiatively45, i.e., certain defect levels that cause non-radiative recombination or lead to lower extraction efficiencies may simply get saturated at higher light intensities. This points towards several plausible mechanisms that potentially could explain lower short circuit currents from low light intensity EQE measurements. The Perovskite Database does, unfortunately, not resolve the bias light conditions during EQE measurements, but based on the reasoning above, some of the most reasonable hypotheses seem to be related to light intensity-induced changes in the perovskite. Those could for example be related to differences in photo current generation, conduction, ion transport, or recombination. Resolving this will require further detailed studies.

Alternative quality checks

An alternative quality check of the device performance is to measure stabilised efficiency, PCEstab, ideally under maximum power point tracking. That is a steady state measurement under operational conditions and considered to be a better measure of the true efficiency of a cell than the PCE extracted from a dynamic JV-scan, PCEJV. Not surprisingly, the PCEstab is lower than PCEJV. For the 3367 data points where both measures are available the median difference is 2.4 % (Supplementary Fig. 13), which is in the same direction but smaller than the average discrepancy between the Jsc,JV and the Jsc,EQE. If the two discrepancies, i.e. PCEJV/PCEstab and Jsc,JV/Jsc,EQE are compared, the median value of their quotient is 1.016 (Supplementary Fig. 14) indicating a co-variation.

Summary

To summarize. Based on the historical device data in the Perovskite Database we have demonstrated a systematic discrepancy between the short circuit current extracted from JV-measurements, Jsc,JV, and that obtained from integrating the external quantum efficiency Jsc,EQE. The Jsc,JV could on average be expected to be 4% larger than the Jsc,EQE. Based on the data in the Perovskite Database we show that this mismatch transcends device performance, cell architecture, the choice of charge selective contacts, perovskite composition and deposition procedure and that it has been persistent for almost a decade of experimental development. The exact mechanism behind this discrepancy is still unclear and requires further experimental studies.

Conclusion

While we here could demonstrate that the short circuit current extracted from JV measurements, Jsc,JV on average is 4% larger than the integrated external quantum efficiency, an approximate resemblance of the two measurements is still a reasonably good consistency check. However, if the two values systematically and on average are equal, it is a high probability that at least one of the measurements is inaccurate. In other words, a higher Jsc,JV compared to JSC,EQE is the norm rather than the exception.

One of the goals of this work is to highlight potential problems of systematic overrepresentation of performance data published on perovskite PV devices. From this metadata study, we conclude, among others, that more attention should be paid to MPP tracking as the primary performance measurement, which compared to JV-scans and EQE measurements is the measurement most closely representing relevant operational conditions.

This work also demonstrates the power of common data dissemination platforms like the Perovskite Database Project46, which enabled us to demonstrate new perovskite physics to explore that previously had been overlooked due to the lack of large datasets. Finally, it demonstrates that some of the perovskite communities' reporting protocols and their underlying assumptions need a slight update.