China’s low-carbon policy intensity dataset from national- to prefecture-level over 2007–2022

Dong, Xinyang; Wang, Can; Zhang, Fang; Zhang, Haowen; Xia, Chengqi

doi:10.1038/s41597-024-03033-5

Download PDF

Data Descriptor
Open access
Published: 16 February 2024

China’s low-carbon policy intensity dataset from national- to prefecture-level over 2007–2022

Scientific Data volume 11, Article number: 213 (2024) Cite this article

4607 Accesses
8 Altmetric
Metrics details

Subjects

Abstract

Low-carbon policies are essential for facilitating manufacturing industries’ low-carbon transformation and achieving carbon neutrality in China. However, recent studies usually apply proxy variables to quantify policies, while composite indices of policy intensity measured by objectives and instruments focus more on the national level. It is deficient in direct and comprehensive quantification for low-carbon policies. Hence, having extended the meaning of policy intensity, this paper constructs a low-carbon policy intensity index quantified by policy level, objective and instrument via phrase-oriented NLP algorithm and text-based prompt learning. This process is based on the low-carbon policy inventory we built for China’s manufacturing industries containing 7282 national-, provincial- and prefecture-level policies over 2007–2022. Lastly, we organize the dataset in two formats (.dta and .xlsx) for multidiscipline researchers. Apart from the inventory and intensity for each policy, the policy intensity is also aggregated to national-, provincial- and prefecture-level with sub-intensity for four objectives and three instruments. This dataset has potential uses for future studies by merging with macro and micro data related to low-carbon performances.

A computational approach to analyzing climate strategies of cities pledging net zero

Article Open access 26 August 2022

Diverse climate actors show limited coordination in a large-scale text analysis of strategy documents

Article Open access 09 February 2021

Threshold effect of technological innovation on carbon emission intensity based on multi-source heterogeneous data

Article Open access 04 November 2023

Background & Summary

Under the goal of carbon neutrality, building green and low-carbon transformation systems and facilitating high-quality development of manufacturing industries are of great importance to achieving ambitious decarbonization targets¹. Affected by the developing stage, China’s manufacturing industries are exposed to the dilemma of reducing carbon emissions and promoting economic development², which requires governments’ support and stimulation through low-carbon policies. However, owing to the data limitation, what types of low-carbon policies are effective in facilitating low-carbon transformation and industrial upgrading are still unclear. Therefore, it is imperative to construct a comprehensive low-carbon policy intensity index based on policy texts from national- to prefecture-level.

Policy intensity is an index reflecting the stringency and importance of policy, which weighs through policy objectives and policy instruments^3,4,5,6,7. While policy objectives mainly focus on issues that low-carbon policies formally aim to address, policy instruments pay attention to general norms that guide implementation preferences^4,7. Based on this “policy objective–policy instrument” pattern, the meaning of policy intensity has been interpreted from different perspectives. Some studies coded objectives in terms of emission reduction and renewable energy production⁸. Other researchers divided policy instruments into demand-side, environmental and supply-side instruments⁹. Meanwhile, six measures containing objectives, scope, budget, implementation, and monitoring were also adopted to build policy intensity⁸.

However, a majority of policy intensity indices are constructed and discussed at the national level, ignoring the contribution of policy effects from sub-national regions. For example, an environmental policy stringency (EPS) index developed by OECD measured 13 policy instruments for 40 countries^10,11. By investigating countries’ policy portfolios from 1998 to 2010, an index of climate policy activity was constructed for the energy production sector in Austria, Germany, and the United Kingdom⁸. To be more specific, policy level is an essential factor influencing the intensity of policy instruments. In general, national policies are more overarching, which provide developing directions and guidance for the whole country but promulgate few region-specific actions. Sub-national policies (e.g., provincial-level, prefecture-level instruments) are more focused and capable of devising appropriate measures suitable to local conditions. Hence, some policies from lower administration levels could have a higher intensity than their counterparts from higher policy levels, which needs to be quantified through policy intensity.

Moreover, studies pay less attention to the policy intensity of low-carbon policies. Previous literature mainly used dummy variables or ordinal variables to quantify the individual effect of pilot^12,13,14,15, short-term¹⁶, regional and industrial¹⁷ low-carbon policies, which are not able to evaluate the effects of low-carbon policies as a whole. Meanwhile, when it comes to different types of low-carbon policy instruments, although effects of tax measures, subsidies, investment incentives, bidding systems, voluntary programs, quantity obligations, and environmental protection laws have been discussed by several researchers^18,19, no consistent conclusion has been made for command-and-control regulation and market-based policies.

From the methodology perspective, owing to the immense progress made in machine learning and artificial intelligence, it is possible to construct policy indicators based on the meaning of policy texts and these methods, which could have limited human biases and increase the effectiveness of policy indices²⁰. However, this research area has not been sufficiently developed, and only a few studies built low-carbon or environmental indicators based on machine learning. On the one hand, a semi-automated policy analysis tool with a labelling process and sentence-BERT (SBERT) model was used to identify forest policies and classify policy instruments²¹. On the other hand, China’s national-level environmental policy intensity for 1978–2019 has been constructed by policy text analysis and machine learning algorithms (e.g., random forest, support vector machine, Ridge), which shows the policy-intensity evolution towards environmental issues⁵.

Therefore, this paper aims to build a low-carbon policy intensity index from 2007 to 2022 by prompt learning for China’s manufacturing industries, which contributes to the literature in three aspects. First, the meaning of low-carbon policy intensity is deepened by adding the factor of policy level, which is based on 7282 low-carbon policies promulgated from the nation, 31 provinces and all 334 prefecture cities for manufacturing industries. Hence, following the “policy objective-policy instrument” pattern, the low-carbon policy intensity in this paper is quantified by multiplying each policy’s objective, instrument, and level. Second, the construction process of low-carbon policy intensity combines phrase-oriented NLP algorithm and text-based prompt learning, which is a few-shot learning suitable for policy texts with smaller samples and higher manual labelling costs. This new paradigm in NLP field avoids the need for large samples in pre-train and fine-tuning paradigms. Meanwhile, it also reduces human biases and the non-repeatability brought by traditional 100% manual policy scoring. Lastly, this study further aggregates sub-intensity indices for three policy levels (i.e., national-, provincial-, and prefecture-level), four policy objectives (i.e., carbon reduction, energy conservation, capacity utilization, and technology), and three policy instruments (i.e., command-and-control, market-based, composite instruments). It is more convenient for future research to merge this dataset with macro- and micro-data for extended analysis and discuss the impact of low-carbon policies from different perspectives.

Methods

The research framework of this paper includes six modules, which are displayed in Fig. 1. Based on the constructed low-carbon policy inventory with 7282 policies over 2007–2022, this paper extends the “policy target-policy instrument” pattern and uses prompt learning to quantify the low-carbon policy intensity, thereby calculating the policy intensity via multiplying policy level, objective and instrument.

Data preparation

The section for data preparation lays the foundation for building low-carbon policy intensity, which is achieved through three steps: constructing a low-carbon policy inventory, structuring policy texts, and disaggregating texts into policy objectives and policy instruments.

Step 1: Constructing a low-carbon policy inventory

This paper collects low-carbon policies from PKULaw.com (https://pkulaw.com/), which is a policy database containing not only national policies but also policies from provincial- and prefecture-level. This paper focuses on different administration levels’ low-carbon policies from 2007 to 2022. 2007 is of great importance for China’s climate change issue when a national leading group headed by the Premier of the State Council and with officials from 30 ministries and commissions as members was founded in response to climate change, energy conservation, and emission reduction. Since then, China’s low-carbon policy has gradually changed from being attached to various environmental policies to an important policy category with more independent policies.

When selecting low-carbon policies, this paper defines a generalized scope of “low-carbon”. Keywords such as “carbon reduction”, “greenhouse gases”, “energy conservation”, “energy efficiency”, “energy consumption”, and “overcapacity” have been adopted to search policies, since policies in these fields are able to directly or indirectly mitigate climate change and influence low carbon development. Meanwhile, we put more emphasis on targeted and implementable policies, which have relatively clear low-carbon policy objectives and instruments with carbon reduction potential. To be more specific, despite higher legal forces, laws and outlines for quality development related to low-carbon and climate change are excluded from the policy inventory. Meanwhile, policies related to publicity have not been considered, which are more likely to raise awareness rather than taking action.

Finally, this paper builds a low-carbon policy inventory with 7282 policy texts from national-, provincial- and prefecture-level by careful selection. Policies from central and local departments related to development and reform, ecology and environment, finance, industry and information technology, and taxation are included in the inventory. Table 1 displays departments at different administration levels that have promulgated low-carbon policies.

Table 1 Departments from different levels that have promulgated low-carbon policies.

Full size table

Step 2: Structuring policy texts

It is known that policy texts have a high degree of fixed features, which contain structural information such as policy title, policy background, policy objectives, policy instruments, issuing institution, and publication year⁸. Hence, it is more convenient to grasp the meaning of whole policy texts by breaking it into different parts.

During the structuring process, headings and subheadings are key marks of each section. Headings of policy texts usually include sections for guiding ideology, working principle, target and goals, and supporting measures, which are led by phrases such as “Chapter X” and “Chinese number、”. Subheadings of policy texts are detailed sub-sections of headings, led by “(Chinese number)” or “Arabic figure、”. Thus, by locating lines with those special phases, headings, subheadings as well as contents for each part in the text file are able to be extracted respectively and sequentially to each cell of Excel in columns “title” and “content”.

Step 3: Disaggregating policy texts into the policy-objective file and policy-instrument file

Based on the Excel file generated in Step 2, we refine texts for policy objective and policy instrument into two separate files by keywords. On the one hand, “target”, “tasks”, “indices”, and “policy guidance” are keywords in headings, subheadings and contents for selecting policy objectives. Those cells are selected and sorted by subheadings, headings, and contents before being saved as a new text file for policy objective. Because the description of policy objectives in subsections tends to be more specified with clear expression than the counterparts. For some policy texts without headings or subheadings, sentences related to policy objectives cannot be located and have to be extracted from the whole content. On the other hand, policy instruments have diverse keywords and need to adopt an exclusion method. Hence, after dropping parts for policy objective and background (e.g., status, meaning, guidance, principle, general thoughts, requirements), the remaining parts of the policy are saved as a new text file for policy instrument.

Policy text classification

Having followed the “policy objective-policy instrument” pattern and refined text files for policy objective and instrument^5,8, we consider three factors influencing low-carbon policy intensity, namely, policy level, policy objective, and policy instrument. Based on the low-carbon policy inventory we built, this paper classifies policies into national-, provincial-, and prefecture-level, which are promulgated by departments of the nation, provinces, autonomous regions, or cities. As for policy objectives, we follow the scope of “low-carbon” defined in this paper, and include policy objectives for carbon reduction, energy conservation, capacity utilization, and technology. Carbon reduction is the composite objective directly related to low carbon, while energy conservation is the composite objective for energy consumption and efficiency indirectly associated with low carbon. The remaining two objectives focus on specific fields of easing overcapacity, technological innovation, and industrial upgrading, which show an indirect connection with low carbon. Finally, based on existing studies^5,22, we not only include policies with single command-and-control or market-based instruments, but also define policies with composite instruments (i.e., having command-and-control and market-based instruments in one policy at the same time).

Apart from the policy level, policy objectives and instruments are classified through high-frequency phrases extracted from policy texts and titles. First, during the process of data preparation, a lexicon has been constructed for policy classification, containing keywords and detailed expressions for titles and contents. Following Tian et al.²³, keywords related to different policy objectives and instruments are constructed by extensively checking through low-carbon literature, which include 38 seed words and are shown in Table 2. Furthermore, detailed expressions for titles and contents are extracted by using word2vec to acquire the semantic similarity between keywords and tokenized policy contents. To be more specific, having calculated the similarity between the word vectors of each phrase in policy contents and keywords, we selected 90 synonym words with the highest similarity to each keyword. After manually checking and excluding phrases with inappropriate meanings to low-carbon, a lexicon for policy classification is constructed and applied to calculate phrase frequency for policy text classification.

Table 2 Selection of keywords in different dimensions for policy text classification.

Full size table

Second, in the policy classification, one policy only belongs to a single category of policy objectives and policy instruments. For policies with the same number of phrase frequency in multiple categories, it is necessary to set the priority for classification. In this paper, composite objectives (i.e., carbon reduction, energy conservation) take precedence over specific fields (i.e., capacity utilization, technology), and direct policy objective (i.e., carbon reduction) is given priority over the indirect counterpart (i.e., energy conservation). For policy instruments, on the basis of using high-frequency phrases to divide command-and-control and market-based instruments, it is necessary to define the grouping criteria for composite instruments. The average phrase frequency for market-based instruments is 10 phrases higher than the counterpart for command-and-control instruments. Hence, policies are classified into the composite instrument if the phrase frequencies for both instruments are greater than 10 and the difference is within 10.

Quantifying low-carbon policy intensity

Having classified policies into different categories, we add the factor of policy level into “policy objective-policy instrument” pattern and calculate the low-carbon policy intensity for each policy using Eq. (1):

$$P{I}_{r,a,t}={L}_{r,a,t,l}\times {O}_{r,a,t,o}\times {I}_{r,a,t,i}$$

(1)

where $P{I}_{r,a,t}$ is the low-carbon policy intensity for policy a in region r and year t. Region r could be a nation, a province ρ or a prefecture-level city c. ${L}_{r,a,t,l}$ is the intensity of policy level l for policy a in region r and year t. O_r,a,t,o is the intensity of policy objective o for policy a in region r and year t. ${I}_{r,a,t,i}$ is the intensity of policy instrument i for policy a in region r and year t.

Figure 2 shows the components of low-carbon policy intensity. To be more specific, the policy level l could be national-, provincial-, or prefecture-level. Policy objective o has four categories (i.e., carbon reduction, energy conservation, capacity utilization, technology). There are three types of policy instrument i, namely, command-and-control, market-based, and composite instruments.

As shown in Fig. 2, $P{I}_{r,a,t}$ includes 36 types of policies and can be aggregated in different patterns by Eqs. (2–4).

$$P{I}_{r,t}={\sum }_{a}P{I}_{r,a,t}$$

(2)

$$P{I}_{r,t}=PI\_C{R}_{r,t}+PI\_E{C}_{r,t}+PI\_C{U}_{r,t}+PI\_{T}_{r,t}$$

(3)

$$P{I}_{r,t}=PI\_C{C}_{r,t}+PI\_M{B}_{r,t}+PI\_C{M}_{r,t}$$

(4)

Calculated by Eq. (2), $P{I}_{r,t}$ is the sum of policy intensity promulgated by region r in year t. From the perspective of policy level, $P{I}_{r,t}$ could be $PI\_{N}_{t}$, $PI\_{P}_{\rho ,t}$ or $PI\_{C}_{c,t}$. $PI\_{C}_{c,t}$ is the sum of prefecture-level low-carbon intensity promulgated by prefecture-level city c in year t. $PI\_{P}_{\rho ,t}$ is the sum of provincial low-carbon intensity promulgated by province ρ in year t. $PI\_{N}_{t}$ is the national level low-carbon intensity in year t. In Eq. (3), $P{I}_{r,t}$ is summed by four different policy objectives. $PI\_C{R}_{r,t}$, $PI\_E{C}_{r,t}$, $PI\_C{U}_{r,t}$ and $PI\_{T}_{r,t}$ are policy intensities respectively aiming at carbon reduction, energy conservation, capacity utilization, and technology promulgated by region r in year t. Finally, $P{I}_{r,t}$ can also be aggregated by three different policy instruments in Eq. (4). $PI\_C{C}_{r,t}$, $PI\_M{B}_{r,t}$ and $PI\_C{M}_{r,t}$ are respectively policy intensities for command-and-control, market-based and composite instruments for region r in year t.

Manual labelling

In order to create a supervised prompt learning model and predict intensities for policies’ objective and instrument, a manual annotation of 3334 policy texts in 16 provinces is used to train the model. The intensity of policy level ${L}_{r,a,t,l}$, policy objective O_r,a,t,o, and policy instrument ${I}_{r,a,t,i}$ are respectively rated on a scale of 1 to 3 (Label 1: weak intensity, Label 2: medium intensity, Label 3: high intensity, Label 4: highest intensity (for energy conservation objective only)).

For policy objectives, quantitative policy objectives with specific industrial, regional, or year-by-year supporting goals will be given a higher score. On the contrary, if the expression of policy objective is vague without quantitative or clear targets, the intensity will be relatively low. Unlike the scoring system for carbon reduction, capacity utilization, and technology goals, the policy objective for energy conservation has a 4-level scoring system and will be normalized within 3 (i.e., 3, 2.25, 1.5, 0.75) after scoring. Because the maximum level of energy conservation objective has been intensified after 2015, which changed from controlling the intensity of energy consumption to restricting the total amount and intensity of energy consumption at the same time. Hence, the energy conservation objective with level 4 is mainly used for scoring policies with “dual control of energy consumption”.

Meanwhile, the scoring criteria for three policy instruments are quite different. For command-and-control instruments, Label 3 will be given to policies for shutting down enterprises, eliminating excess production capacity and the target responsibility system for local government officials, which is related to their promotion, awards and punishment. Label 2 is mainly for strictly prohibiting additional production capacity, which has less impact on existing capacity. The intensity for market-based instruments is related to the clarity of policies’ economic supports. For composite instruments, since it includes command-and-control and market-based instruments at the same time, this type of policy will be respectively scored on the basis of two instruments. The maximum value of two labels will be taken as the intensity of composite instrument.

Detailed criteria and labelling examples for each objective and instrument are presented in Tables 3, 4.

Table 3 Labelling criteria of the intensity for different policy objectives.

Full size table

Table 4 Labelling criteria of the intensity for different policy instruments.

Full size table

Prompt learning and prediction

Based on the dataset of manual labelling, this paper trains models for each objective and instrument, and predicts the intensity by prompt learning, which is a few-shot learning suitable for policy texts with a smaller sample and higher manual labelling costs. Before prompt learning, the dominant pre-train and fine-tune paradigm in NLP requires a fixed architecture pre-trained as a language model (LM) and adapts the LM model to different downstream tasks by adding additional parameters and fine-tuning²⁴. Hence, it is inefficient due to the need for parameters for every single project, and the gap between large-scale unsupervised learning and downstream tasks. Compared with the paradigm mentioned above, prompt learning is able to reformulate downstream tasks, makes it similar to the original pre-trained LM, and predicts desired output without additional training for tasks with the help of a textual prompt²⁴.

To be more specific, prompt learning requires three steps to make a text input x to predict label output y through learning the parameters θ of model $P\left(y| x;\theta \right)$ in the task for text classification²⁴. First, a template is designed by the prompting function $prompt\;x{\prime} ={f}_{{\rm{prompt}}}\left(x\right)$ with an input slot x and an answer text z, which will be mapped to y. Then, based on the pre-trained LM and small word subset Z for classification, the highest-scoring text $\widehat{z}$ is searched using Eq. (5), which could maximize the score of LM. Finally, the highest-scoring answer $\widehat{z}$ is used to predict the highest-scoring label output $\widehat{y}$.

$$\widehat{z}=\mathop{{\rm{search}}}\limits_{z\in Z}P\left({f}_{fill}\left(x{\prime} ,z\right);\theta \right)$$

(5)

Therefore, this paper applies ERNIE 3.0 to conduct the task of classifying the intensity of texts for policy objectives and instruments through prompt learning. ERNIE 3.0 (https://wenxin.baidu.com/wenxin/modelbasedetail/ernie3/) is a large-scale knowledge enhanced pre-training model for language understanding, which has better performance on few-shot learning²⁵. Having trained the model with 3334 labelled policy texts, we conduct a model evaluation to evaluate the training result. Then two strategies of data augmentation (TrustAI) are used to deal with data sparsity and class-imbalance. Then, models with the best accuracy are selected and used to predict the intensity of policy objectives and instruments for the remaining unlabelled policy texts.

Low-carbon policy intensity integration

After receiving all the intensity of policy objectives and instruments through manual labelling and prompt learning, the low-carbon policy intensity of each policy is multiplied by Eq. (1). Descriptive statistics of the low-carbon policy intensity are provided in the Supplementary Information. Figure 3 further shows the heterogeneity of low-carbon policy intensity from the perspective of policy level, time, and space. Figure 3a,b indicate strong cyclicality for national-level low-carbon policy intensity, which had stronger intensity in 2011 and 2016 (i.e., the first year of China’s 12^th and 13^th Five-Year Plan). Policies for energy conservation had a better continuity, while carbon reduction started to have independent policies in 2010 after China first proposed the carbon emission reduction target in 2009. Although command-and-control policies dominated the intensity of national low-carbon policies before 2015, market-based and composite instruments made greater contributions after 2015. The yearly average low-carbon policy intensity at the provincial- and prefecture-level in Fig. 3c,d show that prefecture-level policy intensity was relatively lower compared with the provincial-level counterpart. Shanghai (east), Shandong Province (east), and Shaanxi Province (northwest) had the largest average provincial low-carbon policy intensity. Among 334 prefecture-level cities, 21 cities’ prefecture-level policy intensities were in the highest level, including 11 cities in the east of China (e.g., Wuxi, Jiangsu Province), 3 cities in the north of China (e.g., Shijiazhuang, Hebei Province), 3 cities in the central of China (e.g., Luoyang, Henan Province), and other 4 cities from southwest, northwest, and south of China.

Data Records

Not only the low-carbon policy inventory and intensity for each policy but also the aggregated low-carbon policy intensity for national-, provincial- and prefecture-level are organized in the dataset. We provide two different format datasets for multidiscipline researchers. The first format of the dataset is stored in Stata files (.dta), while the second format is in Excel files (.xlsx). Lexicons for policy classification and definitions of variables are also attached to the dataset. Detailed information for this dataset is listed in Table 5. These files are all accessible via figshare²⁶.

Table 5 Detailed information for data files.

Full size table

The link of each policy has been provided in folder “(For each policy) low-carbon policy inventory and intensity” of the dataset, which could locate to policy texts in the Laws & Regulations of PKULAW Database. Full policy texts in this database are only available for subscribers. Hence, researchers need to make sure that the academic institutions or individuals have subscribed to this database if you want to reproduce the process based on this paper.

Technical Validation

Model performance

In this paper, we use prompt learning to train models and predict the intensity for each objective and instrument. Among 7282 policies in our low-carbon policy inventory, 3334 policies with labels are used to train and evaluate the prompt learning model. 80% of the labelled dataset is used for training, while the remaining 20% is for validation. Two data augmentation strategies (i.e., sparse data identification and training data augmentation) are applied to improve prompt learning models by dealing with data sparsity and class-imbalance. Table 6 summarizes the accuracy of each model, which reflects the ratio of how many predicted labels are equal to actual labels in the sample. A detailed explanation of accuracy is provided in the Supplementary Information. Table 6 shows that the policy objective for carbon reduction has the best performance through “Prompt learning + Sparse data identification”, while “Prompt learning + Training data augmentation” is more suitable for the remaining three policy objectives and two policy instruments.

Table 6 Accuracy of prompt learning models.

Full size table

Comparison with existing studies

Due to the absence of a publicly available dataset for low-carbon policy intensity with three administrative levels, we cannot directly compare with existing datasets. However, there are present studies constructing environmental policy indices for China. On the basis of the “policy objectives-policy measures” pattern, Zhang et al. constructed China’s national-level environmental policy intensity for 1978–2019 by policy text analysis and machine learning algorithms⁵. OECD built a country-specific and internationally comparable environmental policy stringency index (EPS) on the basis of second-hand data for 13 policy instruments, which covers 40 countries from 1990 to 2020^10,11. These two policy indices have subcategories or sub-indices (e.g., energy conservation and emission reduction, industrial upgrading, market-based policies, non-market-based policies, technology support), which are closely related to low-carbon policies. Hence, the trend similarity among policy indices mentioned above could be used to validate the low-carbon policy intensity index constructed in this paper.

Following Zhang et al., we apply dynamic time warping (DTW) to measure the similarity between two series that vary in trend. Having normalized three indices, three pairs of DTW distance are calculated by Euclidean distance, and paths minimizing DTW distance between three national indices are shown in Fig. 4. The distance between this paper and Zhang et al. is 2.53, being the smallest distance among three pairs. Its DTW minimum paths in the white line are small and almost parallel with the main diagonal. This shows that there is a highest degree of trend similarity between national-level indices of this paper and Zhang et al.’s environmental policy intensity, both of which are based on policy text analysis but with different training methodologies. That is to say, the low-carbon policy intensity built in this paper is able to reflect the changing trend of policy intensity.

Manual checking

To further verify the accuracy of prompt learning prediction, we randomly select 500 policies through stratified random sampling based on policy objectives. Hence, each type of policy objective contains 125 policies to be manually labelled. From the perspective of policy instruments, these policies include 169 command-and-control policies, 276 market-based policies and 55 composite policies. Because composite policies are scored on the basis of two instruments, 224 command-and-control policies and 331 market-based policies need to be manually labelled. Table 7 presents the accuracy between results of prompt learning prediction and manual labelling, ranging from 0.85 to 0.93.

Table 7 Manual checking for prompt learning results.

Full size table

Usage Notes

Usage of low-carbon policy intensity

For the usage of this policy intensity index, the understanding of policy intensity needs to change from “policy-maker” to “policy-taker”. Hence, the meaning of $PI\_al{l}_{r,t}$ is the total low-carbon policy intensity received by region r in year t, which are $PI\_al{l}_{c,t}$ for city c in year t and $PI\_al{l}_{\rho ,t}$ for province ρ in year t. It is known that China is a centralized nation with the central government at the top and provincial, prefecture governments below it^27,28. Under the planning- and goal-based governance, the central government usually disaggregates low-carbon targets among provinces, while provincial governments assign targets to prefecture-level cities through agreements and performance measures to achieve their targets. Hence, China has a multi-layered policy structure²⁹, and governments not only implement policies from their own levels but also receive policies from higher administrative levels. To be more specific, when the research object is the performance of prefecture-level cities or firms, the received total low-carbon policy intensity $PI\_al{l}_{c,t}$ in Eq. (6) contains not only the intensity of prefecture-level policies but also those above-city policies (i.e., national- and provincial-level policies from which province it affiliates). If the study focuses on the provincial level, the total policy intensity $PI\_al{l}_{\rho ,t}$ in Eq. (7) received by provincial policy-takers includes both national and provincial policies.

Hence, the impact of policy intensity on city-level or firm-level policy-takers needs to be based on $PI\_al{l}_{c,t}$. If studies want to further discuss the impact of the policy intensity of different policy levels on carbon performances, the city-level intensity ($PI\_{C}_{c,t}$) and the above-city intensity $PI\_aboveCit{y}_{c,t}\left(=PI\_{P}_{\rho ,t}+PI\_{N}_{t}\right)$ in Eq. (6) can be added to the regression at the same time.

$$\begin{array}{ccc}PI\_al{l}_{c,t} & = & PI\_{C}_{c,t}+PI\_aboveCit{y}_{c,t}\\ & = & PI\_{C}_{c,t}+PI\_{P}_{\rho ,t}+PI\_{N}_{t}\end{array}$$

(6)

$$PI\_al{l}_{\rho ,t}=PI\_{P}_{\rho ,t}+PI\_{N}_{t}$$

(7)

The starting and ending years of each policy

For user convenience, this dataset provides policies’ starting and ending years. Unlike the certain starting year of each policy, ending years of most policies in China are not clearly stated. Hence, policies’ end years have been verified in four criteria. First, the ending year of a few policies has been mentioned in the policy text (e.g., the planning period is 2012–2015, valid for 5 years, valid until 2021). Then, some policies have been abolished or replaced by new policies. Moreover, for policies without a clear ending year or abolishment, the policy end-year is verified on the basis of policy objectives. For example, although a policy was released in 2007 and hasn’t been abolished yet, expressions such as “at the end of 11th five-year plan period” and “until 2010” could be seen in the section of policy objective. Hence, the ending year of this policy is verified to be 2010. Lastly, for policies that cannot be confirmed using criteria mentioned above, the ending years of those policies are marked as “-” for uncertainty.

Limitations and future work

This study has some possible limitations. On the one hand, limited by the availability of policy texts, this paper constructs the low-carbon policy intensity primarily based on policies that are publicly available. Hence, the intensity of those policy texts that have not been publicly released or only partly released in public cannot be fully evaluated. On the other hand, the release of policy texts and the execution from relevant departments are key factors affecting low-carbon policy effects. This paper mainly focuses on policy texts from the perspective of policy release, but pays less attention to policy execution. Future studies can further investigate the gap between the intensity of released low-carbon policy texts and strength of execution from different policy levels, which are likely to influence the progress of carbon reduction and low-carbon transformation.

Code availability

Code used for constructing the low-carbon policy intensity is written in Python 3.10.8 and Stata 15, and has been uploaded to figshare²⁶.

References

Rogge, K. S. & Schleich, J. Do policy mix characteristics matter for low-carbon innovation? A survey-based exploration of renewable power generation technologies in Germany. Res. Policy 47, 1639–1654 (2018).
Article Google Scholar
Li, L., Ma, S., Zheng, Y., Ma, X. & Duan, K. Do regional integration policies matter? Evidence from a quasi-natural experiment on heterogeneous green innovation. Energy Econ. 116, 106426 (2022).
Article Google Scholar
Knill, C., Schulze, K. & Tosun, J. Regulatory policy outputs and impacts: Exploring a complex relationship. Regul. Gov. 6, 427–444 (2012).
Article Google Scholar
Howlett, M. Governance modes, policy regimes and operational plans: A multi-level nested model of policy instrument choice and policy design. Policy Sci. 42, 73–89 (2009).
Article Google Scholar
Zhang, G. et al. China’s environmental policy intensity for 1978–2019. Sci. Data 9, 75 (2022).
Article PubMed PubMed Central Google Scholar
Cashore, B. & Howlett, M. Punctuating which equilibrium? Understanding thermostatic policy dynamics in Pacific Northwest forestry. Am. J. Polit. Sci. 51, 532–551 (2007).
Article Google Scholar
Howlett, M. & Cashore, B. The dependent variable problem in the study of policy change: Understanding policy change as a methodological problem. J. Comp. Policy Anal. 11, 33–46 (2009).
Google Scholar
Schaffrin, A., Sewerin, S. & Seubert, S. Toward a comparative measure of climate policy output. Policy Stud. J 43, 257–282 (2015).
Article Google Scholar
Huang, C., Yang, C. & Su, J. Policy change analysis based on “policy target–policy instrument” patterns: a case study of China’s nuclear energy policy. Scientometrics 117, 1081–1114 (2018).
Article Google Scholar
Fabrizi, A., Guarini, G. & Meliciani, V. Green patents, regulatory policies and research network policies. Res. Policy 47, 1018–1031 (2018).
Article Google Scholar
Kruse, T., Dechezleprêtre, A., Saffar, R. & Robert, L. Measuring environmental policy stringency in OECD countries: An update of the OECD composite EPS indicator. OECD Economics Department Working Papers https://doi.org/10.1787/90ab82e8-en (2022).
Zhu, J., Fan, Y., Deng, X. & Xue, L. Low-carbon innovation induced by emissions trading in China. Nat. Commun. 10, 4088 (2019).
Article PubMed PubMed Central ADS Google Scholar
Ren, S., Yang, X., Hu, Y. & Chevallier, J. Emission trading, induced innovation and firm performance. Energy Econ. 112, 106157 (2022).
Article Google Scholar
Pan, A., Zhang, W., Shi, X. & Dai, L. Climate policy and low-carbon innovation: Evidence from low-carbon city pilots in China. Energy Econ. 112, 106129 (2022).
Article Google Scholar
Liu, B., Gan, L., Huang, K. & Hu, S. The impact of low-carbon city pilot policy on corporate green innovation: Evidence from China. Financ. Res. Lett. 58, 104055 (2023).
Article Google Scholar
Tang, K., Qiu, Y. & Zhou, D. Does command-and-control regulation promote green innovation performance? Evidence from China’s industrial enterprises. Sci. Total Environ. 712, 136362 (2020).
Article CAS PubMed ADS Google Scholar
Jiang, Z., Wang, Z. & Li, Z. The effect of mandatory environmental regulation on innovation performance: Evidence from China. J. Clean Prod. 203, 482–491 (2018).
Article Google Scholar
Johnstone, N., Haščič, I. & Popp, D. Renewable energy policies and technological innovation: Evidence based on patent counts. Environ. Resour. Econ. 45, 133–155 (2010).
Article Google Scholar
Stucki, T., Woerter, M., Arvanitis, S., Peneder, M. & Rammer, C. How different policy instruments affect green product innovation: A differentiated perspective. Energy Policy 114, 245–261 (2018).
Article Google Scholar
Herman, K. S. & Shenk, J. Pattern discovery for climate and environmental policy indicators. Environ. Sci. Policy 120, 89–98 (2021).
Article Google Scholar
Firebanks-Quevedo, D. et al. Using machine learning to identify incentives in forestry policy: Towards a new paradigm in policy analysis. Forest Policy Econ. 134, 102624 (2022).
Article Google Scholar
Li, H., Zhu, X., Chen, J. & Jiang, F. Environmental regulations, environmental governance efficiency and the green transformation of China’s iron and steel enterprises. Ecol. Econ. 165, 106397 (2019).
Article Google Scholar
Tian, J., Cheng, Q., Xue, R., Han, Y. & Shan, Y. A dataset on corporate sustainability disclosure. Sci. Data 10, 182 (2023).
Article PubMed PubMed Central Google Scholar
Liu, P. et al. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Comput. Surv. 55, 1–35 (2023).
Google Scholar
Sun, Y. et al. ERNIE 3.0: Large-scale knowledge enhanced pre-training for language understanding and generation. Preprint at https://arxiv.org/abs/2107.02137 (2021).
Dong, X., Wang, C., Zhang, F., Zhang, H. & Xia, C. China’s low-carbon policy intensity dataset from national- to prefecture-level over 2007–2022, Figshare, https://doi.org/10.6084/m9.figshare.c.6761106.v1 (2024).
Young, O. R. et al. Institutionalized governance processes: Comparing environmental problem solving in China and the United States. Glob. Environ. Change-Human Policy Dimens. 31, 163–173 (2015).
Article Google Scholar
Li, H., Zhao, X., Yu, Y., Wu, T. & Qi, Y. China’s numerical management system for reducing national energy intensity. Energy Policy 94, 64–76 (2016).
Article Google Scholar
Chu, W. Industry policy with Chinese characteristics: a multi-layered model. China Econ. J. 10, 305–318 (2017).
Article Google Scholar
Zhao, X., Ma, X., Chen, B., Shang, Y. & Song, M. Challenges toward carbon neutrality in China: Strategies and countermeasures. Resour. Conserv. Recycl. 176, 105959 (2022).
Article CAS Google Scholar
Su, Z., Guo, Q. & Lee, H. Green finance policy and enterprise energy consumption intensity: Evidence from a quasi-natural experiment in China. Energy Econ. 115, 106374 (2022).
Article Google Scholar
Zhang, Y. & Fu, B. Impact of China’s establishment of ecological civilization pilot zones on carbon dioxide emissions. J. Environ. Manage. 325, 116652 (2023).
Article CAS PubMed Google Scholar
Pardo Martínez, C. I. & Silveira, S. Energy efficiency and CO₂ emissions in Swedish manufacturing industries. Energy Effi. 6, 117–133 (2013).
Article Google Scholar
Lin, B. & Liu, J. Principles, effects and problems of differential power pricing policy for energy intensive industries in China. Energy 36, 111–118 (2011).
Article Google Scholar
Sun, Y. China’s target responsibility system and convergence of CO₂ emissions. Singap. Econ. Rev. 63, 431–445 (2018).
Article Google Scholar
Lo, K., Li, H. & Wang, M. Energy conservation in China’s energy-intensive enterprises: An empirical study of the Ten-Thousand Enterprises Program. Energy Sustain Dev. 27, 105–111 (2015).
Article Google Scholar
Zhu, L., Zhang, X., Li, Y., Wang, X. & Guo, J. Can an emission trading scheme promote the withdrawal of outdated capacity in energy-intensive sectors? A case study on China’s iron and steel industry. Energy Econ. 63, 332–347 (2017).
Article CAS Google Scholar
Zhao, C. et al. The effect of air pollution control auditing on reducing carbon emissions: Evidence from China. Int. J. Environ. Res. Public Health 19, 17019 (2022).
Article CAS PubMed PubMed Central Google Scholar
Du, K., Cheng, Y. & Yao, X. Environmental regulation, green technology innovation, and industrial structure upgrading: The road to the green transformation of Chinese cities. Energy Econ. 98, 105247 (2021).
Article Google Scholar
Turken, N., Carrillo, J. & Verter, V. Facility location and capacity acquisition under carbon tax and emissions limits: To centralize or to decentralize? Int. J. Prod. Econ. 187, 126–141 (2017).
Article Google Scholar
Zhang, J., Liang, G., Feng, T., Yuan, C. & Jiang, W. Green innovation to respond to environmental regulation: How external knowledge adoption and green absorptive capacity matter? Bus. Strateg. Environ. 29, 39–53 (2020).
Article Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 72348001). The authors are grateful to Professor Junjie Zhang for his insightful discussion and helpful suggestions on research design.

Author information

Authors and Affiliations

State Key Joint Laboratory of Environment Simulation and Pollution Control (SKLESPC), School of Environment, Tsinghua University, Beijing, 100084, China
Xinyang Dong, Can Wang, Haowen Zhang & Chengqi Xia
School of Public Policy and Management, Tsinghua University, Beijing, 100084, China
Fang Zhang

Authors

Xinyang Dong
View author publications
You can also search for this author in PubMed Google Scholar
Can Wang
View author publications
You can also search for this author in PubMed Google Scholar
Fang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Haowen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chengqi Xia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.D. and C.W. designed the study and generated the dataset. X.D. and F.Z. specified standards of policy classification and scoring criteria for manual labelling. X.D. and H.Z. trained the prompt learning model for low-carbon policy intensity prediction. X.D. and C.X. conducted data validation. All the authors made significant contributions to manuscript editing, revising, and approved the final version of the manuscript.

Corresponding author

Correspondence to Can Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dong, X., Wang, C., Zhang, F. et al. China’s low-carbon policy intensity dataset from national- to prefecture-level over 2007–2022. Sci Data 11, 213 (2024). https://doi.org/10.1038/s41597-024-03033-5

Download citation

Received: 27 July 2023
Accepted: 31 January 2024
Published: 16 February 2024
DOI: https://doi.org/10.1038/s41597-024-03033-5

Subjects

Abstract

Similar content being viewed by others

A computational approach to analyzing climate strategies of cities pledging net zero

Diverse climate actors show limited coordination in a large-scale text analysis of strategy documents

Threshold effect of technological innovation on carbon emission intensity based on multi-source heterogeneous data

Background & Summary

Methods

Data preparation

Step 1: Constructing a low-carbon policy inventory

Step 2: Structuring policy texts

Step 3: Disaggregating policy texts into the policy-objective file and policy-instrument file

Policy text classification

Quantifying low-carbon policy intensity

Manual labelling

Prompt learning and prediction

Low-carbon policy intensity integration

Data Records

Technical Validation

Model performance

Comparison with existing studies

Manual checking

Usage Notes

Usage of low-carbon policy intensity

The starting and ending years of each policy

Limitations and future work

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links