Introduction

Vertical integration of coal and electricity is the development trend of China's coal and power industry and is an effective means to alleviate the conflict between coal and electricity1. By integrating resources and production command, the enterprise effectively reduces production and internal transaction costs and maximizes residual income and optimizing internal distribution in the chain of integrated operations2. Vertical integration of coal and electricity can also enhance the adaptability of coal and electricity companies to market fluctuations and ensure a stable supply of coal in the form of internal transactions, which in turn will guarantee the supply of electricity.

However, China's electricity supply is still dominated by thermal power generation, as shown in Fig. 1. Although the proportion of thermal power in all power production has generally declined between 2016 and 2020, it still accounts for more than 70%. Therefore, China’s power generation industry is still highly dependent on a stable supply of thermal coal. Due to the wide scope of business and large scale of operation, coal-electricity-integrated enterprises usually can’t meet the coal demand by relying only on the supply of their own coal mines, so they also need to purchase outsourced coal3. Its coal supply structure is shown in Fig. 2. Due to the market-based pricing mechanism of commodity coal, its price is prone to fluctuations due to external factors4. Especially when the price of commodity coal rises, much higher than the long-term contract price, the outsourcing coal suppliers may reduce their shipments to the coal-electricity-integrated enterprises after measuring their default costs and benefits, resulting in shortage of coal supply and affecting the companys’ production and operation, at which time the coal-electricity-integrated enterprises will encounter a difficult period of coal procurement. When the difficult period is over, in order to optimize their coal supply chain, the enterprises will prioritize the allocation of orders to high-quality suppliers based on the shipments data during the difficult period.

Figure 1
figure 1

The structure of electricity production in China.

Figure 2
figure 2

The structure of coal supply chain in coal-electricity-integrated enterprises.

The evaluation of coal suppliers in coal-electricity-integrated company is studied in this paper for the following reasons:

  1. 1.

    Due to the fluctuation of coal price, the vertically integrated enterprises will encounter the difficult period of coal procurement, i.e. the outsourcing coal suppliers will choose to breach the contract for their own interest when the market price of coal is higher than the contract price and will not complete the order.

  2. 2.

    Purchased coal accounts for a portion of the coal used by a vertically integrated coal and electricity enterprise. And the order fulfillment of the outsourced coal suppliers affects the normal operation of this enterprise.

  3. 3.

    The vertically coal-electricity-integrated companies, as the demand side of coal, need to assign orders to superior suppliers on a priority basis based on past supplier performance.

  4. 4.

    The vertically integrated coal and electricity enterprises are usually large in scale and have a high degree of information technology platform development. The performance data of its coal suppliers is stored in the platform's back-end database. The evaluation model is required to update and display the evaluation results based on the changes of the platform background data, so a auxiliary decision-making module can be established in the information technology platform.

The purpose and main task of this paper is to seek a reasonable and accurate short-term supplier evaluation method that meets the requirements for evaluating the performance of thermal coal suppliers. The evaluation results can be used as the basis for coal-electricity integration enterprises to allocate coal orders. The method must be suitable for dynamic and objective evaluation data and meets the requirements of the company’s supply chain management system (SCM) for real-time update and display of evaluation results.

The CRITIC method is a comprehensive and objective weighting method. Compared with the entropy weight method and the FANMA method, it not only considers the dispersion of the data of each criterion evaluation set, but also considers the correlation of the criteria, and the calculation method is simple. When using CRITIC method to evaluate thermal coal suppliers, it is found that the conflicting value range of the indicators will lead to the problem of inaccurate calculation of the amount of information in the criteria, but the existing research literature rarely mentions this phenomenon. Therefore, this paper proposes an improved CRITIC method to evaluate and select suppliers based on the short-term delivery data of coal-electricity integrated enterprises. In the improved CRITIC method, the conflicting value ranges of indicators ignored in the traditional CRITIC method is considered. By using improved CRITIC method, the problem of inaccurate calculation of information content was solved.

Literature review

Coal-electricity integration studies

The mismatch of institutional change between the coal industry and the thermal power industry has led to a conflicting relationship between the two industries. Scholars have mainly studied the vertical integration of coal and electricity as a method to solve the conflict relationship between coal and electricity. Linmei Yuan5 analyzed the coal power industry through the SCP model, and believed that the vertical integration of coal power is an effective way to alleviate the contradiction between coal and electricity. Shukui Yu6 analyzed the significance, existing problems and development countermeasures of coal enterprises implementing vertical integration strategy. From the perspective of bilateral matching of coal and electricity, Rui Nie7 constructed a transaction model of coal and electricity enterprises, designed a price generation mechanism for thermal coal, and put forward countermeasures to ensure the smooth operation of the thermal coal market. Daqing Zhu8 argued that the current level of coal-electricity integration in China is still low, and accelerating the process of integration requires enhancing the short-term market power of power coal and coal-fired power producers.

As for the advantages of vertical integration of coal and electricity, Hongji Shi9 established a game model of coal-electricity integration and concluded that integration can improve corporate profits. Yujia Wang10 found that vertical integration of coal and electricity can optimize resource allocation and improvethe productivity of enterprises. David Brown11 found that vertical integration of electricity sector reduced retail electricity prices while increasing industry capacity investment, wholesale electricity supply and consumer surplus. Hongye Guo12 used the MILP model to simulate the integrated market equilibrium. Through case research, it is found that the introduction of vertical integration in the power industry can achieve Pareto optimality and alleviate the abuse of market power.

Scholars have also analyzed the problems that can result from the vertical integration of coal and electricity. Hongyi Li13 believed that compared with developed countries, vertical integration in developing countries improves the ability of insiders to obtain private benefits and has negatively effects on the efficiency of enterprises. IN Pinopoulos14 studied the welfare effect of the integration of upstream and downstream suppliers through linear tariffs, and found that when downstream enterprises have higher bargaining power, vertical integration will reduce the welfare effect. From the perspective of risk, Jinfang Yao15 analyzed the relationship between the degree of enterprise integration and financial risk through a multiple linear regression model and concluded that the higher the degree of vertical integration of coal enterprises, the greater the financial risk faced by the enterprises.

Studies on supplier evaluation and selection

The supplier evaluation and selection problem is often viewed as a multi-criteria decision problem (MCDM). Although scholars have already done relatively mature research on supplier evaluation, they mainly consider long-term factors, but there is not much research on the evaluation of suppliers based on a small amount of data in the short term. The large and comprehensive criteria system in long-term evaluation usually involves much data collection work, such as issuing questionnaires and consulting relevant materials, so it is impossible to evaluate based on data that changes in a short period. For thermal power plant companies, the behavior of their coal suppliers may change with coal price fluctuations in the short term, which is mainly reflected in the coal shipments of suppliers. In addition, with the improvement of the level of informatization, enterprises have also put forward higher requirements for the real-time display and update of the evaluation results. In conclusion, the evaluation based on short-term delivery data of each supplier can meet the needs for optimizing the thermal coal supply chain.

Scholars usually consider factors such as quality, price, and delivery when evaluating suppliers, and there are different criteria according to different evaluation objects, as shown in Table 1. Gamiy21 considered coal supply interruption from the perspective of coal mining and self-heating.

Table 1 Criteria involved in long term evaluation.

Methodology for determining indicator weights

In multi-criteria decision problems, it is often necessary to assign different weights to different criteria for the following two purposes. 1. For different research areas and evaluation purposes, different weights are usually assigned to each criterion due to different focus and emphasis. For example, for the evaluation of high-end manufacturing suppliers such as gas turbines, the highest weight is given to technology16. However, the evaluation of suppliers in industries that require environmental protection measures, such as papermaking enterprises, will give a higher weight to environmental factors22. 2. Making full use of the information in the evaluation data can make the evaluation results more distinguishable. As the evaluation data usually contains some mathematical features, such as variance, extreme deviation, information entropy, etc. Taking information entropy as an example, the greater the information entropy of the criterion the higher its data dispersion, which should be given a higher weight. Commonly used weighting methods include subjective methods, objective methods, and methods in which subjective and objective weights are combined. Representative subjective weighting methods include AHP23, DEMATEL24, etc. The subjective method can reflect the knowledge and experience of decision-makers, but it is highly subjective. Objective weighting methods, such as the entropy weight method25, CRITIC method26, etc., can reflect the influence of historical data on the weight of evaluation criteria and are highly objective, but highly dependent on data. The combined method27 refers to the combination of subjective and objective weight through additive synthesis, multiplicative synthesis, subjective correction based on objective, range maximization, or other combination methods, is a more holistic approach, but it's a lot of work.

Many scholars use subjective methods to determine weights. Dragan Pamucar28 thought that the traditional BWM method ignores the possibility of multiple evaluation criteria with the same meaning in the expert preference. He improved the traditional BWM method, reduced the number of comparisons, and verified the practicability of the improved method through examples. Based on the LBWA method, Sanjib Biswas29 analyzed the site selection of colleges and universities from the perspective of business school candidates, and considered that transportation convenience and commuting time are the primary factors. Alptekin Ulutas30 used the FUCOM method to evaluate the location of textile manufacturing facilities in a province of Turkey. Compared with the AHP and BWM methods, the FUCOM method has more reliable standard weight coefficients, which reduces the subjective influence and inconsistency of the expert preference on the final value of the standard weight. At the same time, the number of pairwise comparisons of the FUCOM method is significantly reduced.

When it comes to objective weighting methods, C Bai et al.31 evaluated the social sustainability of suppliers of an Iranian manufacturing company based on the TODIM method. The TODIM method is based on the value function of prospect theory, and establishes the relative superiority function of a solution compared with other solutions according to the psychological behavior of decision makers, and carries out solution selection according to the magnitude of superiority. MA Kaviani et al.32 applied gray system theory to supplier evaluation in the oil and gas industry considering the uncertainty faced by decision makers due to lack of experience and information.Chia-nan Wang et al.33 developed a flexible and generalizable supplier evaluation and selection model based on the TOPSIS method for supplier evaluation and selection in the Vietnamese apparel industry. Mukhametzyanov34 has conducted a comparative analysis of objective methods for determining criteria weights in MCDM and concluded that the soundness of all objective methods for evaluating criteria weights for MCDM tasks is questionable. Navid Zarbakhshnia et al.35 evaluated sustainable suppliers in the plastics industry based on the DEA approach in terms of inputs and outputs.

In terms of mixed subjective and objective evaluation methods, Stojanović I et al.36 assessed the logistics performance of the GCC countries based on the CRITIC-MABAC hybrid model, and selected the United Arab Emirates as a regional logistics hub that effectively integrates the GCC into the global supply chain system. Abdulaziz Alossta et al.37 solved the site selection problem of emergency medical centers based on the AHP-RAFIS method, and considered that the road network is the best construction site compared with other sites. Ibrahim Bad et al.38 used the FUCOM-MARCOS method to evaluate the green innovation capability of the Nigerian textile industry, and established a set of green innovation evaluation models that can be applied to other industries. As evaluation problems often involve fuzzy decision making, fuzzy theory is also widely used, usually in combination with other methods such as Fuzzy-AHP39, Fuzzy-TOPSIS40 and Fuzzy-TODIM41.

In summary, subjective, objective, and combination of subjective and objective weighting methods have their own advantages and disadvantages. Both subjective and subjective–objective-combined weighting methods involve questionnaires, expert consultation and other work, which usually take a long time and a large workload. So they are not suitable for rapid short-term evaluation. In this paper, since the supplier evaluation relied on short-term dynamic objective data, objective weighting method was adopted.

CRITIC method and improvement

CRITIC method

The CRITIC method measures weight according to the dispersion degree of the criterion and the conflict degree between criteria. The dispersion degree refers to the difference between the values of a criterion in each evaluation scheme. The conflict degree reflects the amount of similar information between different criteria. If a criterion has a higher dispersion degree and higher conflict degree, its weight should be larger.

Suppose there are n alternatives to be evaluated and p evaluation criteria to form the original data matrix:

$$X=\left(\begin{array}{ccc}{x}_{11}& \cdots & {x}_{1p}\\ \vdots & \ddots & \vdots \\ {x}_{n1}& \cdots & {x}_{np}\end{array}\right),$$
(1)

where \({x}_{ij}\) represents the value of alternative i regarding the criterion j.

To eliminate the influence of different dimensions on the evaluation results, it is necessary to perform the dimensionless process on each index. Positive normalization is used for positive volume criteria and negative normalization is used for negative volume criteria. Formula is as follows:

$$ \begin{gathered} {\text{For}}\,{\text{positive}}\,{\text{indicators:}}\,\,\,\,x_{ij}^{^{\prime}} = \frac{{x_{ij} - \min \{ x_{ij} \} }}{{\max \{ x_{ij} \} - \min \{ x_{ij} \} }} \hfill \\ {\text{For}}\,{\text{the}}\,{\text{negative}}\,{\text{indicators:}}\,\,\,x_{ij}^{^{\prime\prime}} = \frac{{\max \{ x_{ij} \} - x_{ij} }}{{\max \{ x_{ij} \} - \min \{ x_{ij} \} }}. \hfill \\ \end{gathered} $$
(2)

The dispersion degree is expressed in the form of standard deviation and is calculated as follows:

$$\left\{\begin{array}{c}{\overline{x} }_{j}=\frac{1}{n}{\sum }_{i=1}^{n}{x}_{ij}\\ {S}_{j}=\sqrt{\frac{{\sum }_{i=1}^{n}{({x}_{ij}-\overline{x })}^{2}}{n-1}}\end{array}\right..$$
(3)

The conflict degree is calculated by:

$${R}_{j}=\sum_{\begin{array}{c}i=1\\ i\ne j\end{array}}^{p}(1-{r}_{ij}),$$
(4)

where \({r}_{ij}\) represents the Pearson correlation coefficient of criteria i and j. The calculation formula is as follows:

$${r}_{ij}=\frac{\sum_{i=1}^{n}({x}_{i}-\overline{x })({y}_{i}-\overline{y })}{\sqrt{{\sum }_{i-1}^{n}{({x}_{i}-\overline{x })}^{2}}\sqrt{{\sum }_{i-1}^{n}{({y}_{i}-\overline{y })}^{2}}}.$$
(5)

The amount of information is calculated as follows:

$${C}_{j}={S}_{j}\times {R}_{j}.$$
(6)

The weight of the criterion is:

$${\omega }_{j}=\frac{{C}_{j}}{{\sum }_{j=1}^{p}{C}_{j}}.$$
(7)

Problems and improvements

If there are n criteria, the value range of Rj is [0, n−1]. When calculating Cj, according to formula (4), if Rj > 1, Cj will be larger than it should be after being multiplied; if Rj < 1, Cj will be smaller than it should be after multiplied. However, if the amount of information in one criterion is enlarged while another is reduced, the weight will be enlarged or reduced correspondingly, which will lead to unreasonable evaluation results.

The cause of this problem is the value range of Rj. If the value of Rj can be limited to [0, 1], the problem can be solved. The calculation of Rj can be improved as follows:

$${R}_{j}=\prod_{\begin{array}{c}i=1\\ i\ne j\end{array}}^{p}(1-{r}_{ij}).$$
(8)

Case study

Taking a large integrated coal and power company in China as an example, the amount of self-produced coal and purchased coal in its coal usage over the past 5 years is shown in Fig. 3. Purchased coal still accounts for about 1/3 of the used coal of the coal-electricity-integrated company. Suppliers and the company are usually bounded by long-term contracts for the trading of commodity coal. This coal-electricity-integrated company has 36 suppliers of thermal coal and the price of coal increased successively from September to October 2021 due to various factors such as cold spells, lower coal imports and other reasons, as shown in Fig. 4. As a result of the increase in the price of thermal coal, the coal suppliers reduce their coal shipments to the company for their own benefit, affecting the normal production activities of the company.

Figure 3
figure 3

The amount of coal used by a coal-electricity-integrated enterprise in the past 5 years.

Figure 4
figure 4

The trend of coal price.

The enterprise has a high level of information technology development.Relevant operation data can be obtained directly from the database of the information technology platform. The company wants to establish an auxiliary decision-making module in the platform to realize real-time evaluation, update and display of coal suppliers according to the platform's background data, so as to provide reference for the order allocation work of coal procurement personnel. Each supplier was evaluated using the shipment data during the difficult period of coal acquisition from September to October 2021.The current calculation is the order fulfillment rate of a supplier/total order fulfillment rate of all suppliers. This method does not consider the weight setting, ignores the relative importance between different evaluation criteria. And once the evaluation indexes are increased, the method is no longer applicable.

The improved CRITIC method determines the weights of each indicator based on the variability of indicators and the conflict between indicators, which does not rely on human decision making, is more scientific, and is compatible with the information technology platform. Therefore, in this paper, 36 coal suppliers are evaluated and selected by the improved CRIITC method assigned with the shipment volume of each site in late September, actual shipment volume and planned volume of each site in October as the evaluation data. Due to the small number of indicators and easy availability of quantitative data, a linear weighting method was used to calculate each supplier's score for simplicity, and the calculation process is as follows.

Raw data processing

Due to the different dimensions of the 3 criteria, normalization is necessary.

$$A={\left[\begin{array}{cccccccccc}0.595& 1.000& 0.380& 0.264& \cdots & 0.000& 0.123& 0.000& 0.000& 0.000\\ 0.901& 0.747& 0.538& 0.208& \cdots & 0.067& 0.067& 0.008& 0.000& 0.000\\ 1.000& 0.691& 0.829& 0.866& \cdots & 0.221& 0.141& 0.036& 0.000& 0.000\end{array}\right]}^{T}$$

where \({a}_{ij}\) represents the normalized value of alternative i regarding the criterion j.

Weight calculation

The CRITIC method is used to calculate the weight of each evaluation criterion, and the results are shown in Table 2.

Table 2 The results of the CRITIC method.

Red indicates a conflicting indicator calculation greater than 1. It can be seen from Table 2 that the conflict degree of the 3rd criterion is larger than 1, which leads to information distortion. As shown in Fig. 5, the slope of the curve changes more when the indicator conflict is at different levels, especially from 0.5 to 2.0, i.e., the influence of the indicator variability on the weights becomes greater. As can be seen in Fig. 5, the variability does not change much in the 0 to 1 interval with the change in the value of conflict. Therefore, the improved CRITIC method was used to calculate the weight of each criterion. The result is shown in Table 3.

Figure 5
figure 5

The impact of dispersion degree on weight at different conflict degree levels.

Table 3 The results of the improved CRITIC method.

It can be seen from Table 3 that the improved CRITIC method limits the range of conflict values. The correlation analysis shows that the correlation coefficient of dispersion degree and weight is 0.76 and 0.62 in the CRITIC and improved CRITIC method. It can be considered that the improved method reduces the impact of conflict degree on weight.

Suppliers evaluation based on improved CRITIC method

The evaluation result based on the weight method of CRITIC and improved CRITIC and the result of the original method is shown in Fig. 6.

Figure 6
figure 6

The result of different weighting methods.

Comparing the results of different methods, the ranking changes of each supplier are shown in Table 4. Number in parenthesis is the ranking changes of the supplier. The first number is the ranking change of the evaluation results weighted by the CRITIC method compared to the original results. The second number is the ranking change of results weighted by the CRITIC method compared to the improved CRITIC method.

Table 4 The ranking changes of each supplier.

Analysis and discussion

Comparing the result of the original evaluation method and CRITIC weighting method, it is obvious that suppliers 2 and 33 had a big rise in ranking. Supplier 2 ranked 4/36, 11 places upwards. The reason is that the original method only considered the shipment data in October while neglecting the data in late September. Supplier 33 ranks 21/36, 13 places upwards. Supplier 31 and Supplier 36 have dropped more in the ranking, ranking 11th and 15th in the CRITIC result, 6 and 7 places downwards respectively. The reason is that the suppliers delivered few coals when the company was short of coal in September.

Comparing the result of the CRITIC weighting method and the improved method, suppliers have little change in ranking. In this case, the weight change of the three cri-teria is − 4.59%, − 7.55%, and + 12.14% respectively. As is shown in Table 4, Supplier 36, which rose the most, rose 3 places, while Supplier 1 dropped the most, dropping 2 places.

After the internal discussion of the thermal power company, it is believed that the evaluation results of the improved CRITIC method are the most realistic.

This paper uses the linear weighting method to evaluate and rank suppliers based on evaluation index data, because the case has only three indicators and the linear weighting method is simple to calculate. TOPSIS, a commonly used supplier evaluation method42, calculates the relative closeness of each evaluation method to the positive and negative ideal solution to rank each supplier to select the best supplier. However, in the application scenario of this paper, as it is a short-term evaluation, the evaluation results need to be updated frequently based on the constantly changing short-term evaluation data, and TOPSIS may cause the reverse ranking phenomenon when the evaluation data changes frequently. Therefore, if the focus is only put on the selection of the best supplier over time, TOPSIS method can be adopted. Besides the best supplier, if it is also necessary to focus on changes in the ranking of suppliers, other methods need to be considered.

The subject of this paper is a coal-electricity-integrated enterprise, whose supply chain management system currently provides data including two indicators of the planned order quantity and actual shipment quantity of each supplier each month. As late September and October were the difficult period for the enterprise to procure coal, some suppliers refused to fulfil their long-term contracts and reduced their coal supply to the enterprise. The company would like to assess the performance of its suppliers based on data from this difficult period. In order to optimise its coal supply chain, the company would allocate more coal orders to stable suppliers based on the results of the assessment. Taking the results of the improved CRITIC method described above as an example, the highest ranked supplier during the difficult coal procurement period from late September to October is Supplier 9, and the relatively low ranked suppliers are Suppliers 28 and 29. Therefore, when placing coal orders in the following November, the coal-electricity-integrated company will increase the coal order quantity for Supplier 9 and decrease the coal order quantity for Suppliers 28 and 29 in order to establish a more stable coal supply chain.

Conclusion and outlook

Purchased coal accounts for a certain proportion of coal used by vertically integrated coal and electricity companies. Coal and electricity vertically integrated enterprises carry out thermal coal suppliers evaluation to help optimize their supply chain structure. Weight assignment based on the CRITIC method, which relies solely on the variability and conflict of indicators, excludes the influence of subjective factors. The improved CRITIC method solves the problem of distortion of information quantity caused by conflicting value ranges of indicators in the traditional CRITIC method, and improves the theory of weight calculation. From the empirical analysis, it can be seen that the short-term supplier evaluation based on the improved CRITIC method empowerment is objective and avoids unreasonable results due to subjective bias.

This paper has the following limitaions:

Evaluation indicators and data are few. In the future, with the construction of the company’s SCM platform and the continuous improvement of the system's back-end data, more evaluation indicators can be incorporated into the evaluation system.

Due to the real-time update of the data of the enterprise supply chain management system, the evaluation indicators, the number of evaluation schemes and the data in the evaluation model are in the process of dynamic change. The subjective weighting method, which requires questionnaires or expert scoring, usually takes a long time and cannot meet the needs of SCM to update the evaluation results instantly and dynamically, so the objective weighting method is chosen in this paper to determine the weights. However, a common drawback of the objective weighting method is that if an evaluation indicator is 20–30% higher than other indicators, the weight of that indicator will rise significantly, which is unreasonable. In the future, if the SCM system can integrate functions such as expert scoring into the system to reduce the time and workload required to determine subjective weights, then a combination of subjective and objective weighting methods can be considered in order to obtain more reasonable evaluation results.

The evaluation model is an important part of SCM and enables real-time updating and presentation of the evaluation results based on updates of the back-end data. The evaluation results are recognised by the empirically analysed company and have been used within the company. Therefore, the evaluation method proposed in this paper has a certain degree of practicality. In addition, the main difference between different countries/industries is that the evaluation indicators are different, and as long as the evaluation indicators have quantitative and objective data, the improved CRITIC method can be applied, which makes the method highly scalable. For different countries/industries, the CRITIC method can also be modified by the subjective weighting method to obtain more reasonable weights.

Due to the current general increase in the degree of informatization in various enterprises, the improved CRITIC method proposed in this paper provides some implications for the construction of order allocation decision-making platforms and SCM in other enterprises.