An adaptive categorical effect size method based on intuitionistic meta fuzzy functions

There are several categorical effect size methods in the literature. It is not clear which method performs better for a given dataset and it is a challenging task to select the correct method for a given dataset. In this sense, to overcome the questions like “Which method should we choose?” and “Which categorical effect size method is more reliable for a given dataset?”, an adaptive categorical effect size method based on intuitionistic meta fuzzy functions is introduced in the paper. Thus, the main motivation of the proposed method is to obtain more accurate outcomes by combining the results of better performing methods instead of relying on only one method. In the study, the intuitionistic fuzzy c-means clustering algorithm is adapted to meta fuzzy functions by incorporating not only membership degrees but also non-membership degrees to improve the clustering accuracy of meta fuzzy functions. Meta fuzzy functions are the linear combination of seven categorical effect size methods and the weights, which are calculated from membership grades from intuitionistic fuzzy c-means algorithm. Among the functions, the one with the lowest mean absolute percentage error is selected as the best. To evaluate the performance of the proposed method, 2 × 3, 2 × 4, and 3 × 4 contingency tables were simulated. Additionally, the performance of the proposed method is also assessed by applying it to a real-time dataset. Experimental results show that the proposed method outperforms compared to the evaluated seven categorical effect size methods in terms of mean absolute percentage error. Also, the calculated effect sizes are within the range of ±10% in terms of bias. Thus, the results verified that proposed method achieves greater reliability.

based on their performances for a given dataset.In this sense, the motivation of this paper is to combine different categorical effect sizes methods in functions with Meta Fuzzy Functions (MFF) based on Intuitionistic Fuzzy C-Means Clustering (IFCM) algorithm.Fuzzy c-means (FCM) clustering algorithm is used in MFF.FCM, proposed by Bezdek et al. 12 , stands out as one of the frequently employed methods because of its simplicity and the benefits it offers compared to the k-means clustering algorithm.Nevertheless, it has certain drawbacks, including its susceptibility to initial settings and sensitivity to noise.In this sense, IFCM that accounts for hesitancy of an object belonging to a cluster is employed in MFF.Intuitionistic Fuzzy Sets (IFSs) are introduced as a modification of Zadeh's fuzzy set theory by Atanassov 13,14 .The main difference between fuzzy sets and IFSs is that fuzzy sets only consider membership degree while IFSs consider both membership and non-membership degrees.That is, IFSs account also for the hesitancy of membership grades in clusters.Thus, the centers of the clusters are obtained more accurately.It has been determined by the studies that IFSs are more effective than traditional fuzzy set theory by overcoming uncertainty 15 .IFSs have been commonly used for forecasting and engineering problems.In addition to time series and forecasting methods, IFSs are widely used in the field of medicine for clustering images and diagnostics [16][17][18] .Numerous studies employing IFSs have been proposed by Fan et al. 19 , Kumar and Gangwar 20 , Lei, et al. 21, Tak 22 , Gwak et al. 23 .
Because aforementioned advantages of IFCM in the literature, it is employed in MFF.The MFF was proposed by Tak 24 .The purpose of the MFF is to combine methods or definitions used for the same purpose.Its logic is simply based on meta-analysis.Meta-analysis is a method that combines the outcomes of multiple studies to yield stronger results for a specific purpose.For example, Tak and Gök 25 and Gök and Tak 26 utilized the MFF to merge different definitions of currency crisis.By employing this approach, they aimed to enhance the accuracy and reliability of their analysis.Similarly, Tak et al. 27 employed the MFF to combine various time series methods.Their objective was to improve the forecasting performance by integrating multiple forecasting techniques within the framework.Cevik et al. 28 used the MFF approach to forecast the number of immigrants within the maritime line.Tak 29 used the MFF approach to forecast combination.These studies have shown that combining different methods with the MFF has better estimation accuracy.
Yabacı Tak and Ercan 30 ensembled some ES definitions for two independent groups with MFF to obtain a more accurate effect size value.Yabacı Tak and Ercan 30 combined six effect size methods for numerical variables with the MFF approach by using classical fuzzy c-means algorithm (FCM), which can be used with or without the assumption of normal distribution.The combined methods in the previous study were not used for categorical variables.Thus, numerous categorical ES methods are combined in this study.Besides, the FCM clustering method only uses membership degrees while calculating the cluster centers.Thus, the MFF approach with the IFCM , which provides a more accurate estimation of the cluster centers, has been developed in the study.
In the light of this information, we will introduce intuitionistic meta fuzzy categorical effect size functions (I − MFCESF) approach.The aim of the study is to obtain better outcomes by combining seven categorical effect size measures in functions.The purpose of combining the ES is the assumption that each measure might have much or partial information for a given dataset.Therefore, while the methods that perform better will be gathered into one function, the methods that perform worse will be gathered into another.In the remainder of the paper, we will describe the IFCM and the meta fuzzy functions briefly in the section "Preliminaries".The proposed method (I − MFCESF) is discussed in section "Intuitionistic meta fuzzy categorical effect size func- tions (1-MFCESF)".The performance of the proposed method is evaluated with some applications for simulated and real datasets in section "Evaluation".Finally, the results of the proposed method are discussed in section "Conclusion".

Preliminaries
The methods (effect sizes, intuitionistic fuzzy c-means and meta fuzzy function) that are used in the paper are detailed in this section.

Categorical effect size methods
Short descriptions of seven types of ES measures are provided for r × c contingency tables.Cramer′sV is pro- posed in 1946 and it is an effect size measure that is generally used with nominal variables in r × c contingency tables 7,[31][32][33][34] .It is calculated in Eq. (1) based on Pearson's chi-square statistic.It takes values between 0 and + 1.
where, χ 2 is the Pearson's chi-squared statistics, n is the total observations number, c is the number of coloumns and r is the number of rows.In the Eq.(1), numerator of formula is based on the observed frequencies, denomina- tor of formula is based on an unobserved frequencies.Therefore, when Cramer′sV = 1 , the marginal frequencies are not zero and r or c has not zero cell frequencies.
Tschuprow ′ sT is a ES which measures the association between two nominal variables in rxc contingency tables 35 .It takes values between 0 and +1, and calculated in Eq. ( 2).
where, χ 2 is the Pearson's chi-squared statistics, c is the number of coloumns and r is the number of rows.
Another measure of categorical effect size is the Pearson ′ scontingencycoefficient ( Pearson ′ sc ).It takes values between 0 and + 1. Pearson ′ sc can be calculated in Eq. (3) 36 . (1) where, χ 2 is the Pearson's chi-squared statistics, and n is the total number of observations.
Cohen ′ sw effect size is proposed by Cohen 37 .Cohen ′ sw should be used for larger contingency tables.Cohen's w effect size measure is obtained in Eq. ( 4).
where, m is the number of cells, p 0i is the value of the ith cell under the null hypothesis, p 1i is the value of the ith cell under the alternative hypothesis.
Goodman − KruskalTau(G − KTau) is another ES measure of nominal variables.It measures the predictabil- ity of the column or row variable given the value of other variables, in percentage.The measure varies between 0 and 1 38,39 .G − KTau is calculated in Eq.( 5) 40 .
where, n is the total number of observation, a ij is the value of number of observation in ith row and jth column, a .j is the total number of observation in jth column and a i. is the total number of observation in ith row.
Uncertaintycoefficient(U) is first introduced by Theil 41 .It is also called Proficiency, Entropy Coefficient or Theil's U. It is often used as a measure of the ES of nominal variables in statistics and takes the value between 0 and + 1.This measure is defined in Eq. ( 6) where, H(X) is the entropy of a single distribution, H(XY ) is the conditional entropy and U(XY ) is the uncertainty coefficient.P X,Y x, y is the conditional distribution.
Goodman − KruskalLambda( ) statistic is an effect size proposed to measure the strength of the relationship between two nominal variables by evaluating the proportional reduction of error (PRE) 39 .Also, is the asymmetrical measure.The statistic takes value between 0 and 1.How to calculate the statistic is given in Eq. ( 8).
where, E 1 is the number of prediction errors made when the independent variable is ignored, E 2 equal to the number of prediction errors made when the prediction is based on the independent variable.

IFCM
Over the past decades, the fuzzy set theory proposed by Zadeh 14 has been expanded with different approaches.Among these, intuitionistic fuzzy set theory, which has been commonly used in the literature and has many applications in different fields, was developed by Atanassov 13 .While only the membership degree is taken into account in the FCM, non-membership degree is also taken into account in IFCM.So that, the centers of the clusters are calculated more accurately.Algorithm are given below 22 : Step-1.Determine the number of clusters (c) , the fuzziness index (f), and initialize the cluster centers (v i ) randomly.

Meta fuzzy functions
Tak 24 proposes MFF to combine different methods or definitions, such as prediction and forecasting.The MFF consists of three components: functions, weights, and the best meta fuzzy function.Functions; the linear combination of weights and the findings of the selected methods.Weights: the membership grades that are obtained from FCM clustering algorithm are used to compute weights.The best meta fuzzy function: the function that has the best evaluation criteria.Meta fuzzy functions begin with obtaining the outcomes of the methods chosen for a purpose as the input matrix.After that, the input matrix is clustered using fuzzy c-means clustering algorithm to separate the categorical ES methods based on how well they predict outcomes.As a result, each method will be assigned to a cluster with a membership grade.Then, using membership grades for each cluster, the weights of the methods are calculated.In this case, there will be an equal number of functions as the cluster number.Finally, the best meta fuzzy function is selected based on its evaluation criteria.

Intuitionistic meta fuzzy categorical effect size functions (I − MFCESF)
Cramer ′ sv, Tschuprow ′ sT, Pearson ′ sc, Cohen ′ sw, G − KTau, U and methods can be used to calculate effect size measures for a dataset.However, there is no definite information in the literature about which method is better or in which situations it should be used.Therefore, the performance of the methods may change according to the type of datasets.Because the performance of the ES measures in the proposed method is uncertain, we are looking for the optimum weights of the ES measures in the combination function.For this purpose, I − MFCESF method is proposed in this paper.The ES measures are clustered based on their performances by using the IFCM.There will be as many functions as the number of clusters.Functions are obtained by multiplying each method by its weight in the clusters.The ES measures that perform better for the dataset will be in a function with a higher membership degree, while the ES measures that perform worse will be in another function with a higher degree of membership.Finally, the function with the minimum model evaluation criterion is selected as I − MFCESF best and new effect size value will be calculated for the dataset.So, I − MFCESF method is an adaptive combination of categorical effect size measures.
Step-by-step algorithm, pseudocode and flowchart are given below for I − MFCESF approach.Algorithm 1 Step 1. Determine m categorical ES measures and simulated data randomly for t iterations.Obtain input matrix (Z) by applying m measures to the simulated dataset for t repeats.
where, Z ij is the ES value of ith repeat for jth measure.
The input matrix is clustered by using intuitionistic fuzzy c-means.
Step 2.1.The number of fuzzy clusters (c) is determined and fuzzy index value f and center of clusters (v) are initialized.
The degrees of membership ( µ ) and non-membership value are calculated in each cluster with Eqs.(9-11).
Step 2.3.The new clusters center is calculated by using Eq. ( 12).
Step 2.4.If the difference between two iterations drops under some threshold, stop the algorithm; otherwise, repeat Step 1 and Step 2.
Step 3. Intuitionistic meta categorical effect size functions are obtained.I − MFCESF is given in Eq. ( 14). ( where, c is the number of clusters, µ * ij is the membership grades of jth method in i th cluster,I − MFCESF i is the ith intuitionistic meta categorical effect size functions, and w ij is weight of j.th method in i th cluster. Step 4. Select the best intuitionistic meta categorical effect size functions that has the minimum Mean absolute percentage error (MAPE).
MAPE values are calculated for select I − MFCESF best .Mape formula is given in Eq. (16).
where, y i is the mean of the ES value calculated from each method for the population and y i is the predicted ES value obtained from 1000 simulated samples.The pseudo code and the flow chart of I − MFCESF based on MFF is given Algorithm 2 and Fig. 1, respectively.Use I-FCM to determine the weights of the categorical effect size measures in functions Obtain the I-MFCESF by using Eq. ( 14) and Eq. ( 15) Calculate MAPE values of I-MFCESF i=i+1 end while Return the function best of I-MFCESF that has the minimum MAPE Calculate the new categorical effect size value by using − .

Evaluation
The estimation performance of the proposed I-MFCESF method is evaluated through both simulation studies and the use of real-world datasets.In the simulation study, random generation of two categorical variables (x and y) is performed to create contingency tables of different sizes (2 × 3, 2 × 4, and 3 × 4).These tables are generated for a sample size of N = 1000 and repeated for t = 1000 iterations.Real-world datasets are obtained from the UCI Machine Learning Repository 42 , and 1000 different samples are taken with replacement from these datasets.By applying the selected categorical effect size methods to each dataset, an input matrix (Z) is obtained.
The I-MFCESF method incorporates two crucial parameters: the number of clusters (c) and the fuzziness index parameter (m).To determine the optimal number of clusters (c), the minimum mean absolute percentage error (MAPE) for the I-MFCESF is calculated iteratively between 2 and 5. Due to the lack of consensus on the optimal value for the fuzziness index parameter of IFCM (intuitionistic fuzzy c-means algorithm), a value of 2 is selected for this study.The performance of the proposed method is evaluated using the MAPE, which measures the average percentage difference between the estimated values and the true values.

Simulated 2 × 3, 2 ×4 and 3 × 4 contingency tables for the datasets of categorical variables
Two categorical variables x and y ( 2 × 3 , 2 × 4 and 3 × 4 contingency tables) are simulated randomly for N = 1000 sample size and t = 1000 iterations.Selected measures: Cramer ′ sv (Metasure 1), Tschuprow ′ sT (Measure 2), Pearson ′ sc(Measure 3), Cohen ′ sw (Measure 4), G − KTau(Measure 5), U (Measure 6) and (Measure 7) are applied to all datasets.The input matrix (Z) consists of the outcomes of the ES measures for the simulated data set.The proposed method utilizes the IFCM clustering algorithm, where the fuzziness index parameter (m) is set to 2. After obtaining the input matrix, the IFCM algorithm is applied.In this method, the number of functions is equal to the optimal number of clusters.Functions are obtained by multiplying the weights of the methods with the actual value and sum them (Eq.14) up.The weights of each method in each function are obtained as in Algorithm 1 (Step 3).Finally, the MAPE values are calculated for each from obtained I − MFCESF functions.When calculating the MAPE values, the actual value is considered as the average of the values calculated from the dataset of the selected seven ES measures.The function with the lowest Mean Absolute Percentage Error is chosen as I − MFCESF best and the new ES value is computed based on this selection.
The first dataset is simulated for 2 × 3 contingency table and the input matrix ( Z ) is obtained by applying the selected categorical ES methods.The first five and last five prediction values of the input matrix are summarized in Table 1.www.nature.com/scientificreports/For the first simulated dataset, the optimal cluster number, which is set to 2, is determined by selecting the minimum MAPE value for I − MFCESFs .As a result, two functions are obtained by multiplying each method with their respective weights.The weights for the I − MFCESF are computed using intuitionistic membership grades, as outlined in Table 2.The functions of the proposed method are obtained using the following equations (Eqs.17, 18).
Table 2 provides a clear depiction that I − MFCESF 2 exhibits the lowest MAPE.Therefore, I − MFCESF 2 is identified as the best I-MFCESF.The MAPE values are computed and presented in Table 3, to assess the performance of the proposed method.
Table 3 clear that the I-MFCESF outperforms the other categorical ES methods in terms of the MAPE values.According to the Li et al. 48a parameter prediction is considered acceptable when the bias is within ± 10%.The bias value of the proposed method was determined as − 1% in Table 3.Thus, the accuracy of the method is also sufficient in terms of bias.
A subsequent dataset is simulated for a 2 × 4 contingency table, and the input matrix (Z) is obtained by applying the chosen categorical ES methods.Table 4 provides a summary of the first five and last five prediction values found in the input matrix.
The weights for the I − MFCESF are calculated by using intuitionistic membership grades as in Table 5, and the functions of the proposed method are obtained as in Eqs.(19, 20).( 17)  Based on the information provided in Table 6, it is evident that the I-MFCESF method demonstrates superior performance compared to the individual categorical effect size methods in terms of MAPE.The bias of the proposed method is determined as respectively − 1.9%.Because bias is between ± 10%, the accuracy of the proposed method is also sufficient in terms of bias.
Lastly, a dataset is simulated for a 3 × 4 contingency table, and the input matrix (Z) is generated by applying the selected categorical ES methods.Table 7 provides a summary of the first five and last five prediction values found in the input matrix.
The weights for the I − MFCESF are calculated by using intuitionistic membership grades as in Table 8.
( www.nature.com/scientificreports/Table 8 demonstrates that two functions are computed by multiplying each method with their respective weights.In the case of I-MFCESF, the weights are determined using intuitionistic membership grades.The functions of the proposed method are derived using the equations provided in Eqs.(21, 22).
According to Table 8, it is evident that I − MFCESF 2 exhibits the lowest MAPE.Therefore, I − MFCESF 2 is identified as the best I-MFCESF.The MAPE of the methods are computed and presented in Table 9 to assess the performance of the proposed method.Based on the information provided in Table 9, it is evident that the I-MFCESF method outperforms the individual categorical ES methods in terms of MAPE.The I-MFCESF bias value was determined as respectively.− 3.2 %.Because bias is between ±10%, the accuracy of the proposed method is also sufficient in terms of bias.The "family history", "eosinophi", and "erythema" variables in the "Dermatology" dataset are used.In the dataset, the family history feature has the value "1" if any of these diseases has been observed in the family, and "0" otherwise.Eosinophi has the value "0"" if feature was not present, "1" indicate the relative intermediate values, "2" indicate the largest amount possible.Erythema has the value "0" if feature was not present, "3" indicates the largest amount possible, and "1", "2" indicate the relative intermediate values.A totally of 1000 different samples with replacements are drawn from the Dermatology dataset.In the proposed method, the input matrix (Z) is obtained from the outputs of the calculated categorical ES measures for these samples.Then, the membership grades are obtained by clustering the input matrix with the IFCM algorithm.The fuzziness index parameter ( m) is taken as "2".Using the membership grades, the weights of each categorical ES method in each cluster are calculated.The next step is to obtain the fuzzy functions by using the weights.There will be as many fuzzy functions as the optimum number of clusters.The optimum cluster number is searched between "2" and "5", iteratively.Finally, the fuzzy function with the smallest MAPE is chosen and the new effect size value is calculated.

Family history and Eosinophi variables ( 2 × 3 contingency tables)
"Family history" and "Eosinophi" variables are selected in the Dermatology dataset for 2 × 3 contingency table.
The input matrix (Z) is obtained from outcomes of seven ES measures for these variables.The first five and last five prediction values of the input matrix are summarized in Table 10.
The weights for the I − MFCESF are calculated as in Table 11 and I − MFCESF 1 and I − MFCESF 2 are obtained as in Eqs.(23, 24) for Family history and Eosinophi variables.( 23) www.nature.com/scientificreports/ In consideration of Table 11, it is obviously seen that the I − MFCESF 2 has the lowest MAPE.Thus, the best I-MFCESF is I − MFCESF 2 .Seven methods contribute the performance of the second function.Besides, the sixth method makes the most contribution, but the seventh, fifth, third, fourth, second, and first methods also have an impact on the effectiveness of I-MFCESF.The MAPE of the methods are computed, and the results are presented in Table 12 to evaluate the performance of the proposed method.Additionally, Fig. 5 provides a visual representation of the MAPE and Bias values for the proposed and selected methods specifically for the family history and eosinophi variables.
According to Table 12, it is obviously seen that proposed I-MFCESF outperforms other categorical effect size methods in terms of the MAPE criterion.Moreover, the bias value of the proposed method is in the range of ± 10%, and it was found to be sufficient in terms of bias.As a result, the new ES value is calculated as 0.020 from Eq. (25).

Family history and Eryhthema variables ( 2 × 4 contingency tables)
For 2 × 4 contingency table, "Family history" and "Erythema" variables are selected in the Dermatology dataset.The input matrix of I − MFCESF are obtained from outcomes of seven effect size measures for these variables.The input matrix is summarized in Table 13.
When the number of clusters was iteratively tried between 2 and 5 to obtain the smallest MAPE, it was determined as 3 for this data set.The weights for the $$I-MFCESF$$ are calculated as in Table 14 and $${I-MFCESF}_ {1}$$, $${I-MFCESF}_{2}$$ and $${I-MFCESF}_{3}$$ are obtained as in Eqs.(26-28).www.nature.com/scientificreports/According to Table 14, it is seen that the I − MFCESF 2 has the lowest MAPE and the best I-MFCESF is I − MFCESF 2 .Seven methods contribute to the performance of the proposed method.Besides, the first method makes the most contribution, but the third, fifth, seventh, fourth, second, and sixth methods also have an impact on the effectiveness of I-MFCESF respectively.The MAPE values of the methods are given in Table 15 to evaluate the performance of the proposed method.Also, Fig. 6 represents the MAPE and the Bias values of the proposed and selected methods for family history and eryhthema variables.

Family history x Eosinophi
It is clear from the Table 15 that proposed I-MFCESF give very accuracy prediction results for both evaluation criteria MAPE and bias.The MAPE value of the proposed method is better than other categorical effect size  www.nature.com/scientificreports/methods and the bias value is in the range of ± 10%.Therefore, I-MFCESF was found to be sufficient in terms of MAPE and bias.As a result, the new effect size value is calculated as 0.1328 from Eq. ( 29).

Eosinophi and Eryhthema variables (3 × 4) contingency tables
For 3 × 4 contingency table, "Eosinophi" and "Eryhthema" variables are selected in the Dermatology dataset.The input matrix of I − MFCESF are obtained from outcomes of seven effect size measures for these variables.The input matrix is summarized in Table 16.Table 17 is show that the weights are calculated on eosinophi and eryhthema variables.The functions I − MFCESF 1 and I − MFCESF 2 , which were created over the weights are given in Eqs. (30)and (31) 17, it is clear that regarding the MAPE criterion, I − MFCESF 2 function the best prediction performance for this contingency table.The most contributed performance of the proposed method is Pearson ′ sc .Also, other selected methods have smaller impact on the performance of the best function.Figure 7 represents the MAPE and the Bias values of the selected and proposed methods for eosinophi and eryhthema variables.
Table 18 lists the performances of selected and proposed method.It is obvious by looking at the MAPE and the Bias values of the methods that the best performance is produced by the proposed method.The bias value of the proposed methods is in the range of ± 10%, and the MAPE value of the proposed method is the lowest according to other effect size methods.Finally, new effect size value is calculated by using Eq.(32).

Conclusion
The significant two key points of the study can be highlighted as follows.The first, a new approach categorical effect size method based on the IFCM and MFF is used to ensemble seven different categorical effect size measures.Thus, instead of depending on a single categorical effect size method, seven categorical effect size methods are aggregated for more reliable and accurate outcomes.The second, I-MFCESF is an adaptive method that adjust itself based on the given dataset.Some advantages of I-MFCESF are below: (30)   www.nature.com/scientificreports/

Eosinophi x Eryhthema
The proposed method incorporates seven different categorical effect size measures that are proposed under various conditions.In the literature, Cramer ′ sv, Pearson ′ sc, Tschuprows ′ T, Cohen ′ sw, G − KTau, U and effect size measures are most used to r × c contingency tables.The interpretation ranges of these methods are in the same scale.Thus, these techniques are selected for the proposed method.
IFCM, in which the hesitancy of an object belonging to a cluster with a degree of membership valueis taken into consideration, is used to improve the performance of the proposed method to obtain more accurate results.
I − MFCESF is gathered the information of selected effect size measures in functions by considering their accuracy performances for a dataset.For example, for a given dataset, the X method may perform better than the Y method, while in another dataset, the Y method may perform better than the X method.In this case, the weight of the X method will be higher in the best in the first dataset, while the weight of Y method in the best function will be higher in the second dataset.For this reason, the proposed method has adaptive properties.
I − MFCESF is usually select the best effect size measures with a higher weight in terms of MAPE among seven measures.
To demonstrate the performances of the proposed method, we generate two randomly independent categorical variables for N = 1000 sample and t = 1000 repeat.Besides, we have investigated Dermatology real-world dataset which are taken from the UCI Machine Learning Repository database.According to the simulation results, MAPE was obtained as 0.4168 with a bias of − 0.0106 for the 2 × 3 contingency table, 0.3581 with a bias of − 0.0019 for the 2 × 4 contingency table, and 0.2753 with a bias of − 0.0032 for the 3 × 4 contingency table.The results obtained from the real data, on the other hand, were 0.3196 MAPE with a bias of − 0.0083 for the 2 × 3 contingency table, 0.4767 MAPE with a bias of − 0.0595 for the 2 × 4 contingency table, and 0.3335 MAPE with a bias of − 0.0370 for the 3 × 4 contingency table.Both the simulation study and the applications on the real data set showed us that; the proposed method can predict the results better than the other effect size measures in terms of MAPE and bias values.The MAPE value of the proposed method was found to be lower in all the application results compared to the other methods, and the bias value was in the range of ± 10%.From the results we can claim that I-MFCESFs improve prediction accuracy by combining different effect sizes results.The limitation of the study can be identified as the fact that the performance of the proposed method is affected by the performance of a clustering algorithm.Although, IFCM accounts for the hesitancy of an object to be belong to a cluster, it does not consider the outliers in the dataset.In this sense, possibilistic fuzzy clustering algorithm, that accounts for the outliers, can be adapted in MFF.This scenario is left for the future study.Therefore, as a future research direction, we plan to combine the effect size measures used for different types of variables and utilize possibilistic fuzzy c-means.Also, to improve the performance of the proposed method, different categorical effect size measures can be included in MFF.

Figures 2 ,
3 and 4 illustrate the MAPE and Bias values of the proposed methods and selected methods for various contingency tables.

Figure 2 .
Figure 2. MAPE and Bias values of the I-MFCESF and effect size methods for 2 × 3 simulated data.

Figure 5 .
Figure 5. MAPE and Bias values of the I-MFCESF and effect size methods for family history and eosinophi variables.

Figure 7 .
Figure 7. MAPE and Bias values of the I-MFCESF and selected methods for eosinophi and eryhthema variables u ik

Table 2 .
Weights of the I − MFCESF for 2 × 3 contingency table Significant values are in [bold].

Table 3 .
MAPE and BİAS values of the proposed and selected effect size methods for 2 × 3 contingency table Significant values are in [bold].

Table 5
clearly shows I − MFCESF 2 has the lowest MAPE.Thus, the best I-MFCESF is I − MFCESF 2 .The MAPE values of the methods are computed, and the results are presented in Table6to assess the performance of the proposed method.

Table 5 .
Weights of the I − MFCESF for 2 × 4 contingency table Significant values are in [bold].

Table 8 .
Weights of the I − MFCESF for 3 × 4 contingency table Significant values are in [bold].

Table 9 .
MAPE and BİAS values of the proposed and selected effect size methods for 3 × 4 contingency tables Significant values are in [bold].
The first dataset contains 34 variables; 33 of which are categorical and one of them is numerical.There are 366 observations in the dataset.The dataset is a related to the differential diagnosis of erythematous-squamous diseases.The data is taken from the UCI Machine Learning Repository database.It can be open accessed via (https:// archi ve.ics.uci.edu/ ml/ datas ets/ Derma tology).

Table 10 .
Input Matrix for family history and eosinophi variables contingency table

Table 11 .
Weights of the I − MFCESF for family history and eosinophi variables Significant values are in [bold].

Table 12 .
MAPE and BİAS values of the proposed and selected effect size methods for family history and eosinophi variables Significant values are in [bold].

Table 13 .
Input Matrix for family history and eosinophi variables contingency table

Table 14 .
Weights of the I − MFCESF for family history and eryhthema variables Significant values are in [bold].

Table 15 .
MAPE and BİAS values of the proposed and selected effect size methods for family history and eryhthema variables Significant values are in [bold].

Table 18 .
MAPE and BİAS values of the proposed and selected effect size methods for eosinophi and eryhthema variables Significant values are in [bold].