Neural networks and particle swarm for transformer oil diagnosis by dissolved gas analysis

Guerbas, Fettouma; Benmahamed, Youcef; Teguar, Youcef; Dahmani, Rayane Amine; Teguar, Madjid; Ali, Enas; Bajaj, Mohit; Dost Mohammadi, Shir Ahmad; Ghoneim, Sherif S. M.

doi:10.1038/s41598-024-60071-0

Download PDF

Article
Open access
Published: 23 April 2024

Neural networks and particle swarm for transformer oil diagnosis by dissolved gas analysis

Fettouma Guerbas¹,
Youcef Benmahamed²,
Youcef Teguar¹,
Rayane Amine Dahmani¹,
Madjid Teguar²,
Enas Ali³,
Mohit Bajaj^4,5,6,
Shir Ahmad Dost Mohammadi⁷ &
…
Sherif S. M. Ghoneim⁸

Scientific Reports volume 14, Article number: 9271 (2024) Cite this article

361 Accesses
Metrics details

Subjects

Abstract

The lifetime of power transformers is closely related to the insulating oil performance. This latter can degrade according to overheating, electric arcs, low or high energy discharges, etc. Such degradation can lead to transformer failures or breakdowns. Early detection of these problems is one of the most important steps to avoid such failures. More efficient diagnostic systems, such as artificial intelligence techniques, are recommended to overcome the limitations of the classical methods. This work deals with diagnosing the power transformer insulating oil by analysis of dissolved gases using new techniques. For this, we have proposed intelligent techniques based on Multilayer artificial neural networks (ANN). Thus, a multi-layer ANN-based model for fault detection is presented. To improve its classification rate, this one was optimized by a meta-heuristic technique as the particle swarm optimization (PSO) technique. Optimized ANNs have never been used in transformer insulating oil diagnostics so far. The robustness and effectiveness of the proposed model is demonstrated, and high accuracy is obtained.

Discernment of transformer oil stray gassing anomalies using machine learning classification techniques

Article Open access 03 January 2024

Application of back propagation neural network in complex diagnostics and forecasting loss of life of cellulose paper insulation in oil-immersed transformers

Article Open access 13 March 2024

Damage assessment in structures using artificial neural network working and a hybrid stochastic optimization

Article Open access 23 March 2022

Introduction

Power transformers are crucial and essential components in the electrical power transmission and distribution networks. Such expensive electrical devices should work properly for years¹. The lifetime of a power transformer closely depends on its insulation system, generally consisting of a traditional solid component (paper, etc.) and a dielectric fluid².

Most power transformers use insulating oil as dielectric fluid, due to its low price and good physico-chemical properties. Besides insulation, this oil dissipates the heat generated by the magnetic circuit and the windings. Following its movement in a transformer in service, the insulating oil conducts this heat to the internal cooling systems (radiators, etc.), before releasing it into the environment^2,3.

Insulating oil is subjected to several electrical, thermal and chemical constraints in service. These latter lead to the gradual degradation of the insulating oil and eventually cause the transformer to de-energize when not analyzed in time^4,5. Indeed, various oil analyses are proposed to diagnose the power transformer's internal state. The most popular are physico-chemical analyzes^2,3,4 and dissolved gas analysis (DGA)^5,6,7,8,9,10.

DGA is a widely used as diagnostic technique. It is based on interpreting the concentrations of gases dissolved in the insulating oil. Indeed, the oil decomposes under electrical and thermal stresses, releasing gases in small quantities^5,6,7,8,9,10. In fact, DGA can be performed by introducing sensors into the transformers in service (online mode), or in the laboratory on samples (offline mode)⁸.

The five main gases resulting from the oil decomposition are hydrogen (H₂), methane (CH₄), acetylene (C₂H₂), ethylene (C₂H₄) and ethane (C₂H₆). The proportions of the concentrations of these gases in a sample allow determining the defect type^5,6,7,8,9,10. According to IEC 60599 (2007)¹¹ and IEEE Standard C57.104¹², six electrical and thermal faults exist. They consist of partial discharges (PD), low energy discharges (D1), high energy discharges (D2), thermal faults for T < 300 °C (T1), thermal faults for T from 300 °C to 700 °C (T2) and, finally, thermal faults for T > 700 °C (T3).

Various traditional techniques have been developed to interpret the results of DGA of transformer oil^13,14. The most popular use gas concentration ratios of Dornenburg¹⁵, Rogers¹⁶, and IEC 60599 (1978)¹⁷, or graphical methods of Duval employing percentages of concentration ratios such as the triangle¹⁸ and the pentagon¹⁹. Although these techniques are simple and easy to implement, they have some drawbacks. First, they use only specific gas ratios. Their accuracy remains limited and are very sensitive to DGA data uncertainties. For instance, Duval’s triangle method showed certain PD detection failures and some interferences between thermal and electrical faults. IEC 60,599 technique presented some interferences between D1 and D2 faults. Except T1, the effectiveness of Rogers' methods has not been demonstrated for the other faults⁶. Finally, Dornenburg's method considers only three faults: thermal decomposition, partial discharges or low energy corona, and high energy electric arcs¹⁵.

Recent artificial intelligence and meta-heuristic approaches have been integrated with traditional methods to overcome such difficulties and improve the transformer oil diagnostic by DGA. Benmahamed et al. have developed two algorithms to improve the classification rate of the Duval pentagon (initially at 80%). The first (Duval pentagon-SVM-PSO) combines the Duval pentagon and support vector machines (SVM), whose parameters have been optimized by the particle swarm optimization technique, PSO. The second (Duval pentagon-KNN) combines Duval pentagon and the K-nearest neighbors (kNN) algorithm.

The accuracy rate of the first algorithm is 88% compared to 82% for the second²⁰. In another research work, Benmahamed et al. established two classifiers KNN and Naïve Bayes (NB) to diagnose transformer oil by DGA. The KNN algorithm provided the highest accuracy rate of 92%²¹. Furthermore, Benmahamed et al. developed two classifiers. The first is Gaussian and the second (SVM-Bat) uses Support Vector Machines (SVM), whose parameters have been optimized by the Bat algorithm. The SVM-Bat accuracy rate is 93.75% against 69.37% for the Gaussian⁵. Kherif et al. developed an algorithm combining KNN with the decision tree principle. An accuracy rate exceeding 93% was obtained, demonstrating the effectiveness of the proposed algorithm⁷. Taha and al proposed an approach using the particle swarm optimization and the fuzzy-logic (PSO-FS) to enhance the of Rogers’ four-ratio diagnostic accuracy from 47.19 to 85.65% and IEC 60,599 one from 55.09 to 85.03%²². Ghoneim et al. established an efficient teaching–learning based optimization (TLBO) a model to optimize both gas concentration percentages and ratios. The proposed algorithm allowed obtaining higher diagnostic accuracy (of 82.02%) than the best (78.65%) offered by other DGA techniques presented in the same paper²³. Ghoneim et al. developed a smart fault diagnostic approach (SFDA) integrating Dornenburg, Rogers three and four-ratio, IEC three-ratio and Duval triangle techniques. Using gas concentrations, the SFDA algorithm has been improved by ANN. The SFDA algorithm allowed obtaining 79.6%. This accuracy rate has been improved to 87.8% with the integration of ANN²⁴. It is worth noting that optimized ANNs have never been used in transformer insulating oil diagnostic so far.

In order to improve the accuracy rate of faults detection in power transformers oil by DGA, multilayer artificial neural networks (ANN) are developed, in this investigation. We opt for a multilayer neural network (MLP) comprising an input layer, two hidden layers and an output. Several input vectors are tested, namely the five gases in ppm and in percentage, Dornenberg ratio, Rogers four-ratio and IEC 60,599 three-ratio as well as the combination between the ratios of Rogers and those of Dornenberg, the centers of mass of the triangle and pentagon of Duval, as well as their combination. Various learning algorithms and activation functions are also considered.

To further improve diagnostic rates, neural networks are optimized using the particle swarm technique (PSO). Indeed, different population sizes have been adopted. The performances of these neural networks have been studied in terms of accuracy rate. A total of 481 sample datasets are considered⁵. Two-thirds are used for the training process (so 321 samples), and the rest (160 samples) for the test. The six fault classes (PD, D1, D2, T1, T2, and T3) recommended by IEC 60599 (2007)¹¹ and IEEE Standard C57.104¹² are adopted. A comparative study is carried out between the different neural networks developed.

In this paper, we have demonstrated that the combination of the artificial neural network and the particle swarm optimization leads to global model. This one takes into account classification problem by learning and also optimization by the PSO. It gives significant results versus to the classical methods and we obtain a high level of accuracy.

ANN models for insulating oil diagnostic by DGA

The state of the power insulation system is responsible for determining the lifetime of the transformers. This insulation system is generally exposed to some constraints resulting from overheating, carbonization of the paper, electric arcs and low or high energy discharges. Such faults can accelerate insulation degradation, affecting the transformer's reliability and lifetime. Indeed, early detection of these faults can prevent undesirable abnormal operating conditions or failures of power transformers. The dissolved gas analysis (DGA) technique is considered to be one of the fastest and most economical techniques widely used to diagnose power transformer fault types¹³.

As mentioned above, traditional fault diagnostic techniques in power transformers have generally shown their limitations and inconsistencies. Despite their simplicity, these techniques are not really adopted by the scientific community, due to their low accuracy rate in faults detection, except Duval pentagon method giving acceptable rate faults classification^6,20. That is why artificial intelligence (AI) and/or meta-heuristic approaches can be combined with conventional ones to improve the diagnostic accuracy of power transformer insulating oil further.

This section proposes several artificial neural networks to detect faults in an oil-immersed power transformer. These networks have the same architecture (structure) consisting of a multilayer perceptron (MLP). However, their training algorithms and activation functions are different.

The artificial neural network connection is multilayer and consists of an input, output, and two hidden internal layers. In this structure, neurons belonging to the same layer are not connected. Each layer receives signals from the previous layer and transmits its processing result to the next layer. Thus, the information flows in a one direction, from the input to the output through the hidden layers. Such connection type is called feed-forward artificial neural networks (FFANNs)²⁵.

The number of neurons in the input layer is equal to the number of elements of the input vector denoted B. Nine models with input vectors have been considered, namely:

1st Model: The database comprises the concentrations of the five gases in ppm. In such conditions, the input vector of this model is given by: B = [H₂ CH₄ C₂H₂ C₂H₄ C₂H₆]^T
2nd Model: For the same database, the input vector of this model for each sample can be written in terms in gas concentrations in percentages as follows: B = [%H₂%CH₄%C₂H₂%C₂H₄%C₂H₆]^T, with:
$$\% {\text{H}}_{{2}} {\text{ = H}}_{{2}} /\left( {{\text{H}}_{{2}} + {\text{CH}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{2}} + {\text{C}}_{{2}} {\text{H}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{6}} } \right) \, \times { 1}00$$
(1)
$$\% {\text{CH}}_{{4}} = {\text{ CH}}_{{4}} /\left( {{\text{H}}_{{2}} + {\text{CH}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{2}} + {\text{C}}_{{2}} {\text{H}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{6}} } \right) \, \times { 1}00$$
(2)
$$\% {\text{C}}_{{2}} {\text{H}}_{{2}} = {\text{ C}}_{{2}} {\text{H}}_{{2}} /\left( {{\text{H}}_{{2}} + {\text{CH}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{2}} + {\text{C}}_{{2}} {\text{H}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{6}} } \right) \, \times { 1}00$$
(3)
$$\% {\text{C}}_{{2}} {\text{H}}_{{4}} = {\text{ C}}_{{2}} {\text{H}}_{{4}} /\left( {{\text{H}}_{{2}} + {\text{CH}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{2}} + {\text{C}}_{{2}} {\text{H}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{6}} } \right) \, \times { 1}00$$
(4)
$$\% {\text{C}}_{{2}} {\text{H}}_{{6}} = {\text{ C}}_{{2}} {\text{H}}_{{6}} /\left( {{\text{H}}_{{2}} + {\text{CH}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{2}} + {\text{C}}_{{2}} {\text{H}}_{{4}} + {\text{C}}_{{2}} {\text{H}}_{{6}} } \right) \, \times { 1}00$$
(5)
3rd Model: Dornenburg ratios is one of the first techniques introduced for the power transformer oil diagnostic to interpret the results of dissolved gas analyzes¹⁵. In this model, we use four ratios of gas, in ppm, consisting of B = [CH₄/H₂ C₂H₂/C₂H₄ C₂H₄/C₂H₆ C₂H₂/CH₄]^T
4th Model: The four ratios of Rogers are also considered. For this model, the input vector can be expressed, for each sample (gas in ppm), by Ref.¹⁶ A = [CH₄/H₂ C₂H₂/C₂H₄ C₂H₄/C₂H₆ C₂H₆/CH₄]
5th Model: IEC 60,599 model uses three ratios of gas in ppm as follows¹⁷:
$${\text{B }} = [{\text{CH}}_{{4}} /{\text{H}}_{{2}} {\text{C}}_{{2}} {\text{H}}_{{2}} /{\text{C}}_{{2}} {\text{H}}_{{4}} {\text{C}}_{{2}} {\text{H}}_{{4}} /{\text{C}}_{{2}} {\text{H}}_{{6}} ]^{{\text{T}}}$$
6th Model: Duval triangle model is a graphic representation using the percentage ratios of three dissolved gases: CH₄, C₂H₂ and C₂H₄. These percentages are employed as coordinates (T_x, T_y) to draw the mass center point in the triangle and identify, for each sample, the defect zone in which it is located¹⁸. The input vector for this model is therefore written by: B = [T_x T_y]^T
7th Model: Duval pentagon model is a graphic representation similar to triangle one. Pentagon uses the five dissolved gas in percentages (%H₂%CH₄%C₂H₂%C₂H₄%C₂H₆) to draw the mass center point coordinates (P_x, P_y)¹⁹. The input vector of this model is given by: B = [P_x P_y]^T
8th Model: A combination between the triangle and the pentagon of Duval was proposed for this model. In such conditions, the two mass centers coordinate of both triangle and pentagon of Duval will constitute the input vector, given by: B = [C_x C_y P_x P_y]^T
9th Model: We suggest here another model consisting of combination between Rogers and Dornenburg ratios. The input vector of this model can be written as: B = [CH₄/H₂ C₂H₂/C₂H₄ C₂H₄/C₂H₆ C₂H₂/CH₄ C₂H₆/CH₄]^T

The coordinates T_x and T_y in vector 6 are calculated, for each gas sample, as follows¹⁸:

$$T_{x} = \frac{1}{3A}\sum\limits_{i = 0}^{n - 1} {\left( {x_{i} + x_{i + 1} } \right)\left( {x_{i} y_{i + 1} - x_{i + 1} y_{i} } \right)} \,$$

(6)

$$T_{y} = \frac{1}{3A}\sum\limits_{i = 0}^{n - 1} {\left( {y_{i} + y_{i + 1} } \right)\left( {y_{i} x_{i + 1} - y_{i + 1} x_{i} } \right)} \,$$

(7)

A is the irregular triangle area given by:

$$A = \frac{1}{2}\sum\limits_{i = 0}^{n - 1} {\left( {x_{i} y_{i + 1} - x_{i + 1} y_{i} } \right)} \,$$

(8)

The coordinates x_i and y_i(i = 0 to n − 1 with n = 3 is the number of gas in percentages) are computed as follows:

$${\text{x}}_{0} = \, \% {\text{CH}}_{{4}} {\text{cos}}\left( {\pi /{2}} \right)$$

(9)

$${\text{x}}_{{1}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{4}} {\text{cos}}\left( {\pi /{2 } + \, \alpha } \right)$$

(10)

$${\text{x}}_{{2}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{2}} {\text{cos}}\left( {\pi /{2 } + { 2}\alpha } \right)$$

(11)

$${\text{y}}_{0} = \, \% {\text{CH}}_{{4}} {\text{sin}}\left( {\pi /{2}} \right)$$

(12)

$${\text{y}}_{{1}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{4}} {\text{sin}}(\pi /{2 } + \alpha )$$

(13)

$${\text{y}}_{{2}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{2}} {\text{sin}}(\pi /{2 } + { 2}\alpha )$$

(14)

where α = 2π/3.

For the 7th Model, each sample, the coordinates P_x and P_y are computed by:

$$P_{x} = \frac{1}{6A}\sum\limits_{i = 0}^{n - 1} {\left( {x_{i} + x_{i + 1} } \right)\left( {x_{i} y_{i + 1} - x_{i + 1} y_{i} } \right)} \,$$

(15)

$$P_{y} = \frac{1}{3A}\sum\limits_{i = 0}^{n - 1} {\left( {y_{i} + y_{i + 1} } \right)\left( {y_{i} x_{i + 1} - y_{i + 1} x_{i} } \right)} \,$$

(16)

The pentagon surface Air given by:

$$A = \frac{1}{2}\sum\limits_{i = 0}^{n - 1} {\left( {x_{i} y_{i + 1} - x_{i + 1} y_{i} } \right)} \,$$

(17)

The parameters x_i and y_i (i = 0 to n − 1 with n = 5 is the gas number) are expressed by:

$${\text{x}}_{0} = \, \% {\text{H}}_{{2}} {\text{cos}}\left( {\pi /{2}} \right)$$

(18)

$${\text{x}}_{{1}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{6}} {\text{cos}}\left( {\pi /{2 } + \, \alpha } \right)$$

(19)

$${\text{x}}_{{2}} = \, \% {\text{CH}}_{{4}} {\text{cos}}\left( {\pi /{2 } + { 2}\alpha } \right)$$

(20)

$${\text{x}}_{{3}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{4}} {\text{cos}}\left( {\pi /{2 } + { 3}\alpha } \right)$$

(21)

$${\text{x}}_{{2}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{2}} {\text{cos}}\left( {\pi /{2 } + { 4}\alpha } \right)$$

(22)

$${\text{y}}_{0} = \, \% {\text{H}}_{{2}} {\text{sin}}\left( {\pi /{2}} \right)$$

(23)

$${\text{y}}_{{1}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{6}} {\text{sin}}(\pi /{2 } + \alpha )$$

(24)

$${\text{y}}_{{2}} = \, \% {\text{CH}}_{{4}} {\text{sin}}(\pi /{2 } + { 2}\alpha )$$

(25)

$${\text{y}}_{{3}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{4}} {\text{sin}}(\pi /{2 } + { 3}\alpha )$$

(26)

$${\text{y}}_{{2}} = \, \% {\text{C}}_{{2}} {\text{H}}_{{2}} {\text{cos}}(\pi /{2 } + { 4}\alpha )$$

(27)

where α = 2π/5.

Other possible combinations of the above technique have also been proposed to give strong credibility to the obtained results. Two combinations are given below.

Using a large number of hidden layers is not recommended. Most classification standards problems use only one or at most two hidden layers²⁶. Such ascertainments have been confirmed during our modeling. After several attempts, the best results have been achieved for two hidden layers of ten neurons of each. In addition, we have considered one output delivering, for each gas sample, a single fault (PD, D1, D2, T1, T2 or T3). Indeed, we introduce only the number of input neurons varying from 2 to 5 according to the elements number of the input vector. Thus, the topology of the multilayer neural networks we adopted in this work is shown in Fig. 1.

The weighing parameters of the layers are calculated using learning algorithms. Many algorithms are proposed in the literature^25,27. We can find the supervised and unsupervised ones. In this paper, we are interested by the supervised ones. Supervised learning is done by introducing pairs of inputs and their desired outputs. This learning uses an optimization criterion allowing it to find optimal synaptic weights giving the desired behavior using random samples. Several learning techniques are adopted to readjust weights²⁷. One of the most widespread algorithms is the “Back propagation”. Unfortunately, this algorithm suffers from the local minimum problem. A variant of the previous method consists of choosing an appropriate displacement to accelerate the convergence of the algorithm, which then leads to fast back propagation with momentum. There is another version called Robust Back propagation applicable in the stochastic case. In order to improve the choice of the direction to take in the weight space, we use to second-order optimization methods of the objective function specified by the Hessian.

Many learning algorithms, existing in the Matlab toolbox, have been tested. We have chosen three algorithms using the back-propagation of the quadratic error and only one using back-propagation of the gradient of the quadratic error²⁷. This latter is between the response calculated by the network and the desired one.

In fact, back-propagation or gradient back-propagation algorithms are the most widely used models in supervised training. For this, several models are presented in the Matlab toolbox. These models are used to adjust the weights and biases satisfying the quadratic error or its gradient between the output value and the desired one reaches minimum value for every gas sample. The selected training algorithms are as follows:

Levenberg–Marquardt back-propagation algorithm (trainlm).
One-step secant back-propagation algorithm ( trainoss).
Resilient backpropagation algorithm (trainrp).
Scaled conjugate gradient back-propagation algorithm ( trainscg).

The choice of a specific algorithm depends on the input vector and the cost function. There is no theoretical method to select one algorithm versus another. In our case, we have used four training algorithms for different DGA techniques.

The activation function transforms the unbounded signal into a bounded one. This function is chosen non-decreasing monotonic. Increasing the input can only increase the output or keep it constant. Choosing a linear activation function makes calculation easier, but the neuron loses its robustness. A nonlinear activation function increases the network's ability to approximate complex functions. There are three categories of activation functions. The first one allows to distinguish between differentiable functions (sigmoid, tangent, hyperbolic) and non-differentiable ones (threshold function, thresholding). The second category concerns the functions that have significant values around zero, and significant values far from this one. The third ones deal with difference between positive functions and functions with zero mean (0 and 1 or – 1 and 1).

Five activation (transfer) functions are selected from the Matlab toolbox: softmax, radbas, purelin, logsig and poslin. Several attempts have been made to obtain the best diagnostic rates. The best combinations have been found when using softmax in the second hidden layer and purelin in the output layer. Indeed, the five aforementioned activation functions are applied only in the first hidden layer, maintaining, softmax and purelin for the second hidden layer and the output one respectively³⁰.

The definitions of the selected activation functions are³⁰:

Normalized Exponential (softmax): It is a normalized function, which takes as input vector B of m elements and gives a vector of K strictly positive numbers whose sum equals 1. This function is defined by:
$$f(x_{j} )\, = \,\frac{{e^{{x_{j} }} }}{{\sum\nolimits_{i = 1}^{m} {e^{{x_{i} }} } }}\,\,\,{\text{with}}\,\,\,j = 1, \, 2, \, \ldots ,m$$
(28)
Radial Basic (radbas): it consists of the radial basic transfer function of Gaussian type, given by:
$$f(r)\, = \,e^{{ - r^{2} }}$$
(29)
r being a real replacing the Euclidean distance neuron-center³⁰.
Linear (purelin): this activation function is linear.
Hyperbolic Tangent Sigmoid (tansig): It consists of the hyperbolic tangent sigmoid activation function defined by:
$$f(x)\, = \,\frac{2}{{1\, + \,e^{ - 2x} }} - 1$$
(30)
Rectified Linear Unit Layer (poslin). It is a rectified linear unit layer activation function. For an input value x, this function can be expressed by:
$$f(x)\, = \,\left\{ {\begin{array}{*{20}c} {x,} & {x\, \ge \,0} \\ {0,} & {x < \,0} \\ \end{array} } \right.$$
(31)

The choice of the activation functions depends on the input data. These functions modify the data values from the input through the output. The combination of them leads to avoid the elimination of the information. So, it is necessary to proceed by some beginning choice of these functions. According to our experience, it is more convenient to make a combination of these functions from the input layer to the output one. In the hidden layer, a smoothie function is recommended to keep the information of the signal without truncation. This is very important for decision process and exploration of all the range of the input data.

ANN-PSO models for insulating oil diagnostic by DGA

Our optimization problem aims to find the best solution, consisting of the global optimum, among a set of solutions belonging to the search space, by minimizing the MSE as function objective.

In fact, meta-heuristic optimization methods have the advantage of being adapted for a wide range of problems without major modification of their algorithms²⁸. These methods are based on populations of solutions²⁹. They are often inspired by natural processes, in particular by the theory of evolution in animal and insect societies which relate to evolutionary biology such as Genetic Algorithms (GA)³⁰, or to the ethological theory such as Particle Swarm Optimization (PSO)³¹, Ant Colony Optimization (ACO)³², Social Spiders Optimization (SSO)^33,34.

In our investigation, we opted for the PSO technique. To this end, we first present its principle, elements and parameters. We subsequently present the approach undertaken for optimizing the training of the ANN using PSO.

Particle swarm optimization is based on a homogeneous set of particles, initially arranged randomly. These particles move in the search space and each constitutes a potential solution. Each particle memorizes its best visited solution and communicates with the nearby particles. Thus, the particle will follow a trend based, on one hand, on its desire to return to its solution optimal, and, on other hand, on its mimicry with respect to the solutions found in its neighborhood on other hand. Indeed, from the local optima, the set of particles converges towards the optimal global solution of the treated problem³⁵.

To be able to apply the PSO algorithm, one must define a search space of the particles and an objective function to be optimized. The principle is to move these particles to find the optimum. Each particle contains³⁶:

A position characterized by its coordinates in the definition space of the objective function: ${X}_{i}=({X}_{i1},\dots ,{X}_{ij},\dots ,{X}_{ik})$
A velocity allowing the particle to change position during the iterations according to its best neighborhood, its best position, and its previous position: ${V}_{i}={(V}_{i1},\dots ,{V}_{ij},\dots ,{V}_{ik})$
A neighbourhood constituted by the set of particles directly interacting on the particle, in particular the one having the best value of the objective function.

At any moment, each particle knows:

Its best visited position P_i(t) through its coordinates and the value of the objective function;
The position of the best neighbour of the swarm g_i(t) which corresponds to the optimal scheduling;
The value assigned to the objective function f (P_i(t)) at each iteration following the comparison between the value of this function given by the current particle and the optimal one.

The particle swarm algorithm is based on:

Population size: According to Van den Bergh and Engelbrecht³⁵, increasing swarm size slightly improves the optimal value. Eberhart and Shi³⁶ illustrated that population size has a minimal effect on the performance of the EP method. The same observation was made by Nezhad and his colleagues³⁷. In our investigation, various population sizes N_p, namely N_p = 40, 80, 100 and 120.
Initialization of position and velocity: Before generating the population of particles, it is necessary to define the search space for them and place them randomly according to a uniform distribution. In a d-dimensional search space, the particle i of the swarm is modelled by its position vector X_i according to Eq. (33), and by its velocity vector V_i according to Eq. (32)³⁸.
Position and velocity update: The quality of the particle position is determined by the value of the objective function. Along its path, this particle memorizes its best position, which we note p_best = (p_i1,…, p_i2,…, p_id). Furthermore, the best position found for its neighbouring particles is g_best = (g_i1,…, g_i2,…, g_id). At each iteration, the particles update their positions and velocities taking into account their best positions and those of its neighbourhood³⁹.

The new velocity is calculated by Ref.⁴¹:

$${V}_{i}\left(k+1\right)={wV}_{i}\left(k\right)+{{c}_{1}r}_{1}\left(k\right)\left({p}_{best}\left(k\right)-{X}_{i}\left(k\right)\right)+{{c}_{2}r}_{2}\left(k\right)\left({g}_{best}\left(k\right)-{X}_{i}\left(k\right)\right)$$

(32)

Therefore, the new position velocity is calculated as follows⁴⁰:

$${X}_{i}\left(k+1\right)={X}_{i}\left(k\right)+{V}_{i}\left(k+1\right)$$

(33)

X_i(k), X_i(k + 1): the positions of the P_i particle at iteration k and k + 1 respectively ; V_i(k), V_i(k + 1): the velocities the P_i particle at iteration k and k + 1 respectively; p_best(k + 1): the best position obtained by the P_i particle at iteration k + 1; g_best(k + 1): the best position obtained by the swarm at iteration k + 1; c₁ et c₂: constants representing the acceleration coefficients; r₁ et r₂: random numbers; w(k): inertial weight.

Methods

In the previous ANN presentation, each neural network delivers the best classification rate by minimizing the root mean quadratic error, MSE, estimated from the computed outputs and the desired ones, as a function of synaptic weights and biases.

The biases consist of b_h1, b_h2 and b_o. b_h1 and b_h2 are vectors of 10 elements each. The first is presented to the first hidden layer, and the second to the second hidden layer. b_o is an additional scalar added to the output layer.

The synaptic weights consist of w_i between the input layer and the first hidden layer, w_h inter-hidden layers, and finally w_o between the second hidden layer and the output one.

w_i is a matrix having a number of rows equal to that of input vector elements (m varying from 2 to 5). Moreover, the number of columns of w_i is equal to the number of neurons of the first hidden layer, i.e. q = 10. In this hidden layer, we applied an activation function, f₁. The five previous functions (softmax, radbas, purelin, tansig and poslin) have been tested for this latter.

w_h linking the first hidden layer to the second, is also a square matrix of dimension qxq (i.e. 10 × 10). The activation function, denoted f₂, applied in the second hidden layer, is sotfmax. As previously indicated, this activation function has been kept unchanged throughout the diagnostic process.

Likewise, w_o connecting the second hidden layer and the output layer, is also a matrix of dimension 1 × 10 (i.e. a vector of 10 elements). The activation function, denoted f₃, which has been adopted in the output layer is purelin. This latter has also kept unchangeable. In such conditions, the output value is given by:

$$Y= {f}_{3}(\sum {w}_{o}*{f}_{2}( \sum {w}_{h}*{f}_{1}(\sum {w}_{i}*B)+{b}_{h1})+{b}_{h2})+{b}_{o}$$

(34)

f₁, f₂ and f₃ are the activation functions, b_h1, b_h2 and b_s the biases and B the input vector.

The mean value of the mean squared error has been used as the objective function giving by the following expression:

$$MSE={\frac{1}{TS}\sum_{PT}{(Y}_{d}-Y)}^{2}$$

(35)

where TS is the total number of training samples (equal to 321), Y is the computed network output, and Y_d is the desired output.

Our contribution in this paper that we have proposed the new model for DGA by combining the artificial neural network and particle swarm optimization algorithm which gives significant results with very high accuracy. This hybridization has never been developed until now.

Taking into consideration the mathematical calculation of the PSO model in the previous section, the convergence of the PSO towards the global optimum depends on the following parameters:

Inertia factor: The inertia factor w allows controlling the impact of previous velocities on the actual one⁴¹.
- If w < < 1, rapid changes of direction are possible; little of the previous velocity is preserved;
- If w = 0, the particle moves in each step without knowledge of the previous velocity; the concept of velocity is completely lost;
- If w > 1, the particles barely change their direction, which results a great area of exploration and a hesitation against convergence towards the optimum.
Acceleration coefficients c₁ et c₂: The constant c₁ affects the acceleration of the particle towards its best performance (cognitive behaviour of the particle). Otherwise, c₂ allows the particle to accelerate towards the Global Best (social ability of the particle)⁴². These constants belong to the interval [0; 2]³⁹. In our investigation, c₁ equals 2 and c₂ equals 1.
Random numbers r₁ et r₂: At each iteration, the two parameters r₁(k) and r₂(k) are generated randomly in the interval [0; 1] by a uniform distribution⁴¹.
Stopping criterion: In order to converge towards the global optimal solution, different stopping criteria can be selected. The most commonly used consist of⁴²:
- Static criterion: it is generally based on the maximum number of iterations;
- Dynamic criterion: it refers to the stagnation of velocity.

The static criterion has been adopted in our study. The number of iterations has been fixed to 10,000.

A high value of the inertia factor facilitates exploration (the search for new sectors). A low value facilitates exploitation (favoring the current sector of research)⁴². Better convergence, providing the balance between exploration and exploitation. Furthermore, it is possible to vary this factor during the iterations according to Eq. (36). Good results were obtained for an increase value from 0.4 to 0.9.

$$w\left(k\right)={w}_{min}+\left({w}_{max}-{w}_{min}\right) \left(\frac{k}{{max}_{iter}}\right)$$

(36)

w_max: the maximum value of w (= 0.9); w_min: the minimum value of w (= 0.4); iter: the current iteration; max_iter: the maximum number of iterations.

Before applying the PSO to optimize the ANN training, first, it is necessary to choose the architecture (topology) of the ANN (of two hidden layers with 10 neurons for each layer), the training algorithms, the activation functions, PSO parameters (population sizes, maximum number of iterations, variables to optimize, etc.), the database as well as the number of samples reserved for training and testing. Next, it is necessary to determine the objective function to be optimized. This function consists of the mean square error given by Eq. (35).

The PSO algorithm is employed in training the ANN to determine the set of parameters w and b. The total number (corresponding to search space dimension) of these parameters can be determined by the following equation:

$${\varvec{N}}=\left({{\varvec{s}}}_{{\varvec{i}}}\boldsymbol{*}{\varvec{q}}+{{\varvec{b}}}_{{\varvec{h}}1}\right)+\left({\varvec{q}}\boldsymbol{*}{\varvec{q}}+{{\varvec{b}}}_{{\varvec{h}}2}\right)+{\varvec{q}}+\boldsymbol{ }{{\varvec{b}}}_{{\varvec{o}}}$$

(37)

s_i: input vector dimension.

The execution of the ANN-PSO algorithm is carried out in accordance with the following steps:

Step 1: Upload the data training set and the data test one.
Step 2: Define the architecture of ANN: number of hidden layers, neurons number, train algorithm and activation function.
Step 3: Determine the PSO algorithm parameters and randomly generate p particles.Each one containsN values of weights and biases, and generate, then after, N_pANN models.
Step 4: Train the N_p ANN and calculate objective functionMSE(p values) using weighs and biases generated by PSO.
Step 5: Select the best solution and update it if it is different from the previous iteration.
Step6: Check the criterium iteration.
Step 7: If the condition of step 6 is not verified, return to step 4 with updating the particles velocities and positions. Else, if the condition of step 6 is verified, the optimal parameters are used to test the ANN.

We have compared in the same conditions, the performance of the two algorithms. Indeed, the same multilayer topology used has been kept for this part. Also, we have adopted the input vector 9 (of the coordinates of the two centers of mass of triangle 1 and pentagon I of Duval) offering the best classification performance of 90% for neural network. This result was obtained using trainlm as training algorithm, and poslin, softmax and purelin as activation functions respectively in the first hidden layer, the second hidden one and in the output one. This training algorithm-activation functions combination has been kept in the second algorithm, in order to further improve the fault classification rate. Also, we have considered the same database of 481 samples including 321 samples (set of 66%) for training and 160 (set of 33%) for testing, the same gases (H₂, CH₄, C₂H₂, C₂H₄ and C₂H₆), and the same defects (PD, D1, D2, T, T2, and T3).

In order to ensure the convergence of the objective function (the mean square error) towards an optimum, four population sizes of the particle swarm have been adopted, namely 40, 80, 100 and 120.

The search space dimension, corresponding to the number of parameters w and b,has been fond equal to N = 171. The parameters of the PSO algorithm have been set as described in the Table 1.

Table 1 PSO algorithm parameters values.

Full size table

Results

For ANN results, 180 programs of neural networks have been developed from the nine input vectors, four training algorithms and five activation functions. The database contains 481 samples. For each network (model), 66% (i.e. 321 gas samples) have been selected for training and the rest (160 samples) for the test.

The performance of each neural network has been evaluated in terms of classification or diagnosis rate. For this, the number of iterations adopted is 1000. Each neural network has been executed 50 times, and the best diagnosis rate has been recorded.

The obtained results are presented as histograms of Figs. 2, 3, 4 and 5. These figures illustrate the different diagnostic rates as a function of the activation function of the first hidden layer, for different training algorithms trainlm, trainrp, trainoss and trainscg, respectively. Note that the softmax and purelin have been used as activation functions in the second hidden layer and the output one, respectively.

In fact, the four figures were drawn using twenty tables (five tables per figure). To avoid burdening the manuscript, we prefer to present only one Table 2 employing purlin algorithm for the input layer (for the first algorithm Purlin of Fig. 2). Such tables provide more details of the results in terms of activation functions for different input layers, accuracies and input vectors. This allows a better comparison between the considered models.

Table 2 Details for Purelin algorithm by ANN in Fig. 2.

Full size table

We are interested, through the results of Figs. 2, 3, 4 and 5, to determine the highest classification rate for each model. The input vectors can be classified in decreasing order of the classification rate (i.e. from best to bad) as follows:

8th Model gives a maximum rate of 90%, i.e. 144 faults well classified on 160 (reserved for the test).
2nd Model gives a maximum rate of 89.375% corresponding to 143 well-classified faults.
1rt Model gives a maximum rate of 77.5%, i.e. 124 well-classified faults.
6th Model gives a maximum rate of 60.625% is 97 well-classified faults.
3rd Model and 5th model of IEC 60,599 ratios have given a maximum rate of 59.375%, so 95 are well-classified faults.
9th Model has given a maximum rate of 58.125%, i.e., 93 well-classified faults.
4th Model gives a maximum rate of 54.375%, corresponding to 87 well-classified faults.

In order to provide all information of such classification, we present in the Table 3 the accuracy rate in descending order (from best to bad) for all input vectors with the activation functions of the first hidden layer as well as the training algorithms. Note that the activation functions in the second hidden layer and the output one are softmax and purelin respectively.

Table 3 Accuracy rates in descending order with activation functions of the first hidden layer and training algorithms.

Full size table

For PSO results, the variation of the objective function, consisting of the mean squared error (MSE), as a function of the number of iterations, for different population sizes is presented in Fig. 6.

Depending on the population size, the minimum mean square error, the precision rate and the number of well-classified faults, are summarized in Table 4.

Table 4 Diagnosis accuracy with several population sizes.

Full size table

Discussion

The training algorithm trainlm contributed in the elaboration of these results by approximately 78%, since it has been used 7 times on 9. Otherwise, the activation functions purelin and poslin have participated in these results of about 56% (5 on 9) and 44% (4 on 9) respectively. Indeed, for such classification type, it is recommended to use trainlm as training algorithm, purelin or poslinas as activation function in the first hidden layer, softmax for the second hidden one and purelin in the output one.

The best classification rate of 90%, i.e. 144 well-classified faults on 160), has been obtained by presenting the input vector consisting of the coordinates of the two centers of mass of the triangle and the pentagon Duval, using trainlm as training algorithm and poslin, softmax and purelin as activation functions in the first hidden, the second hidden and the output layers respectively. This accuracy rate will be improved, in the same conditions, using particle swarm optimization (PSO) technique.

For a given population size, and over the iterations, Fig. 6 shows that the MSE (objective function) decreases abruptly from 0 to 120 iterations, and slowly elsewhere, tending practically towards a constant level. This latter is called minimum MSE which could represent the global MSE. It changes from one population to another, as illustrated in Table 3. In fact, with the progressive increase in the population size from 40 to 120, the global MSE slightly decreases from 0.0216 to 0.0066, while the classification rate and therefore the number of well-classified faults slightly increases from 154 to 159 reserved for the test. Indeed, the 120-population size allows obtaining 159 well-classified faults out of 160 (reserved for the test) with an accuracy rate of 99.375% against 90% when using ANN alone.

Conclusion

In this investigation, we have developed intelligent techniques using multilayer feed-forward ANN-based models for fault detection in an oil-immersed power transformer, by analysis dissolved gases. Nine input vectors have been used. Otherwise, each hidden layer contains ten neurons. Finally, the output having only one neuron delivers a single fault for each gas sample.

Using back-propagation, four training algorithms have been chosen, namely trainlm, trainoss, trainrp and trainscg. In addition, five activation functions, consisting in softmax, radbas, purelin, tansig and poslin, have been selected. These functions have been applied for the first hidden layer, while softmax was maintained for the second hidden layer, and purelin for the output layer. The used database contains 481 samples of which 321 have been selected for training and the rest (160 samples) for testing. Inspired by IEC and IEEE standards, six faults, consisting in PD, D1, D2, T1, T2, and T3, have been adopted. The best classification rate of 90% (i.e. 144 well-classified faults out of 160) has been obtained when using the eighth input vector (formed from the coordinates of the two centers of mass of triangle 1 and pentagon I of Duval) and applying trainlm as learning algorithm.

In order to further improve the best classification rate, the corresponding multilayer network has been optimized using particle swarm technique for various population sizes, namely 40, 80, 100 and 120. The mean square error (MSE) represents the objective function to be minimized for 10,000 iterations. Obviously, the same database with the same number of samples for training and testing, and the same faults has been kept. For a given population size, the mean square error (the objective function) decreases abruptly for iterations ranging from 0 to 120, and slowly elsewhere, tending towards a constant level representing the minimum mean square error (MMSE). Furthermore, the gradual increase in the population size from 40 to 120 results in a slight decrease in the minimum squared error from 0.0216 to 0.0066, and a slight increase in the classification rate of 96.250 (corresponding to 154 faults well classified out of 160) at 99.375% (with 159 well classified faults). In other words, the best fault classification rate of 99.375% has been obtained for 120 population size. In these conditions, the ANN-PSO algorithm was able to detect 159 faults out of 160 reserved for the test.

Finally, our contribution in the paper is to present a new approach by combining learning and particle swarm optimization in dissolved gas analysis field. We have demonstrated that this technique is leading to significant results comparing them to the existing ones in the previous research. We have obtained high level of decision about the quality of the transformer oil by using different methods according to IEC and IEEE standards.

In high voltage distribution, this new model facilitates the maintenance process and avoids transformer failure. It gives us the instantaneous decision about the characteristics of the failure and time life of the transformer. This latter may include smart sensors linked to digital process unit with supervisory and data acquisition system. The advantage to use this model is to control in real time the process by minimum time calculation of the decision. It leads time saving and minimum cost.

It is worth noting that the best results are obtained with the ANN-PSO model. This hybridization is the key to reach this objective. The choice of the architecture of the neural network was the crucial phase in our study. Also, the combination of different training algorithms and activation functions allows obtaining the best model with several tests. The ANN-PSO model needs many calculations to have the convergence of the algorithm. However, it is necessary to match the obtained models to the corresponding oil analysis technique. As input vector, the graphical methods of Duval give decision with best score.

In order to use our technique in other field, it is necessary to adapt the architecture of the neural network to the problem. It means that we can choose the input layer and the output layer according to the proposed problem. After that, the number of the hidden layer can be fixed according to the accuracy of the result. Finally, it is important to have a deep knowledge of the application that we want to use the model developed in this paper.

Data availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Sun, L. et al. An Integrated decision-making model for transformer condition assessment using game theory and modified evidence combination extended by D numbers. Energies 9(9), 697 (2016).
Article Google Scholar
Abdi, S., Boubakeur, A., Haddad, A. & Harid, N. Influence of artificial thermal aging on transformer oil properties, electric power components and systems. Electr. Power Compon. Syst. 39, 1701–1711 (2011).
Article Google Scholar
N’cho, J. S., Fofana, I., Hadjadj, Y. & Beroual, A. Review of physicochemical-based diagnostic techniques for assessing insulation condition in aged transformers. Energies 9(5), 697 (2016).
Article Google Scholar
Guerbas, F., Zitouni, M., Boubakeur, A. & Beroul, A. Barrier effect on breakdown of point-plane oil gaps under alternating current voltage. IET Gener. Transm. Distrib. 4(11), 1245–1250 (2010).
Article Google Scholar
Benmahamed, Y., Kherif, O., Teguar, M., Boubakeur, A. & Ghoneim, S. S. M. Accuracy improvement of transformer faults diagnostic based on DGA data using SVM-BA classifier. Energies 14, 2970 (2021).
Article CAS Google Scholar
Ghoneim, S. S. M. & Taha, I. B. M. A new approach of DGA interpretation technique for transformer fault diagnosis. Int. J. Electr. Power Energy Syst. 81, 265–274 (2016).
Article Google Scholar
Kherif, O., Benmahamed, Y., Teguar, M., Boubakeur, A. & Ghoneim, S. S. M. Accuracy improvement of power transformer faults diagnostic using KNN classifier with decision tree principle. IEEE Access 9, 81693–81701 (2021).
Article Google Scholar
Liang, Y., Wang, Z., Liu, Y. Power transformer DGA data processing and alarming tool for on-line monitors. 2009 IEEE/PES Power Systems Conference and Exposition, Seattle, WA, pp. 1–8, (2009).
Duval, M. Dissolved gas analysis: It can save your transformer. IEEE Electr. Insul. Mag. 5(6), 22–27 (1989).
Article ADS Google Scholar
Duval, M. A review of faults detectable by gas in oil analysis transformer. IEEE Elec. Insul. Mag. 18(3), 8–17 (2002).
Article Google Scholar
IEC 60599, Mineral oil-impregnated electrical equipment in service-guide to the interpretation of dissolved and free gases analysis. International Electrotechnical Commission, (2007).
IEEE Standard C57.104, IEEE guide for the interpretation of gases generated in oil-immersed transformers. (2009).
Abu Bakar, N., Abu-Saida, A. & Islam, S. A review of dissolved gas analysis measurement and interpretation techniques. IEEE Elec. Insul. Maga. 30(3), 39–49 (2014).
Article Google Scholar
CIGRE Technical Brochure #296, Recent Developments in DGA Interpretation. (2006).
Dornenburg, E. & Strittmatter, W. Monitoring oil-cooled transformers by gas analysis. Brown Boveri Rev. 61, 238–247 (1974).
CAS Google Scholar
Rogers, R. R. IEEE and IEC codes to interpret incipient faults in transformers using gas in oil analysis. IEEE Trans. Electr. Insul. 13(5), 348–354 (1978).
Google Scholar
IEC 60599, Interpretation of the analysis of gases in transformers and other oil-filled electrical equipment in service. International Electrotechnical Commission, (1978).
Duval, M. The Duval triangle for load tap changers, non-mineral oils and low temperature faults in transformers. IEEE Electr. Insul. Mag. 24(6), 22–29 (2008).
Article Google Scholar
Duval, M. & Lamarre, L. The duval pentagon—A new complementary tool for the interpretation of dissolved gas analysis in transformers. IEEE Electr. Insul. Mag. 30(6), 9–12 (2014).
Article Google Scholar
Benmahamed, Y., Teguar, M. & Boubakeur, A. Application of SVM and KNN to Duval pentagon 1 for transformer oil diagnosis. IEEE Trans. Dielectr. Electr. Insul. 24(6), 3443–3451 (2017).
Article CAS Google Scholar
Benmahamed, Y., Kemari, Y., Teguar, M., Boubakeur, A. Diagnosis of power transformer oil using KNN and Naïve Bayes classifiers,” Second International Conference on Dielectrics (ICD 2018 Conference), fully sponsored by IEEE Dielectrics and Electrical Insulation Society, Budapest, Hungary, pp. 1–4 (2018).
Taha, I. B. M., Hoballah, A. & Ghoneim, S. S. M. Optimal ratio limits of rogers’ four-ratios and IEC 60599 code methods using particle swarm optimization fuzzy-logic approach. IEEE Trans. Dielectr. Electr. Insul. 27(1), 222–230 (2020).
Article Google Scholar
Ghoneim, S. S. M., Mahmoud, K., Lehtonen, M. & Darwish, M. M. F. Enhancing diagnostic accuracy of transformer faults using teaching-learning-based optimization. IEEE Access 9, 30817–30832 (2021).
Article Google Scholar
Ghoneim, S. S. M., Taha, I. B. M. & Elkalashy, N. I. Integrated ANN-based proactive fault diagnostic scheme for power transformers using dissolved gas analysis. IEEE Trans. Dielectr. Electr. Insul. 23(3), 1838–1845 (2016).
Article Google Scholar
Aldakheel, F., Satari, R. & Wriggers, P. Feed-forward neural networks for failure mechanics problems. Appl. Sci. 11, 6483 (2021).
Article CAS Google Scholar
Ibnu Choldun, M., Santoso, J. & Surendro, K. Determining the number of hidden layers in neural network by using principal component analysis. In Intelligent Systems and Applications. IntelliSys 2019. Advances in Intelligent Systems and Computing Vol. 1038 (eds Bi, Y. et al.) (Springer, 2020).
Google Scholar
Das, G., Pattnaik, P. K. & Padhy, S. K. Artificial neural network trained by particle swarm optimization for non-linear channel equalization. Expert Syst. Appl. 41(7), 3491–3496 (2014).
Article Google Scholar
X. S. Yang, "Engineering optimization: An introduction with metaheuristic applications", Department of Engineering, University of Cambridge, U.K, p. 173–179, Wiley & Sons, Inc., New Jersey, (2010).
Alik, B., Teguar, M. & Mekhaldi, A. Minimization of grounding system cost using PSO, GAO, and HPSGAO techniques. IEEE Trans. Power Deliv. 30(6), 2561–2569 (2015).
Article Google Scholar
Vallée, T., Yıldızoğlu, M. Présentation des algorithmes génétiques et de leurs applications en Economie. J. Articles, Vol. 1.2, (2003).
Eberhart, R., Kennedy, J. A new optimizer using particle swarm theory. In: Proc. Sixth International Symposium on Micro Machine and Human Science (MHS'95), Nagoya, Japan, pp. 39–43, 1995.
Dorigo, M. & Gambardella, L. M. Ant colony system: A cooperative learning approach to the traveling salesman problem. IEEE Trans. Evol. Comput. 1(1), 53–66 (1997).
Article Google Scholar
Gyanesh, D., Prasant, K. P. & Sasmita, K. P. Artificial neural network trained by particle swarm optimization for non-linear channel equalization. Expert Syst. Appl. Int. J. 41(7), 3491–3496 (2014).
Article Google Scholar
Khan, K. & Sahai, A. A comparison of BA, GA, PSO, BP and LM for training feed forward neural networks in E-learning context. Int. J. Intell. Syst. Appl. 4(7), 23–29 (2012).
Google Scholar
Van den Bergh, F. & Engelbrecht, A. P. A Cooperative approach to particle swarm optimization. IEEE Trans. Evol. Comput. 8(3), 225–239 (2004).
Article Google Scholar
Shi, Y., Eberhart, R. A modified particle swarm optimizer, Evolutionary Computation Proceedings, IEEE International Conference on Evolutionary Computation Proceedings, IEEE World Congress on Computational Intelligence, Anchorage, AK, USA, pp. 69–73 4–9, (1998).
Nezhad, N. K., Fallahi, M. H. & Dozein, M. G. An optimal design of substation grounding grid considering economic aspects using particle swarm optimization. Res. J. Appl. Sci. Eng. Technol. 6(12), 2159–2165 (2013).
Article Google Scholar
Kennedy, J., Eberhart R. Particle swarm optimization. In: IEEE International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, pp. 1942–1948, (1995).
Del Valle, Y., Venayagamoorthy, G. K., Mohagheghi, S., Hernandez, J.-C. & Harley, R. G. Particle swarm optimization: Basic concepts, variants and applications in power systems. IEEE Trans. Evol. Comput. 12(2), 171–195 (2008).
Article Google Scholar
Eberhart, R. & Shi, Y. Particle swarm optimisation: Developments, applications and resources. Congress Evol. Computat. 1, 81–86 (2001).
Google Scholar
Clerc, M. & Siarry, P. Une nouvelle métaheuristique pour l’optimisation difficile : la méthode des essaims particulaires. EDP Sci. 3(7), 13 (2004).
Google Scholar
Parsopoulos, K. E., Vrahatis, M. N. Particle Swarm Optimization and Intelligence: Advances and Applications. Inf. Sci. Ref. New York, USA, (2010).

Download references

Acknowledgements

The authors extend their appreciation to Taif University, Saudi Arabia, for supporting this work through the project number (TU-DSPP-2024-14).

Funding

This research was funded by Taif University, Taif, Saudi Arabia (TU-DSPP-2024-14).

Author information

Authors and Affiliations

Laboratoire des Systèmes Electriques et Industriels, Université des Sciences et de la Technologie Houari Boumediene, BP 32Bab Ezzouar, 16111, Algiers, Algeria
Fettouma Guerbas, Youcef Teguar & Rayane Amine Dahmani
Laboratoire de Recherche en Electrotechnique, Ecole Nationale Polytechnique, 10 Rue des Frères Oudek, El Harrach, 16200, Algiers, Algeria
Youcef Benmahamed & Madjid Teguar
Centre of Research Impact and Outcome, Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, Punjab, 140401, India
Enas Ali
Department of Electrical Engineering, Graphic Era (Deemed to Be University), Dehradun, 248002, India
Mohit Bajaj
Hourani Center for Applied Scientific Research, Al-Ahliyya Amman University, Amman, Jordan
Mohit Bajaj
Graphic Era Hill University, Dehradun, 248002, India
Mohit Bajaj
Department of Electrical and Electronics, Faculty of Engineering, Alberoni University, Kapisa, Afghanistan
Shir Ahmad Dost Mohammadi
Department of Electrical Engineering, College of Engineering, Taif University, P.O. BOX 11099, 21944, Taif, Saudi Arabia
Sherif S. M. Ghoneim

Authors

Fettouma Guerbas
View author publications
You can also search for this author in PubMed Google Scholar
Youcef Benmahamed
View author publications
You can also search for this author in PubMed Google Scholar
Youcef Teguar
View author publications
You can also search for this author in PubMed Google Scholar
Rayane Amine Dahmani
View author publications
You can also search for this author in PubMed Google Scholar
Madjid Teguar
View author publications
You can also search for this author in PubMed Google Scholar
Enas Ali
View author publications
You can also search for this author in PubMed Google Scholar
Mohit Bajaj
View author publications
You can also search for this author in PubMed Google Scholar
Shir Ahmad Dost Mohammadi
View author publications
You can also search for this author in PubMed Google Scholar
Sherif S. M. Ghoneim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

FettoumaGuerbas, Youcef Benmahamed, Youcef Teguar, Rayane Amine Dahmani: Conceptualization, Methodology, Software, Visualization, Investigation, Writing- Original draft preparation. Madjid Teguar, Enas Ali: Data curation, Validation, Supervision, Resources, Writing—Review & Editing. Mohit Bajaj, Shir Ahmad Dost Mohammadi and Sherif S. M. Ghoneim: Project administration, Supervision, Resources, Writing—Review & Editing.

Corresponding authors

Correspondence to Mohit Bajaj or Shir Ahmad Dost Mohammadi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Guerbas, F., Benmahamed, Y., Teguar, Y. et al. Neural networks and particle swarm for transformer oil diagnosis by dissolved gas analysis. Sci Rep 14, 9271 (2024). https://doi.org/10.1038/s41598-024-60071-0

Download citation

Received: 14 January 2024
Accepted: 18 April 2024
Published: 23 April 2024
DOI: https://doi.org/10.1038/s41598-024-60071-0

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Neural networks and particle swarm for transformer oil diagnosis by dissolved gas analysis

Subjects

Abstract

Similar content being viewed by others

Discernment of transformer oil stray gassing anomalies using machine learning classification techniques

Application of back propagation neural network in complex diagnostics and forecasting loss of life of cellulose paper insulation in oil-immersed transformers

Damage assessment in structures using artificial neural network working and a hybrid stochastic optimization

Introduction

ANN models for insulating oil diagnostic by DGA

ANN-PSO models for insulating oil diagnostic by DGA

Methods

Results

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Keywords

Comments

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

Discernment of transformer oil stray gassing anomalies using machine learning classification techniques

Application of back propagation neural network in complex diagnostics and forecasting loss of life of cellulose paper insulation in oil-immersed transformers

Damage assessment in structures using artificial neural network working and a hybrid stochastic optimization

Introduction

ANN models for insulating oil diagnostic by DGA

ANN-PSO models for insulating oil diagnostic by DGA

Methods

Results

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

Search

Quick links