An RBF neural network based on improved black widow optimization algorithm for classification and regression problems

Liu, Hui; Zhou, Guo; Zhou, Yongquan; Huang, Huajuan; Wei, Xiuxi

doi:10.3389/fninf.2022.1103295

ORIGINAL RESEARCH article

Front. Neuroinform., 10 January 2023

Volume 16 - 2022 | https://doi.org/10.3389/fninf.2022.1103295

An RBF neural network based on improved black widow optimization algorithm for classification and regression problems

Hui Liu^1,2

Guo Zhou^3*

Yongquan Zhou^1,2,4*

Huajuan Huang^1,2

Xiuxi Wei^1,2

¹College of Artificial Intelligence, Guangxi University for Nationalities, Nanning, China
²Guangxi Key Laboratories of Hybrid Computation and IC Design Analysis, Nanning, China
³Department of Science and Technology Teaching, China University of Political Science and Law, Beijing, China
⁴Xiangsihu College of Guangxi University for Nationalities, Nanning, China

Introduction: Regression and classification are two of the most fundamental and significant areas of machine learning.

Methods: In this paper, a radial basis function neural network (RBFNN) based on an improved black widow optimization algorithm (IBWO) has been developed, which is called the IBWO-RBF model. In order to enhance the generalization ability of the IBWO-RBF neural network, the algorithm is designed with nonlinear time-varying inertia weight.

Discussion: Several classification and regression problems are utilized to verify the performance of the IBWO-RBF model. In the first stage, the proposed model is applied to UCI dataset classification, nonlinear function approximation, and nonlinear system identification; in the second stage, the model solves the practical problem of power load prediction.

Results: Compared with other existing models, the experiments show that the proposed IBWO-RBF model achieves both accuracy and parsimony in various classification and regression problems.

1. Introduction

There are many existing solutions to classification problems. For example, classifiers based on fuzzy logic, advanced and state-of-the-art (SOTA) deep learning, and artificial intelligence (AI) methods. Fuzzy neural classifier is developed for disease prediction in Kumar et al. (2018). Versaci et al. (2022) proposed a fuzzy similarity-based approach for classifying carbon fiber-reinforced polymer plate defects. Belaout et al. (2018), a multiclass adaptive neuro-fuzzy classifier (MC-NFC) was developed for fault detection and classification in photovoltaic array. Semi-Active Convolutional Neural Networks are developed for Hyperspectral Image Classification in Yao et al. (2022). Zhang H. et al. (2022) proposed the convolution neural network method with the attention mechanism for classification of hyperspectral and light detection and ranging (LiDAR) data. Hong et al. (2020) proposed multimodal generative adversarial networks for image segmentation. The classification method adopted in this paper is a hybrid method of black widow optimization (BWO) algorithm and radial basis function neural network (RBFNN), and it is used in several regression problems to verify the performance.

Radial basis function neural network is a kind of artificial neural network (ANN) with activation functions based on Gaussian kernels, which has the advantages of simple structure, easy parameter adjustment and strong generalization ability (Broomhead and Lowe, 1988). The key to improving the performance of ANN is to search for the optimal parameters. Methods for training ANN parameters include gradient-based algorithms and metaheuristic algorithms, the latter of which are more likely to escape from the local optimum (Wu et al., 2016). There has been a great deal of research on combining metaheuristic algorithms with ANNs for a variety of classification and regression problems. For example, particle swarm optimization (PSO) algorithm (Kennedy and Eberhart, 1995), artificial bee colony (ABC) algorithm (Karaboga, 2005), firefly algorithm (FA) (Yang, 2009), differential evolution (DE) (Storn and Price, 1997), genetic algorithm (GA) (Holland, 1975), whale optimization algorithm (WOA) (Mirjalili and Lewis, 2016), bat algorithm (BA) (Yang, 2010), salp swarm algorithm (SSA) (Wang et al., 2017), and flower pollination algorithm (FPA) (Yang, 2012). Kaya (2022) designed quick FPA algorithm training feedforward neural network (FNN) for solving parity problems. Pradeepkumar and Ravi (2017) used PSO algorithm to train the parameters of quantile regression neural network (QRNN) to predict volatility from financial time series. Asteris and Nikoo (2019) combined the ABC algorithm with FNN to predict the vibration period of infilled framed structures. Han et al. (2019) proposed a new model (DE-BP) for predicting cooling efficiency of forced-air precooling systems. Ding et al. (2013) applied GA to optimize the parameters of the Elman neural network, and experimentally verified that the convergence rate and training speed of the model were improved. Li et al. (2019) proposed an extreme learning machine based on WOA algorithm for aging assessment of Insulated gate bipolar transistor modules. Yang et al. (2016) designed a combinatorial model combining DE, BP neural network and adaptive network-based fuzzy inference system (ANFIS), and verified the effectiveness of the hybrid model on the electricity demand forecasting problem. Aljarah et al. (2018a) proposed a new method for optimizing FNN weights by the WOA algorithm. As with other models, the goal of RBFNNs in classification and regression problems is to maximize the classification rate and minimize the error of regression, which is equivalent to metaheuristic algorithm solving the global optimal value problem of the fitness function. Meanwhile, in order to verify the computational performance, practicality and persuasiveness of the hybrid model of metaheuristic algorithm and RBFNNs, some studies applied it not only to traditional classification and regression examples, but also to real problems. For example, Han et al. (2021) performed membrane bioreactor prediction with the help of the accelerated gradient algorithm. Han et al. (2022) introduced the accelerated second-order learning algorithm, and verified the accuracy of the model on issues such as prediction of water quality, nonlinear function approximation and system identification. Hu et al. (2018) proposed improved PSO-RBF and applied it to the operation cost prediction. These applied studies focus on areas that are especially needed in real life, bringing real benefits and outcomes to the economic development. Korürek and Doğan (2010) proposed an ECG beat classification method based on PSO and RBFNN. Jing et al. (2019) proposed the RBF-GA model for structural reliability analysis. Pazouki et al. (2022) designed a hybrid model of FA algorithm and RBFNN to predict the compressive strength of self-compacting concrete. Zhang and Wei (2018) developed improved PSO algorithm and RBFNN to forecast network traffic. Agarwal and Bhanot (2018) utilized FA algorithm to optimize the center vector in the hidden layer neurons of RBFNN and apply it to face recognition. Furthermore, BWO algorithm has the advantages of reliability and fast convergence speed, which promote its wide application in various fields. Micev et al. (2020) proposed adaptive black widow optimization algorithm to estimate synchronous machine parameters. Mukilan and Semunigus (2021) used deep convolution neural network (DCNN) based on the enhanced BWO algorithm for human object detection. The BWO algorithm was also used to optimize parameters for the ANFIS, SVR, and SVM (Katooli and Koochaki, 2020; Memar et al., 2021; Panahi et al., 2021). The application of the BWO algorithm is mainly in optimizing the parameters of neural networks. Analyzing the mathematical model of these algorithms, it is known that adaptive time-varying weight plays a crucial role in training network parameters. Therefore, this paper adopts the improved black widow optimization algorithm with time-varying inertia weight to learn RBFNNs to improve the accuracy and parsimony of the model, and uses its self-adjustment mechanism of local search ability and global search ability to optimize the parameters of RBFNNs. The proposed improved black widow optimization algorithm (IBWO)-RBF model is used for UCI dataset classification, nonlinear function approximation, nonlinear system identification and power load prediction.

The IBWO-RBF model combines the strengths of IBWO algorithm and RBFNNs. Therefore, the model can be applied to non-linear unstable time series forecasting, such as photovoltaic power forecasting, stock trend forecasting and water level forecasting. At present, the field of power load forecasting is of great significance in grid planning due to the connection of various new loads to the grid (Guo et al., 2020). Forecasting methods are divided into: traditional predictive models, machine learning models and combined models (Hermias et al., 2017). A single machine learning method is prone to reduce the accuracy of prediction results due to parameter limitations. Therefore, this paper uses the hybrid IBWO-RBF model to avoid defects such as weak generalization ability and large prediction error. Dong et al. (2017) proposed a method combining CNN and K-means clustering for short-term load forecasting. Ertugrul (2016) developed a recurrent extreme learning machine (RELM) method to predict electricity load. Hu et al. (2022) applied improved grasshopper optimization algorithm (IGOA), and long short-term memory (LSTM) network to forecast short-term load. Bashir et al. (2022) applied prophet-LSTM network based on BPNN to this field. Zhang et al. (2018) used fruit fly optimization algorithm (FOA) to train a hybrid model for power load forecasting.

Most studies use metaheuristic algorithms to optimize neural network parameters and verify their effectiveness on classification or regression problems. However, it has not been verified that the same model has good results in both classification and regression. Therefore, in this study, the proposed model will be tested on both classification and regression problems to verify its effect. What calls for special attention is that many metaheuristic algorithms [PSOGSA (Mirjalili et al., 2012), ABWO (Liu et al., 2022)] considered linear time-varying inertia weight to escape from the local optimum. Time-varying inertial weight of metaheuristic algorithm is the key to training RBFNNs. Therefore, in order to achieve greater accuracy in several classification and regression problems, this article improves the random inertia weight to nonlinear time-varying inertia weight. The IBWO algorithm is proposed. On the basis of the existing linear time-varying inertia weight of ABWO algorithm, nonlinear time-varying inertia weight is designed, so that the effect of IBWO algorithm optimizing RBFNNs is more significant. The IBWO algorithm is combined with RBFNNs. The IBWO-RBF model and other models compare and analyze the results on classical dataset classification, nonlinear function approximation, nonlinear system identification and real-life power load prediction problems. The innovations and contributions of this paper are summarized as follows:

(1) This article develops the IBWO-RBF model. Experiments illustrated the excellence of the proposed model in several aspects. Most articles compare the effectiveness of methods in classification or regression. However, the IBWO-RBF model in this paper verifies both the classification performance and the regression effect.

(2) This paper proposes the IBWO algorithm. Consulting many literatures shows that the time-varying inertia weights of metaheuristic algorithms are the key to training RBFNNs. Therefore, nonlinear time-varying inertia weights are designed to make the optimization of RBFNNs more effective.

(3) The proposed model is not only superior to other models on several classical problems, but also performs well in practical problem. The IBWO-RBF model is a new and effective approach to solve the problem of short-term power load forecasting, providing a priori knowledge for the safe operation and planning of the power grid.

The remainder of this paper is summarized as follows: section “2 Proposed model” briefly describes the structure of IBWO algorithm, RBFNN, and IBWO-RBF model. The simulation results of the IBWO-RBF model on traditional classification and regression examples are introduced and discussed in section “3 Experiments on common classification and regression problems.” The performance of the IBWO-RBF model on power load prediction is presented and discussed in section “4 Experiments on real problems in short-term power load forecasting.” In section “5 Conclusion and future study” are presented.

2. Proposed model

2.1. RBF neural network (RBFNN)

The experiment used a three-layer RBFNN (input layer, hidden layer, and output layer), the specific structure of which is shown in Figure 1. The input layer is connected to the hidden layer through an activation function. The activation function in the hidden layer utilizes Gaussian kernel function, which is expressed in Equation (1). RBF networks are special form of multilayer feedforward neural networks (MLP) (Du and Swamy, 2006). In MLP, activation functions in the hidden neurons are sigmoidal functions, while in RBF networks the Gaussian basis functions are typically used (Mirjalili et al., 2012). Each hidden layer neuron of RBFNNs contains a Gaussian-based activation function that measures the similarity between samples and samples. The reason why the Gaussian basis function has been exploited with respect to other activation functions is that the Gaussian kernel function maps each sample point to an infinite-dimensional feature space, so that the otherwise linearly indivisible data becomes linearly separable. The relationship between hidden layer and output layer is described in Equation (2).

h_{j} = e^{- {∥ x - c_{j} ∥}^{2} / 2 σ_{j}^{2}} (1)

y_{k} = \sum_{j = 1}^{h} w_{j, k} h_{j} (2)

FIGURE 1

Figure 1. Multiple-input multiple-output radial basis function neural network (RBFNN).

where d, h, and o are the number of neurons in the input layer, hidden layer, and output layer, respectively. x_i,h_j, and y_k represent the ith neuron in the input layer, the jth neuron in the hidden layer, and the kth neuron in the output layer. The input vector isx = [x₁,x₂,…,x_d]^T. w_j,k is the output weight between the jth hidden neuron and the kth output neuron. c_j is the center vector of the jth hidden neuron. σ_j is the width of the jth hidden neuron. ∥x−c_j∥ is the Euclidean distance between the input vector and the center vector. The topological size of RBFNN is determined by the number of hidden layer neurons.

2.2. IBWO algorithm for training RBFNN

Swarm intelligence algorithm is a kind of metaheuristic algorithm, which is an optimization method with complete theory and strong optimization performance. Similar to other swarm intelligence algorithms, the BWO algorithm (Hayyolalam and Kazem, 2020) searches for optimal solution based on the reproductive behavior of black widow spiders (crossover, cannibalism, and mutation), which has the advantages of fast convergence speed and high optimization accuracy. The search steps of the BWO algorithm are described as follows:

Step 1. The BWO algorithm initializes a random spider population. Each spider individual is represented by a candidate solution vector of the algorithm, i.e., [b₁,b₂,…,b_dim].

Step 2. The fitness function of the problem to be solved is designed to calculate the quality of the solution. Since strong offspring survive, candidate solutions with smaller fitness function values are preserved. The better solution is retained and the worse solution is deleted.

Step 3. Crossover behavior is the process by which both parents produce offspring according to the crossover rate. The mathematical model corresponding to the crossover behavior is shown in Equation (3).

{\begin{matrix} y_{1} = α .^{*} b_{1} + (1 - α) .^{*} b_{2} \\ y_{2} = α .^{*} b_{2} + (1 - α) .^{*} b_{1} \end{matrix} (3)

Where each element of the α array is a random value in the range [0,1]. b₁ and b₂ are parents. y₁ and y₂ are descendants. The crossover process is repeated dim⁡/2 times. Cannibalism refers to the fact that in both parents, strong female spiders eat weak male spiders; Among offspring, strong offspring eat weak offspring. Mutation behavior is the process by which the original individual produces a new individual according to the mutation rate. The specific mutation process is shown in Figure 2, e.g., [b₁,…,b_i,…,b_j,…,b_n] mutates to [b₁,…,b_j,…,b_i,…,b_n]. This step is to search for other individuals throughout the search space, enhance population diversity, and update candidate solutions.

FIGURE 2

Figure 2. The mutation behavior of a spider individual.

Step 4. Cycle through step 2 and step 3. Candidate solutions are continuously updated and fitness function values are compared until the maximum number of iterations is reached or converges to the optimal value.

Although excellent performance was reported, BWO algorithm has the potential to suffer from premature convergence to a point. Inspired by Fan and Chiu (2007), the β array is no longer a random value, but changes nonlinearly over time. The equation for the β array of the IBWO algorithm is as follows:

β_{i} = 8 \times {(\frac{2}{iter})}^{0.3} β = [β_{1}, β_{2}, \dots, β_{n}] (4)

Therefore, the update equation for the crossover process is:

{\begin{matrix} y_{1} = α .^{*} b_{1} + (1 - α) .^{*} b_{2} \\ y_{2} = β .^{*} b_{2} + (1 - β) .^{*} b_{1} \end{matrix} (5)

where b₁ and b₂ are parents. y₁ and y₂ are descendants. iter is the current number of iterations. Experiments had shown that the constant of the β array is set to 8, which makes the global search ability of the algorithm stronger.

2.3. System architecture of the IBWO-RBF neural network

The IBWO algorithm simultaneously searches the center vector, width, and output vector of RBFNN. The IBWO algorithm and RBFNN are connected by candidate solution vectors of the IBWO algorithm, which is shown in Figure 3.

FIGURE 3

Figure 3. The structure of each black widow individual.

where c_j, σ_j, and w_j are the center vector, width, and output vector of the jth hidden neuron for each black widow individual, respectively. The number of hidden layer neurons in the IBWO-RBF model determines the size of the network. Therefore, h is both the number of hidden layer neurons and the size of the neural network. The dimension of the black widow is determined by the number of neurons in the input layer, hidden layer and output layer of IBWO-RBF. D is the dimension of the black widow, and D is equal to d + 1 + o. d is the number of input vector and o is the number of output vector. The number of input layer neurons of IBWO-RBFNN is determined by the dimension of the input data. The model is a single-output model when solving regression problems, but a multiple-output model when solving classification problems. The number of neurons in the output layer of IBWO-RBFNN is determined by the number of categories of classification problems.

For classification and regression problems, Root Mean Square Error (RMSE) is used as a fitness function to optimize RBFNN. Take the classification problem as an example, the specific calculation equation is expressed as follows. The RMSE is expected to be as close to 0 as possible. Consequently, the goal of the search process is to obtain an optimal solution that minimizes the fitness function value. The global optimal widow optimized by the IBWO algorithm is the optimal parameter of RBFNN. Figure 4 shows the overall architecture of this article.

R M S E = \sqrt{\frac{1}{N} \sum_{k = 1}^{N} {(y_{k} - f (x_{k}))}^{2}} (6)

FIGURE 4

Figure 4. System architecture of the improved black widow optimization algorithm (IBWO)-RBF model.

where N is the number of output neurons, y_k and f(x_k) are the real output of the model and the desired output, respectively.

3. Experiments on common classification and regression problems

3.1. Classification problems for UCI datasets

In this section, the parameters of the RBFNN were trained using the IBWO algorithm for a classification problem on 10 binary classification datasets. And IBWO was compared with 12 algorithms, namely PSO, GA, ABC, DE, FA, CS, BA, ABWO, BWO, biogeography-based optimization algorithm (BBO), ant clony optimization algorithm (ACO), and gradient-based algorithm (GB).

3.1.1. Experimental setup

The 10 binary classification datasets used in this section were selected from the UCI repository, and the detailed descriptions are shown in Table 1. All datasets are trained and tested at 66 and 34%, respectively. The population size of all algorithms was fixed at 50, and each classification experiment was run 10 times independently, with 250 iterations for each run.

TABLE 1

Table 1. Description of the classification datasets.

Without prior knowledge, the number of hidden layer neurons is not easy to determine (Wang et al., 2019). In order to reduce the error caused by the randomness of the neural network, this paper adopted the trial-and-error method (Kermanshahi, 1998) to determine the number of hidden layer neurons. When the number of hidden layer neurons was set from 3 to 16 in IBWO-RBF, set to 4, 6, 8, and 10 in all other algorithms. All algorithms selected the structure of IBWO-RBF with the best classification result, and then obtained the corresponding number of hidden neurons. The specific operation is as follows: first, the parameters of the neural network were arbitrarily given, and each subsequent selection only changed the number of hidden neurons. Under the condition that the other parameters remained unchanged, repeated experiments were repeated 10 times, and by comparing the obtained average classification rates, the number of hidden neurons with the highest average classification rate was finally selected.

3.1.2. Data preprocessing

Some datasets downloaded from the UCI repository have abnormal problems, such as missing values, containing English characters, etc. Therefore, the dataset was preprocessed (including outlier handling and data normalization). First, the samples with missing information were deleted; second, the English strings were replaced with numbers such as 0, 1, 2, and 3; third, in order to eliminate the dimensional influence between the several features of the dataset, and to solve the data attributes comparability. Therefore, the data standardization process was carried out, and the attribute value and the classification value were normalized to [−1, 1]. Finally, the training set and testing set were randomly selected from the dataset samples in proportion. The attribute value of the training sample was used as the input of the model, and the category of the training sample was used as the output of the model.

3.1.3. Experimental results and discussion

In this section, all models were compared in terms of accuracy (i.e., average classification rate) and structural complexity, and the Friedman test was carried out to obtain the ranking of the algorithms. Table 2 shows the average accuracy of 10 independent runs of 13 models on 10 datasets. Table 3 describes the number of hidden neurons that gave the best results. The best accuracy results and the least number of neurons are highlighted in bold. The proposed model achieved 81.02% classification accuracy on the Blood dataset, whereas GB-RBF achieved 81.18% accuracy. On the Blood classification dataset, the proposed model ranked second. On other datasets except the Blood dataset, the average classification rate of IBWO-RBF had obtained better results than all other algorithms, that is, ranked first. Not only that, it can be seen from Table 3 that on most datasets, the IBWO-RBF model used fewer neurons in the hidden layer than other models. Especially on the Hepatitis, Diagnosis I, Australian, Diagnosis II, German and Liver datasets, the IBWO-RBF model used the simplest structure and the least number of hidden neurons. The proposed IBWO-RBF model not only has high classification accuracy, but also has a simple network structure.

TABLE 2

Table 2. Average classification rate (in %) of 13 algorithms on 10 datasets.

TABLE 3

Table 3. Minimum number of hidden neurons for 13 algorithms on 10 datasets.

Additionally, to make this conclusion convincing, a Friedman statistical test was completed to test the significance of the average classification rates. The Friedman test obtained the ranking of 14 optimized algorithms by analyzing the average accuracy value of each dataset. The sorted table is shown in Table 4. It can be known that the IBWO-RBF model ranks first. Sensitivity and specificity of these algorithms on 10 datasets is shown in Table 5. Optimal sensitivity and specificity are highlighted in bold. Compared with the other 12 models, the proposed model showed stronger sensitivity and specificity on 9 datasets (Blood, Breast, Hepatitis, Diabetes, Vertebral, Diagnosis I, Diagnosis II, Liver, and Australian). It can be seen that the IBWO algorithm achieves higher classification accuracy on each dataset than the original BWO algorithm. Therefore, the improvement to the original BWO algorithm in this paper greatly improve its classification performance. Combining the above experimental results, it can be clearly concluded that IBWO outperforms most other algorithms in optimizing the parameters of RBFNNs for classification problems, which proves the strength of the IBWO algorithm in training RBFNNs.

TABLE 4

Table 4. Friedman test results.

TABLE 5

Table 5. Sensitivity and specificity of these algorithms on 10 datasets.

3.2. Nonlinear regression problems

In this section, the IBWO-RBF model was used in nonlinear function approximation and nonlinear system identification, two nonlinear regression problems, and compared with five models, namely the BWO-RBF model, the Euclidean kernel based RBF (EK-RBF), the Cosine kernel based RBF (CK-RBF), the dynamic fusion of Euclidean and cosine kernels based RBF (DF-RBF) and the manual fusion of Euclidean and cosine kernels based RBF (MF-RBF). The results of the comparison were converted by the following equation and expressed in one electrical unit (dB).

a = 10 \log (a) d B (7)

The evaluation index is the mean square error (MSE), which is described as follows:

M S E = \frac{1}{n} \sum_{j = 1}^{n} {(y_{j} - f (x_{j}))}^{2} (8)

where n is the number of samples, y_j and f(x_j) are the real output of the network and the desired output, respectively.

3.2.1. Nonlinear function approximation

The nonlinear function approximation took the following equation as an example:

f (x, y) = e^{(x^{2} - y)} (9)

The function f(x,y) is approximately replaced by a simple function P(x,y) with the aim of minimizing the error of P(x,y) and f(x,y) in a metric sense (Zhang Y. et al., 2022). Figures 5, 6 shows the MSE value and approximation results of the IBWO-RBF model on the approximation of a nonlinear function, respectively. The MSE and the number of hidden layer neurons compared with the other four models are described in Table 6.

FIGURE 5

Figure 5. MSE value of the improved black widow optimization algorithm (IBWO)-RBF.

FIGURE 6

Figure 6. Approximation results of the improved black widow optimization algorithm (IBWO)-RBF.

TABLE 6

Table 6. Comparison of six models for the approximation of nonlinear function.

It can be seen that the MSE of the IBWO-RBF model is the smallest, which is −49.8444dB, and the number of hidden neurons is the least, and the neural network structure is simpler. Again, under the same number of iterations of 10,000 for the five models, the IBWO-RBF model converges faster and achieves early convergence. It is evident from Figure 6 that the proposed model fits the points on the nonlinear function perfectly. Moreover, the model converged to the optimal value within only 1,000 generations. Both function approximation accuracy and search speed show excellent results. Experiments show that the IBWO-RBF model outperforms the other four variants of RBFNNs and BWO-RBF model in both the accuracy of function approximation and the complexity of the network structure.

3.2.2. Nonlinear system identification

The nonlinear system is given by the following equation:

y (t) = 2 r (t) - 0.5 r (t - 1) - 0.1 r (t - 2) - 0.7

[\cos (3 r (t)) + e^{- | r (t) |}] + n (t) (10)

where r(t) is the input of the system, and y(t) is the output; the model is assumed to be N(0,0.0025), n(t) is the disturbance of the system; the constants are the polynomial coefficients. Figure 7 illustrates the MSE values of the IBWO-RBF model for nonlinear system identification. Moreover, Identification results are shown in Figure 8. The MSE and the number of hidden layer neurons compared with the other four models are shown in Table 7. It can be seen from the above figure that the IBWO-RBF model converges within 100 iterations, which is faster than the other five models and has a smaller MSE. The MSE of the proposed model is much smaller than that of the original BWO algorithm combined with RBFNN. Moreover, the conclusions support the advantage of the proposed model for the nonlinear system identification with smaller network structures.

FIGURE 7

Figure 7. MSE value of the improved black widow optimization algorithm (IBWO)-RBF against iterations.

FIGURE 8

Figure 8. Identification results of the improved black widow optimization algorithm (IBWO)-RBF model.

TABLE 7

Table 7. Comparison of six models for the nonlinear system identification.

4. Experiments on real problems in short-term power load forecasting

4.1. Data description

The dataset in this section is from the National Collegiate Electrotechnical Mathematical Modeling Competition.¹ The dataset is from 1 January to 14 May 2012 with a time interval of 15 min, including meteorological factors such as daily maximum temperature, daily minimum temperature, daily average temperature, daily relative humidity, and daily rainfall. The experiment selected data from 0:00 to 23:00 with an interval of 1 h. To improve forecast accuracy, historical data that is less relevant to the meteorological data on the forecast date was removed. A set of highly correlated meteorological data remained, for a total of 50 days. The multiple-input single-output IBWO-RBF model was adopted, which had 25 neurons in the input layer, 4 neurons in the hidden layer, and 1 neuron in the output layer. For 0:00, the input for each training sample was the load value of 0:00 for the previous 20 days and the meteorological data for the next day, and the output was the load value of 0:00 for the next day. The input of the test sample was the load value of 0:00 for the 20 days before the forecasting day, and the meteorological data for the forecasting day. The output of the test sample was the load value of 0:00 for the forecasting day. The experimental method at other time points was the same as 0:00 (Chen et al., 2018). The IBWO-RBF model was compared to the other 5 models.

4.2. Experimental setup

The Matlab R2019a is used to implement IBWO-RBF, BWO-RBF, PSO-RBF, RBF, ELM, and BPNN. All models are run independently 10 times to take the average. The number of iterations and population size for all algorithms are 500 and 50, respectively. For the IBWO-RBF and BWO-RBF model, the crossover rate is 0.8, the mutation rate is 0.4, and the cannibalism rate is 0.5. Acceleration constants of the PSO-RBF model is [2, 2], and inertia weights is [0.9, 0.6]. For the three hybrid models (IBWO-RBF, BWO-RBF, and PSO-RBF), the number of hidden layer neurons in each model is 4. A single RBF neural network model is designed quickly and error-free using the newrbe function. The number of hidden layer neurons defaults to the number of samples in the training set, which is set to 5 in the ELM model. Similarly, the number of hidden layer neurons in the BPNN model is also set to 5.

4.3. Experimental results and discussion

The predicted values and prediction errors are shown in Tables 8, 9, respectively. From 00:00 to 23:00, the smallest error at each time is marked in bold. The RMSE for IBWO-RBF, BWO-RBF, PSO-RBF, RBF, ELM and BPNN were 357.8041, 565.9603, 589.0443, 1,220.5, 666.3963, and 576.4008, respectively. The proposed model has the smallest RMSE, and the results of the BWO-RBF model are second only to the proposed model. In addition to 7:00, 8:00, 16:00, and 19:00, compared with the other three single models, IBWO-RBF, BWO-RBF, and PSO-RBF obtained the best predicted values. It can be seen that the hybrid model of metaheuristic algorithm and RBFNN is better than that of a single model, which verifies the effectiveness of the combinatorial method. Most of the time, the IBWO-RBF model has less error than all other models. In addition to 6:00 and 7:00, the IBWO-RBF model has a smaller error than the BWO-RBF, indicating that improvements to the IBWO algorithm can also improve the prediction accuracy of real-world problems.

TABLE 8

Table 8. Comparison of predicted values of 6 models.

TABLE 9

Table 9. Comparison of prediction errors of 6 models.

Again, Figures 9, 10 more vividly demonstrate the predictive performance of the 6 models. All models can predict approximate curves close to the true load values. It can be clearly seen in the Figure 10, from 22:00 to 7:00, all models show better performance. The reason for this phenomenon may be that less electricity is used at night, resulting in more stable predictions, while higher electricity consumption during the day causes inaccurate predictions. It indicates that the load value at night is more accurately predicted. Compared to other models, from 9 to 23 o’clock, the predicted values of the proposed model are more accurate. During the day, the proposed model stands out more than other models for predictive performance. A single RBFNN performed the worst, but combining RBFNN with algorithms outperformed ELM and BPNN. The curve of the IBWO-RBF model is closest to the curve of the true load value, and the error curve of the model is mostly at the bottom. Therefore, the proposed model has the smallest prediction error, the highest optimization accuracy, and its prediction curve is closest to the expected value curve. After experimental verification, the prediction performance of the proposed model for the actual power load problem is worthy of recognition.

FIGURE 9

Figure 9. Forecasting curves of 6 models and real load curves.

FIGURE 10

Figure 10. Predicted error curves of 6 models.

5. Conclusion and future work

A novel hybrid model called IBWO-RBF is applied for solving classical dataset classification, nonlinear function approximation, nonlinear system identification, and real power load prediction. The IBWO algorithm introduces a nonlinear time-varying factor to better balance the diversity and convergence for optimizing parameters of a RBFNN. First, IBWO-RBF classifies some classification datasets in the UCI repository. The conclusion proves that IBWO-RBF has higher classification accuracy and simpler structure than metaheuristic algorithms such as PSO, GA, DE, and so on. Again, the proposed model also converges faster than other RBFNNs on nonlinear learning problems with higher optimization accuracy. In addition, compared with other models, the proposed model proves its superior power load prediction performance. The experimental results demonstrated obviously that the developed model is effective, competitive and promising tool for solving classification and nonlinear regression problems. In the future, this model can be developed in power plants to predict power load simply and efficiently. Furthermore, metaheuristic algorithms can be combined with deep learning networks to solve a variety of time series forecasting problems.

Data availability statement

The original contributions presented in this study are included in this article/supplementary material, further inquiries can be directed to the corresponding authors.

Author contributions

HL: methodology and writing—original draft. GZ: writing—review and editing. YZ: analysis of experimental results and software. HH: review and editing. XW: testing of experimental results. All authors contributed to the article and approved the submitted version.

Funding

This research was funded by the National Natural Science Foundation of China (Grant Nos. 62066005, U21A20464, and 62266007) and Program for Young Innovative Research Team in China University of Political Science and Law (Grant No. 21CXTD02).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

^ http://shumo.neepu.edu.cn

References

Agarwal, V., and Bhanot, S. (2018). Radial basis function neural network-based face recognition using firefly a lgorithm. Neural Comput. Appl. 30, 2643–2660. doi: 10.1007/s00521-017-2874-2

CrossRef Full Text | Google Scholar

Aljarah, I., Faris, H., and Mirjalili, S. (2018a). Optimizing connection weights in neural networks using the whale optimization algorithm. Soft Comput. 22, 1–15. doi: 10.1007/s00500-016-2442-1

CrossRef Full Text | Google Scholar

Aljarah, I., Faris, H., Mirjalili, S., and Al-Madi, N. (2018b). Training radial basis function networks using biogeography-based optimizer. Neural Comput. Appl. 29, 529–553. doi: 10.1007/s00521-016-2559-2

CrossRef Full Text | Google Scholar

Asteris, P. G., and Nikoo, M. (2019). Artificial bee colony-based neural network for the prediction of the fundamental period of infilled frame structures. Neural Comput. Appl. 31, 4837–4847. doi: 10.1007/s00521-018-03965-1

CrossRef Full Text | Google Scholar

Bashir, T., Haoyong, C., Tahir, M. F., and Liqiang, Z. (2022). Short term electricity load forecasting using hybrid prophet-LSTM model optimized by BPNN. Energy Rep. 8, 1678–1686. doi: 10.1016/j.egyr.2021.12.067

CrossRef Full Text | Google Scholar

Belaout, A., Krim, F., Mellit, A., Talbi, B., and Arabi, A. (2018). Multiclass adaptive neuro-fuzzy classifier and feature selection techniques for photovoltaic array fault detection and classification. Renew. Energy 127, 548–558. doi: 10.1016/j.renene.2018.05.008

CrossRef Full Text | Google Scholar

Broomhead, D., and Lowe, D. (1988). Multivariable functional interpolation and adaptive networks. Complex Sys. 2, 321–355.

PubMed Abstract | Google Scholar

Chen, L., Yin, L., and Tao, Y. U. (2018). Short-term power load forecasting based on deep forest algorithm. Electr. Power Constr. 39, 42–50.

Google Scholar

Ding, S., Zhang, Y., Chen, J., and Weikuan, J. (2013). Research on using genetic algorithms to optimize Elman neural networks. Neural Comput. Appl. 23, 293–297. doi: 10.1007/s00521-012-0896-3

CrossRef Full Text | Google Scholar

Dong, X., Qian, L., and Huang, L. (2017). Short-term load forecasting in smart grid: A combined CNN and K-means clustering approach. 2017 IEEE international conference on big data and smart computing (BigComp). Piscataway: IEEE, 119–125. doi: 10.1109/BIGCOMP.2017.7881726

CrossRef Full Text | Google Scholar

Du, K. L., and Swamy, M. N. (2006). Neural networks in a soft computing framework. London: Springer.

Google Scholar

Ertugrul, ÖF. (2016). Forecasting electricity load by a novel recurrent extreme learning machines approach. Int. J. Electr. Power Energy Syst. 78, 429–435. doi: 10.1016/j.ijepes.2015.12.006

CrossRef Full Text | Google Scholar

Fan, S. K., and Chiu, Y. Y. (2007). A decreasing inertia weight particle swarm optimizer. Eng. Optim. 39, 203–228. doi: 10.1080/03052150601047362

CrossRef Full Text | Google Scholar

Guo, X., Zhao, Q., Zheng, D., Ning, Y., and Gao, Y. (2020). A short-term load forecasting model of multi-scale CNN-LSTM hybrid neural network considering the real-time electricity price. Energy Rep. 6, 1046–1053. doi: 10.1016/j.egyr.2020.11.078

CrossRef Full Text | Google Scholar

Han, H. G., Ma, M. L., and Qiao, J. F. (2021). Accelerated gradient algorithm for RBF neural network. Neurocomputing 441, 237–247. doi: 10.1016/j.neucom.2021.02.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, H. G., Ma, M. L., Yang, H. Y., and Qiao, J. F. (2022). Self-organizing radial basis function neural network using accelerated second-order learning algorithm. Neurocomputing 469, 1–12. doi: 10.1016/j.neucom.2021.10.065

CrossRef Full Text | Google Scholar

Han, J. W., Li, Q. X., Wu, H. R., Huaji, Z., and Song, Y. L. (2019). Prediction of cooling efficiency of forced-air precooling systems based on optimized differential evolution and improved BP neural network. Appl. Soft Comput. 84:105733. doi: 10.1016/j.asoc.2019.105733

CrossRef Full Text | Google Scholar

Hayyolalam, V., and Kazem, A. (2020). Black widow optimization algorithm: A novel meta-heuristic approach for solving engineering optimization problems. Eng. Appl. Artif. Intell. 87, 103249.1–103249.28. doi: 10.1016/j.engappai.2019.103249

CrossRef Full Text | Google Scholar

Hermias, J. P., Teknomo, K., and Monje, J. C. (2017). Short-term stochastic load forecasting using autoregressive integrated moving average models and Hidden Markov Model. 2017 international conference on information and communication technologies (ICICT). Piscataway: IEEE, 131–137. doi: 10.1109/ICICT.2017.8320177

CrossRef Full Text | Google Scholar

Holland, J. H. (1975). An efficient genetic algorithm for the traveling salesman problem. Eur J Oper Res 145, 606–617.

Google Scholar

Hong, D., Yao, J., Meng, D., Zongben, X., and Chanussot, J. (2020). Multimodal GANs: Toward crossmodal hyperspectral multispectral image segmentation. IEEE Trans. Geosci. Remote Sens. 59, 5103–5113. doi: 10.1109/TGRS.2020.3020823

CrossRef Full Text | Google Scholar

Hu, H., Feng, J., and Zhai, X. (2018). The method for single well operational cost prediction combining rbf neural network and improved PSO algorithm. Proceedings of the 2018 IEEE 4th international conference on computer and communications (ICCC). Piscataway: IEEE, 2081–2085. doi: 10.1109/CompComm.2018.8780836

CrossRef Full Text | Google Scholar

Hu, H., Xia, X., Luo, Y., Muhammad, C. Z., Nazira, S., and Penga, T. (2022). Development and application of an evolutionary deep learning framework of LSTM based on improved grasshopper optimization algorithm for short-term load forecasting. J. Build. Eng. 57:104975. doi: 10.1016/j.jobe.2022.104975

CrossRef Full Text | Google Scholar

Jing, Z., Chen, J., and Li, X. R. B. F.-G. A. (2019). An adaptive radial basis function metamodeling with genetic algorithm for structural reliability analysis. Reliab. Eng. Syst. Safety 189, 42–57. doi: 10.1016/j.ress.2019.03.005

CrossRef Full Text | Google Scholar

Karaboga, D. (2005). An idea based on honey bee swarm for numerical optimization. Technical report-tr06, Erciyes university, engineering faculty. Comput. Eng. Depart.

Google Scholar

Katooli, M. S., and Koochaki, A. (2020). Detection and Classification of Incipient Faults in Three-Phase Power Transformer Using DGA Information and Rule-based Machine Learning Method. J. Cont. Autom. Electr. Syst. 31, 1251–1266. doi: 10.1007/s40313-020-00625-5

CrossRef Full Text | Google Scholar

Kaya, E. (2022). Quick flower pollination algorithm (QFPA) and its performance on neural network training. Soft Comput. 26, 9729–9750. doi: 10.1007/s00500-022-07211-8

CrossRef Full Text | Google Scholar

Kennedy, J., and Eberhart, R. (1995). Particle swarm optimization. Proceedings of ICNN’95-international conference on neural networks. Piscataway: IEEE.

Google Scholar

Kermanshahi, B. (1998). Recurrent neural network for forecasting next 10 years loads of nine Japanese utilities. Neurocomputing 23, 125–133. doi: 10.1016/S0925-2312(98)00073-3

CrossRef Full Text | Google Scholar

Khan, S., Naseem, I., Togneri, R., and Bennamoun, M. (2017). A novel adaptive kernel for the RBF neural networks. Circ. Syst. Signal Process. 36, 1639–1653. doi: 10.1007/s00034-016-0375-7

CrossRef Full Text | Google Scholar

Korürek, M., and Doğan, B. (2010). ECG beat classification using particle swarm optimization and radial basis function neural network. Exp. Syst. Appl. 37, 7563–7569. doi: 10.1016/j.eswa.2010.04.087

CrossRef Full Text | Google Scholar

Kumar, P. M., Lokesh, S., Varatharajan, R., Chandra Babu, G., and Parthasarathy, P. (2018). Cloud and IoT based disease prediction and diagnosis system for healthcare using Fuzzy neural classifier. Future Gener. Comput. Syst. 86, 527–534. doi: 10.1016/j.future.2018.04.036

CrossRef Full Text | Google Scholar

Li, L. L., Sun, J., Tseng, M. L., and Li, Z. G. (2019). Extreme learning machine optimized by whale optimization algorithm using insulated gate bipolar transistor module aging degree evaluation. Exp. Syst. Appl. 127, 58–67. doi: 10.1016/j.eswa.2019.03.002

CrossRef Full Text | Google Scholar

Liu, H., Zhou, Y., Luo, Q., Huang, H., and Wei, X. (2022). Prediction of photovoltaic power output based on similar day analysis using RBF neural network with adaptive black widow optimization algorithm and K-means clustering. Front. Energy Res 10:990018. doi: 10.3389/fenrg.2022.990018

CrossRef Full Text | Google Scholar

Memar, S., Mahdavi-Meymand, A., and Sulisz, W. (2021). Prediction of seasonal maximum wave height for unevenly spaced time series by black widow optimization algorithm. Marine Struct. 78:103005. doi: 10.1016/j.marstruc.2021.103005

CrossRef Full Text | Google Scholar

Micev, M., Ćalasan, M., Petrović, D. S., Ali, Z. M., Quynh, N. V., and Shady, H. E. (2020). Field current waveform-based method for estimation of synchronous generator parameters using adaptive black widow optimization algorithm. IEEE Access 8, 207537–207550. doi: 10.1109/ACCESS.2020.3037510

CrossRef Full Text | Google Scholar

Mirjalili, S. A., Hashim, S. Z., and Sardroudi, H. M. (2012). Training feedforward neural networks using hybrid particle swarm optimization and gravitational search algorithm. Appl. Math. Comput. 218, 11125–11137. doi: 10.1016/j.amc.2012.04.069

PubMed Abstract | CrossRef Full Text | Google Scholar

Mirjalili, S., and Lewis, A. (2016). The whale optimization algorithm. Adv. Eng. Softw. 95, 51–67. doi: 10.1016/j.advengsoft.2016.01.008

CrossRef Full Text | Google Scholar

Mukilan, P., and Semunigus, W. (2021). Human object detection: An enhanced black widow optimization algorithm with deep convolution neural network. Neural Comput. Appl. 33, 15831–15842. doi: 10.1007/s00521-021-06203-3

CrossRef Full Text | Google Scholar

Panahi, F., Ehteram, M., and Emami, M. (2021). Suspended sediment load prediction based on soft computing models and black widow optimization algorithm using an enhanced gamma test. Env. Sci. Poll. Res. 28, 48253–48273. doi: 10.1007/s11356-021-14065-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Pazouki, G., Golafshani, E. M., and Behnood, A. (2022). Predicting the compressive strength of self-compacting concrete containing class F fly ash using metaheuristic radial basis function neural network. Struct. Concr. 23, 1191–1213. doi: 10.1002/suco.202000047

CrossRef Full Text | Google Scholar

Pradeepkumar, D., and Ravi, V. (2017). Forecasting financial time series volatility using particle swarm optimization trained quantile regression neural network. Appl. Soft Comput. 58, 35–52. doi: 10.1016/j.asoc.2017.04.014

CrossRef Full Text | Google Scholar

Storn, R., and Price, K. (1997). Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 11, 341–359. doi: 10.1023/A:1008202821328

CrossRef Full Text | Google Scholar

Versaci, M., Angiulli, G., Crucitti, P., Carlo, D. D., Lagan, F., Pellican, D., et al. (2022). A fuzzy similarity-based approach to classify numerically simulated and experimentally detected carbon fiber-reinforced polymer plate defects. Sensors 22:4232. doi: 10.3390/s22114232

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, C., Liu, Y., Zhang, Q., Haohao, G., Xiaoling, L., Yang, C., et al. (2019). Association rule mining based parameter adaptive strategy for differential evolution algorithms. Exp. Syst. Appl. 123, 54–69. doi: 10.1016/j.eswa.2019.01.035

CrossRef Full Text | Google Scholar

Wang, Z., Ding, H., Yang, J., Hou, P., Dhiman, G., Wang, J., et al. (2017). Salp swarm algorithm: A bio-inspired optimizer for engineering design problems. Adv. Eng. Softw. 114, 163–191. doi: 10.1016/j.advengsoft.2017.07.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, H., Zhou, Y., Luo, Q., and Mohamed, A. B. (2016). Training feedforward neural networks using symbiotic organisms search algorithm. Comput. Intell. Neurosci. 2016:9063065. doi: 10.1155/2016/9063065

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, X. S. (2009). Firefly algorithms for multimodal optimization. International symposium on stochastic algorithms. Berlin: Springer, 169–178. doi: 10.1007/978-3-642-04944-6_14

CrossRef Full Text | Google Scholar

Yang, X. S. (2010). A new metaheuristic bat-inspired algorithm. Nature inspired cooperative strategies for optimization (NICSO 2010). Berlin: Springer, 65–74. doi: 10.1007/978-3-642-12538-6_6

CrossRef Full Text | Google Scholar

Yang, X. S. (2012). Flower pollination algorithm for global optimization. International conference on unconventional computing and natural computation. Berlin: Springer, 240–249. doi: 10.1007/978-3-642-32894-7_27

CrossRef Full Text | Google Scholar

Yang, Y., Chen, Y., Wang, Y., Caihong, L., and Lian, Li (2016). Modelling a combined method based on ANFIS and neural network improved by DE algorithm: A case study for short-term electricity demand forecasting. Appl. Soft Comput. 49, 663–675. doi: 10.1016/j.asoc.2016.07.053

CrossRef Full Text | Google Scholar

Yao, J., Cao, X., Hong, D., Wu, X., Meng, D., Chanussot, J., et al. (2022). “Semi-active convolutional neural networks for hyperspectral image classification,”. IEEE Trans. Geosci. Remote Sens. 60:5537915. doi: 10.1109/TGRS.2022.3206208

CrossRef Full Text | Google Scholar

Zhang, H., Yao, J., Ni, L., Gao, L., and Huang, M. (2022). ”Multimodal Attention-Aware Convolutional Neural Networks for Classification of Hyperspectral and LiDAR Data,” in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. Piscataway: IEEE. doi: 10.1109/JSTARS.2022.3187730

CrossRef Full Text | Google Scholar

Zhang, J., Wei, Y. M., Li, D., and Liqiang, Z. (2018). Short term electricity load forecasting using a hybrid model. Energy 158, 774–781. doi: 10.1016/j.energy.2018.06.012

CrossRef Full Text | Google Scholar

Zhang, W., and Wei, D. (2018). Prediction for network traffic of radial basis function neural network model based on improved particle swarm optimization algorithm. Neural Comput. Appl. 29, 1143–1152. doi: 10.1007/s00521-016-2483-5

CrossRef Full Text | Google Scholar

Zhang, Y., Zhou, Y., Zhou, G., Luo, Q., and Zhu, B. (2022). A curve approximation approach using bio-inspired polar coordinate bald eagle search algorithm. Int. J. Comput. Intell. Syst. 15, 1–25. doi: 10.1007/s44196-022-00084-7

CrossRef Full Text | Google Scholar

Keywords: improved black widow optimization algorithm, radial basis function neural networks, classification and regression, power load prediction, metaheuristic

Citation: Liu H, Zhou G, Zhou Y, Huang H and Wei X (2023) An RBF neural network based on improved black widow optimization algorithm for classification and regression problems. Front. Neuroinform. 16:1103295. doi: 10.3389/fninf.2022.1103295

Received: 20 November 2022; Accepted: 21 December 2022;
Published: 10 January 2023.

Edited by:

Salim Heddam, University of Skikda, Algeria

Reviewed by:

Mario Versaci, Mediterranea University of Reggio Calabria, Italy
Amirfarhang Mehdizadeh, University of Missouri–Kansas City, United States
Jing Yao, Aerospace Information Research Institute (CAS), China

Copyright © 2023 Liu, Zhou, Zhou, Huang and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Guo Zhou, www.frontiersin.org Z3VvLnpob3VAbGl2ZS5jb20=; Yongquan Zhou, emhvdXlvbmdxdWFuQGd4dW4uZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.