Nonintrusive Load Monitoring Method Based on Color Encoding and Improved Twin Support Vector Machine

Zhang, Ruoyuan; Wang, Yuan; Song, Yang

doi:10.3389/fenrg.2022.906458

METHODS article

Front. Energy Res., 22 July 2022

Sec. Smart Grids

Volume 10 - 2022 | https://doi.org/10.3389/fenrg.2022.906458

This article is part of the Research TopicAdvanced AI Applications for Modelling, Optimization, Control, and Planning of Smart GridView all 39 articles

Nonintrusive Load Monitoring Method Based on Color Encoding and Improved Twin Support Vector Machine

Ruoyuan Zhang¹*

Yuan Wang²

Yang Song¹

¹Anhui Water Conservancy Technical College, Hefei, China
²School of Computer Science and Engineering, North Minzu University, Yinchuan, China

In the process of traditional power load identification, the load information of V-I track is missing, the image similarity of V-I track of some power loads is high and the recognition effect is not good, and the training time of recognition model is too long. In view of the abovementioned situation, this study proposes a power load recognition method based on color image coding and the improved twin support vector machine (ITWSVM). First, based on the traditional voltage–current gray trajectory method, the bilinear interpolation technique is used to solve the pixel discontinuity problem effectively. Considering the complementarity of features, the numerical features are embedded into the gray V-I trajectory by constructing three channels, namely, current (R), voltage (G), and phase (B), so the color V-I image with rich electrical features is obtained. Second, the two-dimension Gabor wavelet is used to extract the texture features of the image, and the dimension is reduced by means of local linear embedding (LLE). Finally, the artificial fish swarm algorithm (AFSA) is used to optimize the twin support vector machine (TWSVM), and the ITWSM is used to train the load recognition model, which greatly enhances the model training speed. Experimental results show that the proposed color V-I image coding method and the ITWSVM classification method, compared with the traditional V-I track image construction method and image classification algorithm, improve the accuracy by 6.12% and reduce the model training time by 1071.23 s.

1 Introduction

NILM is the main tool to analyze the electrical behavior of residential users. It can collect, store, and analyze important electrical information through the data acquisition and communication device installed at the user’s power supply entrance. In this way, users can accurately perceive the running state and energy consumption of each electrical equipment (Sun et al., 2017; Guo et al., 2021). Compared with the traditional invasive monitoring device for the analysis of each electrical equipment installation, NILM without further user internal can grasp the electricity situation of all kinds of equipment, on the one hand; reduce the hardware investment, on the other hand; and also remove the existing transformation and maintenance of electrical lines, to a great extent, to protect the user’s privacy (Deng et al., 2020). NILM has a large number of applications in energy efficiency monitoring, fault diagnosis, load modeling, and demand response (Wang and Liu, 2019). In particular, the energy consumption information provided by NILM is of great value for users to understand their own energy consumption structure and guide them to use electricity reasonably, thus realizing energy saving and loss reduction and reducing electricity costs (Zhou et al., 2018; Sun et al., 2020). Therefore, NILM has received extensive attention and strong support from the industry in recent years, and significant progress has been made in related research (Cui et al., 2020; Xiang et al., 2022).

The flow of NILM can be divided into data acquisition, feature extraction, model training, and online recognition. Among them, feature extraction and model training directly affect the accuracy of the algorithm. Feature extraction refers to the use of digital signal processing technology or circuit analysis theory to extract valuable indicators from the collected electrical signals to distinguish different types of electrical equipment. In noninvasive load identification tasks, common load characteristics include voltage and current waveform, current harmonics (Cui et al., 2020), active and reactive power, V-I trajectory (Liu et al., 2021; Xiang et al., 2022), instantaneous power (Li et al., 2021), etc. Among them, V-I trajectory features are characterized by high repeatability and strong stability (Gao et al., 2016; Wu et al., 2020). However, traditional identification methods are prone to misjudge different types of equipment with highly overlapping load characteristics into the same category.

In order to reduce the category and number of misjudgment and realize more efficient model training, new recognition techniques still need to be developed. Scholars at home and abroad have conducted a large number of in-depth studies on V-I trajectory. Liu et al. (2020) extracted the binary V-I trajectory contour to realize the full mining of trajectory shape features. Tu et al. (2018) proposed for the first time to map V-I trajectories into binary grid images, which reduced the computational cost. Zhang et al. (2020) took the binary V-I track image as input to transform the load identification problem into an image classification problem. Due to the outstanding performance of the artificial intelligence algorithm in the field of image classification, the ant colony algorithm is introduced by Du et al. (2016) to extract key features of the weighted pixelated track image, which improves the accuracy of load recognition. Niu et al. (2009) adopted the Fryze power theory to extract reactive current from current, which increased the distinguishing degree of current characteristics. Li et al. (2019) constructed the voltage-reactive current trajectory on the basis of Fryze theory and color-coded the trajectory to integrate other load characteristics. However, its defect was that the trajectory could not reflect the power of the device without combining power characteristics. Wang et al. (2019a) and Chen et al. (2019) used Gram matrix transformation and genetic algorithm, respectively, for power feature fusion, which improves load feature diversity. Although the V-I load identification model and algorithm of trajectory is increasingly mature, there still exists the following problems: trajectory image binary V-I can only transfer trajectory shape information, in principle cannot reflect the power, phase information, such as equipment, and because there are many different kinds of electrical appliances and working principle of the similarity between the different kinds of load V-I trajectory characteristics of overlapping phenomenon. Although the electrical information contained in binary images is more comprehensive, the discontinuity of pixels in V-I image construction will lead to the loss of a lot of useful information, especially the traditional methods cannot fully excavate the advanced features of images. Therefore, the accuracy of load identification still has room for further improvement.

Different from the abovementioned feature extraction and recognition methods, a noninvasive load recognition method based on image coding and the improved twin support vector machine is proposed. This method combines digital features with image features and exploits the outstanding advantages of the improved twin support vector machine in image recognition field to mine the important information contained in electrical signals as much as possible. The main methods are as follows: first, based on the grayscale V-I image, continuous V-I pixels are realized by bilinear interpolation, and three channels, current (R), voltage (G), and phase (B), are constructed by image coding technology to form a continuous color image. Second, two-dimensional Gabor wavelet (Wei et al., 2020) is used to effectively filter the image data to obtain the key texture features of the color V-I image, and LLE dimension reduction is carried out for multiple texture features to reduce the huge amount of calculation caused by high dimension. Third, the parameters of the TWSVM were taken as the position information of artificial fish, and the classification accuracy was taken as the objective function. Then, the optimal location and optimal solution were updated by foraging, clustering, trailing, and random behaviors of ant colony, and the optimal parameters and optimal classification accuracy were obtained at the end of iteration. The algorithm can automatically determine the parameters of the TWSVM in the training process, avoid the blindness of parameter selection, and improve the classification performance of the TWSVM. The characteristics of V-I images are classified by the ITWSVM, the classification results of color V-I images are obtained, and the corresponding electrical equipment classification is completed. Finally, the effectiveness of the proposed method is verified by using the PLAID public data set.

1.1 Continuous V-I Color Image Encoding of Pixel Points

V-I trajectory is a two-dimensional image drawn by a series of voltage and current sampling points in a steady period. For most electrical equipment with different operating principles, the V-I trajectory presents great differences in shape, so various shape parameters (such as area, curvature, number of self-intersection points, and circulation direction) can be extracted from it and used as the basis to distinguish different types of electrical equipment. It is difficult to extract shape parameters, and the parameters after dimensionality reduction cannot fully reflect their original information. Therefore, in the study by Fan et al. (2020), the V-I track is mapped to a binary gray image by using the gridding method while preserving the shape information as much as possible, and the image is used for load identification directly. Compared with the extraction of shape parameters, the process of constructing the binary gray image is simpler. At the same time, the retention of original information is higher, so the accuracy of load identification is further improved.

1.2 Continuous V-I Image Mapping Method

Considering that the V-I image may have pixel discontinuity in the process of mapping, it is not conducive to subsequent training and recognition. Therefore, bilinear interpolation technology is used to improve the traditional mapping method, and the specific process is as follows:

1) A sampling system comprising current clamp, voltage probe, and high-frequency oscilloscope is used to sample the voltage and current waveform of electrical equipment at high frequency, and $M$ is volt-current sampling point $(V_{m}, I_{m}), m = 1, 2, \dots, M$ .

2) Given a grid or image resolution of $N \times N$ , if all sample points are mapped to the grid, then the size of each cell (pixel point) is

{\begin{matrix} Δ i = \frac{i_{m a x} - i_{m i n}}{N} \\ Δ v = \frac{v_{m a x} - v_{m i n}}{N} \end{matrix} . (1)

In Eq. 1, $i_{m a x}$ and $i_{m i n}$ are the minimum and maximum values of the current sampling value, respectively. $v_{m a x}$ and $v_{m i n}$ are the minimum and maximum voltage sampling values, respectively. $Δ i$ and $Δ v$ are the dimensions of each cell (pixel point).

3) According to Eq. 2, the distance between two adjacent sampling points after mapping is $D_{m} (m = 1, 2, \dots, M)$ . If $D_{m} > 1$ , it indicates that the distance between two points is greater than the length or width of the cell, indicating discontinuity. Interpolation is needed to complete the interval between two points. For simplicity, bilinear interpolation technology is adopted to realize the filling of discontinuity points, as shown in Eq. 3 and Eq. 4. After filling, the new set of sample points is denoted as $(v_{j}, i_{j}), j = 1, 2, \dots, J$ .

D_{m} = \sqrt{{(\frac{v_{m + 1} - v_{m}}{Δ v})}^{2} + {(\frac{i_{m + 1} - i_{m}}{Δ i})}^{2}} . (2)

v_{m + k}^{'} = v_{m} + \frac{v_{m + 1} - v_{m}}{K_{m} + 1} k . (3)

i_{m + k}^{'} = i_{m} + \frac{i_{m + 1} - i_{m}}{K_{m} + 1} k . (4)

In the equations, $K_{m} = D_{m}$ is the number of interpolation points to be supplemented between $m$ and $(m + 1)$ sampling points. $(v_{m + k}^{'}, i_{m + k}^{'})$ is the $k$ interpolation point of filling, $k = 1, 2, \dots, K_{m}$ .

4) According to Eq. 5, the mapping coordinates of sample points $(v_{i}, i_{j})$ are calculated as

{\begin{matrix} r_{j} = \frac{i_{j} - i_{m i n}}{Δ i} \\ c_{j} = \frac{v_{j} - v_{m i n}}{Δ v} \end{matrix} . (5)

5) Construct a zero matrix with dimension $N \times N$ , and then take out coordinates of all points one by one from the first sample point, and set the elements of row $r_{j}$ and column $c_{j}$ in the matrix to one until the last sample point. The obtained matrix is the continuous coordinate matrix of pixel points.

According to the abovementioned method, the gray V-I image of a fluorescent lamp device can be obtained, as shown in Figure 1, with a resolution of 32 × 32. The V-I image obtained by the traditional mapping method has the phenomenon of pixel discontinuity, while the image obtained by the method in this study does not have this problem. The results demonstrate the effectiveness of the abovementioned image construction method.

FIGURE 1

FIGURE 1. Binary V-I trajectory mapping for a fluorescent lamp.

1.3 V-I Color Image Coding Method

According to the analysis in Section 2.1, voltage and current signals of all kinds of electrical equipment are normalized in the process of forming V-I images, and only 0 or one state tables are used for each pixel point. As a result, the V-I image only retains the shape characteristics of voltage–current signals but cannot reflect the numerical differences between devices, especially the numerical characteristics of average current, voltage, power, and phase of devices. For example, the average current of the washing machine and air conditioner is bigger, and the average current of the equipment such as incandescent lamp and computer is lesser. When normalized and mapped, all devices have current values of 0 or 1. Obviously, the gray V-I image has lost a lot of valuable information, so it is difficult to improve the accuracy of load recognition by relying on it alone. Therefore, it is necessary to combine the gray V-I image with the numerical feature to enhance the recognition ability of the algorithm by using the complementarity between the two.

As mentioned earlier, the gray V-I image is a single-channel two-dimensional pixel matrix. Each point in the matrix is 0 or 1, which, respectively, represents whether the point has a V-I trajectory, while the size and direction of the trajectory and other information cannot be reflected. In contrast, for a color image, as shown in Figure 2, it is formed by the superposition of three channels: current (R), voltage (G), and phase (B). Each channel corresponds to a two-dimensional matrix, in which each element can change continuously from 0 to 1. Color images contain more information than gray images. If the numerical information can be embedded into the RGB channel in combination with the shape characteristics of the V-I track, the gray V-I image can be transformed into the corresponding color image. Using this image to classify will undoubtedly improve the recognition accuracy of the whole algorithm.

FIGURE 2

FIGURE 2. Image of each channel of a fluorescent lamp and the synthesized V-I color image.

The core of color image coding lies in the formation of $R$ , $G$ , and $B$ matrices. The numerical features such as current, voltage (power), and phase are embedded into the corresponding channels to form a color V-I image.

1) Initialize $N \times N$ dimensional $A$ , $R$ , $G$ ,and $B$ zero matrices.

2) According to $(v_{j}, i_{j})$ and Eq. 5, the frequency of occurrence of each coordinate point $A (r_{j}, c_{j})$ is counted.

3) Construct the current $R$ matrix. Sample points $(v_{j}, i_{j})$ are taken out one by one, and elements in the $R$ matrix are calculated according to Eq. 6. After the calculation, calculate the average value of each coordinate point according to Eq. 7.

R (r_{j}, c_{j}) = R (r_{j}, c_{j}) + f (i_{j}) . (6)

R (r_{j}, c_{j}) = \frac{R (r_{j}, c_{j})}{A (r_{j}, c_{j})} . (7)

In Eq. 8, the equation is used to scale the current signal to ensure that elements change continuously from $0 - 1$ . After scaling, the current difference between different devices can be maintained, and the current information of other devices will not be drowned due to the excessive current of some devices.

f (i_{j}) = \frac{1}{(1, e^{2 ij})} . (8)

4) Construct voltage $G$ matrix. Take out sample points $(v_{j}, i_{j})$ one by one and calculate elements in $G$ matrix according to Eq. 9. After the calculation, calculate the average value of each coordinate point according to Eq. 10.

G (r_{j}, c_{j}) = G (r_{j}, c_{j}) + \frac{v_{j}}{V} . (9)

G (r_{j}, c_{j}) = \frac{G (r_{j}, c_{j})}{A (r_{j}, c_{j})} . (10)

Similarly, $V$ maximizes the difference between the maximum and minimum voltages of all devices. If power characteristics are used, $v_{j}$ and $V$ are replaced by power $p_{j}$ and $P$ , respectively, where $p_{j}$ is calculated by $p_{j} = v_{j} i_{j}$ and $P$ is the maximum power of all devices.

5) Construct the phase $B$ matrix. Sample points $(v_{j}, i_{j})$ were taken out one by one, and elements in $B$ matrix were calculated according to Eq. 11 and Eq. 12. After the calculation, calculate the average value of each coordinate point according to Eq. 12.

B (r_{j}, c_{j}) = B (r_{j}, c_{j}) + \frac{θ_{j}}{2 π} . (11)

θ_{j} = {\begin{matrix} δ_{j}, \\ π + δ_{j}, \\ 2 π + δ_{j}, \end{matrix} \begin{matrix} Δ i_{j} \geq 0, Δ v_{j} \geq 0 \\ Δ i_{j} < 0, Δ v_{j} \in \forall \\ Δ i_{j} \geq 0, Δ v_{j} < 0 \end{matrix} . (12)

B (r_{j}, c_{j}) = \frac{B (r_{j}, c_{j})}{A (r_{j}, c_{j})} . (13)

In the equation, $θ_{j}$ is the phase difference between adjacent points, $δ_{j} = arc \tan (\frac{Δ v_{j}}{Δ i_{j}})$ , $Δ v_{j} = v_{j + 1} - v_{j}$ , and $Δ i_{j} = i_{j + 1} - i_{j}$ , $δ_{j} \in [- \frac{π}{2}, \frac{π}{2}]$ .

After $R$ , $G$ , and $B$ matrices are obtained, color V-I images can be obtained by superposing them. According to the abovementioned method, each channel matrix of fluorescent lamp equipment in Figure 1 is constructed, and the results are shown in Figure 2. In Figure 2, the inconsistent light and shade of pixel points in each channel reflect the size and difference of the value (current, voltage, and phase) represented by the point. Because of this difference, the resultant color image can contain more information. To further illustrate this problem, color V-I images of 11 categories of electrical equipment randomly selected from the PLAID dataset are drawn, as shown in Figure 3. Although most of the devices in the figure have different V-I shapes, a few devices are similar, such as air conditioners, electric fans, hair dryers, and incandescent lamps. After careful observation, it can be found that there are obvious differences in color distribution of V-I images in the abovementioned four types of equipment. This is because the average current, phase, and other numerical characteristics of the four types of equipment are inconsistent, resulting in color difference in the corresponding images.

FIGURE 3

FIGURE 3. Color V-I images of various electrical equipment.

2 Feature Extraction and Dimension Reduction of V-I Color Image

2.1 Two-Dimensional Gabor Wavelet Feature Extraction

In order to extract V-I color image features, this study tries to use two-dimensional Gabor wavelet to extract image texture features and achieve more effective image key texture extraction. Combining with LLE, feature dimension reduction can alleviate feature redundancy and efficiency of high-dimensional feature operation to a certain extent.

As a common tool of image-scale representation and feature analysis, two-dimensional Gabor wavelet can easily realize image scale change. For gray image $z = (v_{j}, i_{j})$ , its filter expression (Li et al., 2019) is as follows:

φ_{α, β} (z) = \frac{‖ k_{α, β} ‖}{σ^{2}} e x p (- \frac{{‖ k_{α, β} ‖}^{2} {‖ z ‖}^{2}}{2 σ^{2}}) \cdot [e x p (i k_{α, β}) - (- \frac{σ^{2}}{2})] . (14)

In Eq. 14, $k_{α, β} = (k_{β} \cos φ_{α}, k_{β} \sin φ_{α})$ represents the fundamental frequency vector; $φ_{α}$ indicates the direction of $k_{α, β}$ ; $k_{β}$ indicates the scale of $k_{α, β}$ , generally $φ_{α} = \frac{π μ}{8}$ , $k_{β} = \frac{k_{m a x}}{f^{β}}$ , $f = \sqrt{2}$ , $k_{m a x} = \frac{π}{2}$ , $α$ and $β$ are the direction and scale parameters of two-dimensional Gabor wavelet transform, respectively; and $σ$ represents the filtering bandwidth. The main function of $\frac{‖ k_{α, β} ‖}{σ^{2}}$ is to compensate the energy attenuation (Wang et al., 2019b) caused by sampling, and $‖ k_{α, β} ‖$ refers to the two-norm operation. Eq. 15 is divided according to the real and imaginary parts (Moosaei et al., 2021).

Re (φ_{α, β} (z)) = \frac{‖ k_{α, β} ‖}{σ^{2}} e x p (- \frac{{‖ k_{α, β} ‖}^{2} {‖ z ‖}^{2}}{2 σ^{2}}) \cdot [\cos (i k_{α, β}) - e x p (- \frac{σ^{2}}{2})] . (15)

lm (φ_{α, β} (z)) = \frac{‖ k_{α, β} ‖}{σ^{2}} e x p (- \frac{{‖ k_{α, β} ‖}^{2} {‖ z ‖}^{2}}{2 σ^{2}}) \cdot [\sin (i k_{α, β})] . (16)

When two-dimensional Gabor wavelet is carried out, in order to obtain comprehensive image data without loss, it is necessary to set the main parameters $α$ , $β$ , and $σ$ of two-dimensional Gabor wavelet reasonably.

2.2 LLE Dimension Reduction

The dimension of the image features obtained by Gabor filtering is high. Considering the problem of feature redundancy and the efficiency of high-dimensional feature operation and storage, it is necessary to effectively reduce the dimension of the image features. The following is a mathematical description of LLE dimension reduction. To achieve dimension reduction for m sample points, suppose that sample $x_{i}$ can be obtained from its adjacent samples $x_{j}$ , $x_{k}$ , and $x_{l}$ through linear operation (Gupta and Gupta, 2021).

x_{i} = ω_{ij} x_{j} + ω_{ik} x_{k} + ω_{il} x_{l} . (17)

In Eq. 17, $ω_{i j}$ , $ω_{i k}$ , and $ω_{i l}$ are the linear coefficients of sample $x_{i}$ and its adjacent samples $x_{j}$ , $x_{k}$ , and $x_{l}$ , respectively.

In the actual operation, multiple adjacent samples can be selected for $x_{i}$ . Let the set comprising $k$ neighbor samples of $x_{i}$ be $Q_{i}$ . In order to maintain the previous linear relationship of sample points after dimensionality reduction, its objective function is Eq. 18 [26].

m i n \sum_{i = 1}^{m} {‖ x_{i} - \sum_{j \in Q_{i}} ω_{ij} x_{j} ‖}^{2} . (18)

Suppose $C_{j k} = {(x_{i} - x_{j})}^{T} (x_{i} - x_{k})$ , then

ω_{ij} = \frac{\sum_{k \in Q_{i}} C_{jk}^{- 1}}{\sum_{l, s \in Q_{i}} C_{ls}^{- 1}} . (19)

In Eq. 19, $C_{l s} = {(x_{i} - x_{l})}^{T} (x_{i} - x_{s})$ .

LLE can keep $ω_{i j}$ unchanged in the dimensionality reduction process, so according to $ω_{i j}$ , the sample set after dimensionality reduction can be solved (Li and Ding, 2019).

m i n \sum_{i = 1}^{m} {‖ z_{i} - \sum_{j \in Q_{i}} ω_{ij} z_{j} ‖}^{2} . (20)

In Eq. 20, $z_{i}$ is the value of $x_{i}$ after dimensionality reduction. By solving the eigenvector corresponding to the eigenvalue of $Z$ , the dimensionality reduction set $Z = [z_{1}, z_{2}, \dots, z_{m}]$ can be obtained.

3 AFSA Optimize TWSVM

3.1 TWSVM

The TWSVM uses two hyperplanes for classification. The number of samples of the two types is $m_{1}$ and $m_{2}$ and the dimension is $n$ . The mathematical representation of the two hyperplanes of the TWSVM is Eq. 21 (Shao et al., 2013).

K (x^{T}, C^{T}) ω^{(1)} + b^{(1)} = 0; K (x^{T}, C^{T}) ω^{(2)} + b^{(2)} = 0. (21)

In Eq. 21, $K$ is the kernel function, $C = [A; B]$ represents all training samples, $ω_{i} (i = 1, 2)$ is the normal vector of the classified hyperplane, and $b_{i} (i = 1, 2)$ is the bias. Let $A \in R^{m_{1} \times n}$ , $B \in R^{m_{2} \times n}$ , $A = {(x_{1}^{(1)}, x_{2}^{(1)}, \dots, x_{m_{1}}^{(1)})}^{T}$ , and $B = {(x_{1}^{(2)}, x_{2}^{(2)}, \dots, x_{m_{1}}^{(2)})}^{T}$ , then the solution of the two hyperplanes can be converted to Eq. 22 and Eq. 23 (Gupta and Gupta, 2021).

TWSVM1:

\min_{ω^{(1)}, b^{(1)}} \frac{1}{2} {(A ω^{(1)} + e_{1} b^{(1)})}^{T} (A ω^{(1)} + e_{1} b^{(1)}) + c_{1} e_{2}^{T} ξ \lim_{x \to \infty} .

s .t . - (B ω^{(1)} + e_{2} b^{(1)}) + ξ \geq e_{2}, ξ \geq 0. (22)

TWSVM2:

\min_{ω^{(2)}, b^{(2)}} \frac{1}{2} {(B ω^{(12)} + e_{2} b^{(2)})}^{T} (B ω^{(2)} + e_{1} b^{(2)}) + c_{2} e_{1}^{T} η .

s .t . - (A ω^{(2)} + e_{1} b^{(2)}) + η \geq e_{1}, η \geq 0. (23)

In the equations, $ξ$ and $η$ are slack variables, $e_{1} = {(1, 1, \dots, 1)}^{T} \in R^{m_{1}}$ , $e_{2} = {(1, 1, \dots, 1)}^{T} \in R^{m_{2}}$ , and $c_{1}$ and $c_{2}$ are the penalty parameters.

The test sample belongs to whichever hyperplane it is near, if $x$ is in the $r$ class, where $r \in {1, 2}$ , such as Eq. 24.

K (x^{T}, C^{T}) ω^{(r)} + b^{(r)} = \min_{i = 1,2} | K (x^{T}, C^{T}) ω^{i} + b^{(r)} | . (24)

3.2 AFSA

The artificial fish swarm algorithm is an optimization algorithm based on swarm intelligence, which is inspired by the behavior of fish. In the AFSA, each AF adjusts its behavior according to its current state and the state of the surrounding environment. During each iteration, the AF updated themselves through four behaviors: foraging, clustering, tailgating, and randomness (Wu et al., 2007).

Suppose $X_{i} = (x_{1}, x_{2}, \dots, x_{n})$ is the current state of the artificial fish $A F_{i}$ , $Y_{i} = f (X_{i})$ is the fitness function of $X_{i}$ , $V i s u a l$ is the field of vision of the artificial fish, $t r y_n u m b e r$ is the maximum number of foraging attempts, $δ$ is the crowding factor, $S t e p$ represents the step length of artificial fish, and $n_{f}$ indicates the number of artificial fish in the field of view. For the artificial fish $A F_{i}$ , a random state $X_{j}$ in its field of vision can be represented by Eq. 25, where it is a random number between 0 and 1. When the update condition is met, $A F_{i}$ updates its status with Eq. 26.

X_{j} = X_{i} + Visual \cdot R a n d () . (25)

X_{i}^{t + 1} = X_{i}^{t} + \frac{X_{j} - X_{i}^{t}}{‖ X_{j} - X_{i}^{t} ‖} Step \cdot R a n d () . (26)

The four behaviors of artificial fish are described as follows:

1) Foraging behavior: $A F_{i}$ randomly selects a state within its field of vision with $X_{j}$ if, in the case of the maxima problem, $Y_{i} < Y_{j}$ (which is $Y_{i} > Y_{j}$ in the case of minima and can be converted between them) moves one step toward $X_{j}$ according to Eq. 26. Otherwise, the state $X_{j}$ is randomly selected again to determine whether the requirement $Y_{i} < Y_{j}$ is met. If the requirements are still not met after $t r y_n u m b e r$ times of repeated attempts, the random behavior is performed.

2) Swarm behavior: Assuming $X_{c}$ is the central position in the field of vision, if $Y_{c} > Y_{i}$ and $\frac{Y_{c}}{n_{f}} > δ \cdot Y_{i}$ , it indicates that the partner center has more food and is not crowded and then moves one step toward $X_{c}$ according to Eq. 26; otherwise, foraging behavior is performed.

3) Rear-end behavior: Assume that $X_{b}$ is the best position found in the field of vision. If $\frac{Y_{b}}{n_{f}} > δ \cdot Y_{i}$ indicates that $X_{b}$ has more food and is not crowded, it moves further in the direction of $X_{b}$ according to Eq. 26; otherwise, foraging behavior is performed.

4) Random behavior: $A F_{i}$ randomly selects a position in its field of vision and then moves one step in that direction, which is a missing behavior of foraging behavior.

These four behaviors switch between each other under different conditions, and the artificial fish will choose the appropriate behavior to find the location of the better solution.

3.3 TWSVM Improved Based on the AFSA

The core idea of the ITWSVM is to find the optimal parameters of the TWSVM through the AFSA. The position $X_{i} = (x_{1}, x_{2}, \dots, x_{n})$ of artificial fish $A F_{i}$ corresponds to a set of parameters of the TWSVM. The position is a vector whose dimension represents the number of parameters. The objective function of the TWSVM is the classification accuracy of the TWSVM, and the best position found by the AFSA is the best parameter of the TWSVM.

The algorithm steps of the ITWSVM are as follows:

1) Initial settings include artificial fish swarm size $N$ , maximum number of iterations $K$ , initial position of each artificial fish, step size $S t e p$ , field of vision $V i s u a l$ , number of attempts $t r y_n u m b e r$ , crowding factor $δ$ , and parameter upper and lower limits of the TWSVM.

2) Taking the position of artificial fish as a parameter, the classification accuracy of the twin support vector machine was calculated, and the classification accuracy was optimized as the objective function. The fitness value of each artificial fish was obtained, and the optimal position of artificial fish in the whole bureau was recorded.

3) Each artificial fish performed swarm and tail chasing behaviors and judged whether the individual had improved. If it is improved, a better behavior is selected; otherwise, the foraging behavior is performed.

4) Perform the action of artificial fish selection and update the position of each artificial fish.

5) Update the status of the globally optimal artificial fish.

6) Judge whether the maximum number of iterations is reached. If so, output the optimal solution and the corresponding parameter combination; otherwise, increase the number of iterations by one and jump to two).

The flow chart of the ITWSVM is shown in Figure 4, through which the process of the algorithm proposed in this study can be seen intuitively.

FIGURE 4

FIGURE 4. Flow chart of the AFSA–TWSVM algorithm.

3.4 Experimental Results and Analysis

In order to prove the effectiveness of the ITWSVM proposed in this study after dimension reduction by LLE, twin support vector machines based on artificial fish swarm algorithm (AFSA–TWSVM) (Li and Ding, 2019) without LLE dimension reduction, twin support vector machines based on particle swarm optimization (PSO–TWSVM) (Shao et al., 2013), twin support vector machines based on fruit fly optimization algorithm (FOA–TWSVM) (Ding et al., 2016), twin support vector machines based on genetic algorithm (GA–TWSVM) (Wang et al., 2013), and twin support vector machines based on glowworm swarm optimization algorithm (GSO–TWSVM) (Ding et al., 2017) were selected as comparative experiments to compare. Considering the rapid development of deep learning in recent years and the excellent performance of the convolutional neural network (CNN) in image classification, this study takes it as a comparative experiment. The V-I color images in section 2 were input into the VGGNet-16 model (Simonyan and Zisserman, 2014) and Fast R-CNN model (Girshick, 2015) for training, and the results were compared.

3.5 The Data Set

The PLAID common data set is used to test the load identification algorithm. The data set comprises voltage and current operation sampling data of 235 electrical equipment in 11 categories of 55 households, with a sampling frequency of 30 kHz and a total sample number of 1074 groups. Considering that the large difference in sample numbers of various electrical equipment in the PLAID data set, it is easy to lead to poor recognition effect of some equipment. For this reason, the synthetic minority over-sampling technique (SMOTE) was used to synthesize and expand a few samples, and the number of expanded samples was 1925. A total of 220 samples (20 samples for each type of equipment) were randomly selected to form the test set, and the remaining 1705 samples were used as the training set.

3.6 Evaluation Functions

For binary classification problems, receiver operating characteristic (ROC) and confusion matrix are important performance indicators for classifier comparison (Zhu and Tang, 2004). Four basic indicators of confusion matrix can be obtained by using the classification results of the test set: true positive (TP), false positive (FP), false negative (FN), and true negative (TN). Based on the abovementioned four basic indicators, the harmonic mean F1 score of accuracy, precision, and recall can be calculated using Eqs. 27,28,29, and30.

Accuracy = \frac{TP + TN}{TP + FP + TN + FN} \times 100 % . (27)

Precision = \frac{TP}{TP + FP} \times 100 % . (28)

Recall = \frac{TP}{TP + FN} \times 100 % . (29)

F 1 score = \frac{2 \times Precision \times Recall}{Precision + Recall} \times 100 % . (30)

The ordinate of the ROC curve is true positive rate (TPR), which is the true positive rate, and the ordinate is false positive rate (FPR), which is the false positive rate. TPR and FPR can be obtained by using the basic indicators in the confusion matrix, such as Eq. 31 and Eq. 32.

TPR = \frac{TP}{TP + FN} \times 100 % . (31)

FPR = \frac{FP}{FP + TN} \times 100 % . (32)

3.7 Analysis

In order to ensure the fairness of algorithm comparison, V-I color track images drawn by the PLAID dataset are used as samples. The maximum number of iterations is 100 because the running period of each intelligent optimization algorithm is different. Each algorithm can only be tested three times, and $(c_{1}, c_{2}, σ)$ of the optimal fitness is taken as the optimal parameter of the TWSVM. Table 1 shows the optimal parameter $(c_{1}, c_{2}, σ)$ found by different algorithms and the optimal fitness value corresponding to the optimal parameter. Figure 5 shows the curve of LLE + ITWSVM, ITWSVM, PSO-TWSVM, GA-TWSVM, GSO-TWSVM, and FOA-TWSVM parameter optimization process. In the experiment, the optimal parameter $(c_{1}, c_{2}, σ)$ obtained by parameter optimization was used as the final TWSVM parameter, and the TWSVM model was established on all the training sets for testing.

TABLE 1

TABLE 1. Comparison of optimization results of different algorithms.

FIGURE 5

FIGURE 5. Optimization process curve of different algorithm parameters.

Figure 6 shows the ROC curves of eleven algorithms. AUC (area under curve) is defined as the area enclosed by the ROC curve and the coordinate axis. AUC provides a digital basis for performance comparison of classification algorithms. To calculate the AUC, only the area under the ROC curve needs to be obtained. Table 2 shows the comparison of the eight algorithms in the four performance indicators of accuracy, F1 score, AUC, and algorithm running time (the running time of TWSVM algorithms includes parameter optimization time) in the test set.

FIGURE 6

FIGURE 6. ROC curves of different algorithms.

TABLE 2

TABLE 2. Comparison of performance indexes of different algorithms.

According to Table 2 and Figure 6, LLE + ITWSVM proposed in this study can find the optimal parameters of the TWSVM faster with fewer iterations. According to Table 2, the proposed LLE + ITWSVM achieves optimal results in the three performance indicators of accuracy, F1 score, and AUC. Meanwhile, the proposed algorithm has the shortest running time and the best real-time performance. It can also be seen from Table 2 that although the Fast R-CNN algorithm achieves a good classification effect, the algorithm is time-consuming because it uses the selective search method to extract candidate regions and there are many redundant operations. All performance indicators of the VGGNET-16 deep convolutional neural network are close to that of the ITWSVM algorithm. In addition, the VGGNET-16 algorithm runs for a long time due to the large amount of computation in the convolutional layer of VGGNET-16 and the larger number of parameters compared with the LLE + ITWSVM algorithm.

Among the six algorithms based on the TWSVM, the proposed LLE + ITWSVM algorithm has the shortest operation time and the best classification performance index. This is mainly due to the dimensionality reduction of V-I color image by LLE, and the calculation of the algorithm after dimensionality reduction is greatly reduced. In addition, the AFSA improved by the TWSVM can jump out of the local optimal solution at a faster speed and find the global optimal parameters suitable for the TWSVM. It solves the problems of TWSVM parameter selection difficulty and parameter optimization algorithm time-consuming in image recognition.

4 Conclusion

Due to the lack of load information of the V-I track in the traditional power load identification process, some power load features overlap and it is difficult to perform equipment identification. The identification model training time is too long. This study presents an intelligent sensing method of power load based on color coding and improved TWSVM. In this method, continuous V-I pixels are realized by bilinear interpolation technology, and the numerical characteristics such as current, voltage, and phase are embedded into the V-I trajectory in the form of different channels so as to form a high-resolution color V-I image. The two-dimensional Gabor wavelet is used to extract image features, and the feature vectors obtained by LLE dimensionality reduction are used for recognition. In addition, an image recognition method based on the AFSA–TWSVM is proposed. This method uses the AFSA algorithm to find the optimal parameters of the TWSVM, which improves the convergence speed and recognition rate of the TWSVM algorithm and overcomes the shortcomings of previous optimization algorithms such as slow convergence speed and easy to fall into local optimal. It provides a new and effective method for the application of TWSVM in V-I color image recognition. Compared with some advanced algorithms, the accuracy of load identification and the speed of model training can be significantly improved by the proposed method, which proves the superiority of the proposed method.

The identification effect of the proposed method for multistate loads needs to be further improved, and a more advanced identification model needs to be built. At the same time, the practical application is still faced with the lack of the domestic data set, high cost of high-frequency sampling, and low universality of the recognition model. Therefore, NILM technology needs further research.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Author Contributions

Conceive, experiment, and write articles.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fenrg.2022.906458/full#supplementary-material

References

Chen, T., Gao, T., and Zhao, X. M. (2019). Single Sample Description Based on Gabor Fusion. IET image process 13 (14), 2840–2849. doi:10.1049/iet-ipr.2018.6665

CrossRef Full Text | Google Scholar

Cui, L. J., Sun, Y., and Liu, Y. X. (2020). Non-intrusive Load Disaggregation Method Considering Time-Phased State Behavior. Automation Electr. Power Syst. 44 (5), 215–222.

Google Scholar

Deng, X., Zhang, G. Q., and Wei, Q. L. (2020). A Survey on the Non-intrusive Load Monitoring. Acta Autom. Sin. 47 (2), 1–21.

Google Scholar

Ding, S., An, Y., Zhang, X., Wu, F., and Xue, Y. (2017). Wavelet Twin Support Vector Machines Based on Glowworm Swarm Optimization. Neurocomputing 225, 157–163. doi:10.1016/j.neucom.2016.11.026

CrossRef Full Text | Google Scholar

Ding, S., Zhang, X., and Yu, J. (2016). Twin Support Vector Machines Based on Fruit Fly Optimization Algorithm. Int. J. Mach. Learn. Cyber. 7 (2), 193–203. doi:10.1007/s13042-015-0424-8

CrossRef Full Text | Google Scholar

Du, L., He, D., Harley, R. G., and Habetler, T. G. (2016). Electric Load Classification by Binary Voltage-Current Trajectory Mapping. IEEE Trans. Smart Grid 7 (1), 358–365. doi:10.1109/tsg.2015.2442225

CrossRef Full Text | Google Scholar

Fan, L. Y., Hu, C. Z., and Chen, S. Y. (2020). Dimensionality Reduction of Image Feature Based on Geometric Parameter Adaptive LLE Algorithm. Ifs 38 (2), 1569–1577. doi:10.3233/jifs-179520

CrossRef Full Text | Google Scholar

Gao, J., Kara, E. C., and Giri, S. (2016). “A Feasibility Study of Automated Plug-Load Identification from High-Frequency Measurements,” in IEEE Global Conference on Signal ＆ Information Processing Piscataway, New Jersey, USA, 220–224.

Google Scholar

Girshick, R. (2015). “Fast R-CNN,” in IEEE International Conference on Computer Vision. doi:10.1109/iccv.2015.169

CrossRef Full Text | Google Scholar

Guo, H. X., Liu, J. W., and Yang, P. (2021). Review on Key Techniques of Non-intrusive Load Monitoring. Electr. Power Autom. Equip. 41 (1), 135–144.

Google Scholar

Gupta, U., and Gupta, D. (2021). Regularized Based Implicit Lagrangian Twin Extreme Learning Machine in Primal for Pattern Classification. Int. J. Mach. Learn. Cyber. 12 (5), 1311–1342. doi:10.1007/s13042-020-01235-y

CrossRef Full Text | Google Scholar

Li, C., Liang, G. Q., Zhao, G., and Chen, G. (2021). A Demand-Side Load Event Detection Algorithm Based on Wide-Deep Neural Networks and Randomized Sparse Backpropagation. Front. Energy Res. 9, 720831. doi:10.3389/fenrg.2021.720831

CrossRef Full Text | Google Scholar

Li, C. R., Huang, Y. Y., and Xue, Y. (2019). Dependence Structure of Gabor Wavelets Based on Copula for Face Recognition. Expert Syst. Appl. 137, 453–470. doi:10.1016/j.eswa.2019.05.034

CrossRef Full Text | Google Scholar

Li, J. C., and Ding, S. F. (2019). Twin Support Vector Machines Based on Artificial Fish Swarm Algorithm. J. Intelligent Syst. 14 (6), 1121–1126.

Google Scholar

Liu, S., Liu, Y., and Gao, S. (2020). Non-intrusive Load Monitoring Method Based on PCA-ILP Considering Multi-Feature Objective Function. Electr. Power Constr. 41 (8), 1–8.

Google Scholar

Liu, Y., Wang, J. R., Deng, P. X., Sheng, W., and Tan, P. (2021). Non-Intrusive Load Monitoring Based on Unsupervised Optimization Enhanced Neural Network Deep Learning. Front. Energy Res. 9, 718916. doi:10.3389/fenrg.2021.718916

CrossRef Full Text | Google Scholar

Moosaei, H., Ketabchi, S., Razzaghi, M., and Tanveer, M. (2021). Generalized Twin Support Vector Machines. Neural Process Lett. 53 (2), 1545–1564. doi:10.1007/s11063-021-10464-3

CrossRef Full Text | Google Scholar

Niu, H., Quan, C., and Tay, C. J. (2009). Phase Retrieval of Speckle Fringe Pattern with Carriers Using 2D Wavelet Transform. Opt. Lasers Eng. 47 (12), 1334–1339. doi:10.1016/j.optlaseng.2008.10.005

CrossRef Full Text | Google Scholar

Shao, Y., Wang, Z., Chen, W., and Deng, N.-Y. (2013). Least Squares Twin Parametric-Margin Support Vector Machine for Classification. Appl. Intell. 39 (3), 451–464. doi:10.1007/s10489-013-0423-y

CrossRef Full Text | Google Scholar

Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. Available at https://www.arXiv.com/1409-1666.

Google Scholar

Sun, Y., Cui, Can., and Lu, Jun. (2017). Non-intrusive Load Monitoring Method Based on Delta Feature Extraction and Fuzzy Clustering. Automation Electr. Power Syst. 41 (4), 86–91.

Google Scholar

Sun, Y., Li, H. Y., and Liu, Y. X. (2020). Non-intrusive Home-Load Identification Based on Improved Hidden Markov Model. Electr. Power Constr. 41 (4), 73–80.

Google Scholar

Tu, J., Zhou, M., and Song, X. F. (2018). Comparison of Supervised Learning-Based Non-intrusive Load Monitoring Algorithms. Electr. Power Autom. Equip. 38 (12), 128–134.

Google Scholar

Wang, J. S., Ruan, Y. L., and Zheng, B. W. (2019). Face Recognition Method Based on Improved Gabor Wavelet Transform Algorithm. IAENG Int. J. Comput. Sci. 46 (1), 12–24.

Google Scholar

Wang, J., Wong, R. K., and Lee, T. C. M. (2019). Locally Linear Embedding with Additive Noise. Pattern Recognit. Lett. 123, 47–52. doi:10.1016/j.patrec.2019.02.030

CrossRef Full Text | Google Scholar

Wang, P., and Liu, M. (2019). Day-ahead Dispatching Optimization of Active Distribution Network Considering Demand Response. Sci. Technol. Eng. 19 (28), 152–158.

Google Scholar

Wang, Z., Shao, Y., and Wu, T. (2013). A GA-based Model Selection for Smooth Twin Parametric-Margin Support Vector Machine. Pattern Recognit. 46 (8), 2267–2277. doi:10.1016/j.patcog.2013.01.023

CrossRef Full Text | Google Scholar

Wei, H., Hu, C. Z., and Chen, S. Y. (2020). Establishing a Software Defect Prediction Model via Effective Dimension Reduction. Inf. Sci. 477, 399–409.

Google Scholar

Wu, J., Liu, J., and Song, G. T. (2007). Artificial Fish Swarm Algorithm Suitable to Transmission Network Planning. Power Syst. Technol. 31 (8), 63–67.

Google Scholar

Wu, X., Jiao, D., and Gao, Y. C. (2020). Construction of Adaptive Feature Library and Load Identification Based on Decomposition of Non-intrusive Power Consumption Data. Automation Electr. Power Syst. 44 (4), 101–109.

Google Scholar

Xiang, Y. Z., Ding, Y. F., Luo, Q., Wang, P., Li, Q., Liu, H., et al. (2022). Non-Invasive Load Identification Algorithm Based on Color Coding and Feature Fusion of Power and Current. Front. Energy Res. 10, 899669. doi:10.3389/fenrg.2022.899669

CrossRef Full Text | Google Scholar

Zhang, T. Y., Deng, C. Y., and Liu, Y. K. (2020). Non-intrusive Load Identification Algorithm Based on Convolution Neural Network. Power Syst. Tech. 44 (6), 2038–2044.

Google Scholar

Zhou, M., Song, X. F., and Tu, J. (2018). Residential Electricity Consumption Behavior Analysis Based on Non-intrusive Load Monitoring. Power Syst. Technol. 42 (10), 3268–3276.

Google Scholar

Zhu, H., and Tang, X. L. (2004). Classifier Geometrical Characteristic Comparison and its Application in Classifier Selection. PATTERN Recognit. Lett. 26 (6), 829–842.

Google Scholar

Keywords: nonintrusive load monitoring ₁, V-I trajectory ₂, color encoding ₃, two-dimensional Gabor wavelet ₄, local linear embedding ₅, artificial fish swarm algorithm ₆, win support vector machine ₇

Citation: Zhang R, Wang Y and Song Y (2022) Nonintrusive Load Monitoring Method Based on Color Encoding and Improved Twin Support Vector Machine. Front. Energy Res. 10:906458. doi: 10.3389/fenrg.2022.906458

Received: 28 March 2022; Accepted: 22 June 2022;
Published: 22 July 2022.

Edited by:

Bo Yang, Kunming University of Science and Technology, China

Reviewed by:

Jieming Ma, Xi’an Jiaotong-Liverpool University, China
Puyu Wang, Nanjing University of Science and Technology, China

Copyright © 2022 Zhang, Wang and Song. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ruoyuan Zhang, MzE0MDU2MzI1QHFxLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.