Skip to main content

ORIGINAL RESEARCH article

Front. Energy Res.
Sec. Energy Efficiency
Volume 12 - 2024 | doi: 10.3389/fenrg.2024.1500548

A Novel Method for Power Transformer Fault Diagnosis Considering Imbalanced Data Samples

Provisionally accepted
Jun Chen Jun Chen 1Yong Wang Yong Wang 1Lingming Kong Lingming Kong 1Yilong Chen Yilong Chen 1Mianzhi Chen Mianzhi Chen 1Qian Cai Qian Cai 2*Gehao Sheng Gehao Sheng 2
  • 1 Guangzhou Power Supply Bureau, Guangdong Power Grid Co., LTD., Guangdong, China
  • 2 Shanghai Jiao Tong University, Shanghai, China

The final, formatted version of the article will be published soon.

    Power transformer fault diagnosis methods based on machine learning struggle with the im-balanced distribution of fault cases across different categories, which may lead to decreased diagnosis accuracy. To improve the accuracy and operation efficiency of the power transformer fault diagnosis model, this paper constructed a fault diagnosis model that integrates neighborhood component analysis (NCA) and k-nearest neighbor (KNN) learning, with the introduction of correction factors. Firstly, a correction factor was introduced into the objective function of the NCA algorithm to mitigate the in-fluence of sample imbalance on model training. The sample parameter correlation quantization matrix was obtained by combining oil chromatography fault data through the association rule, which would be used as the initial value for the NCA algorithm's training metric matrix. The metric matrix obtained from the training was then used to perform a mapping transformation on the input data of the KNN classifier to reduce the distance between samples of the same type, thereby improving the KNN classification performance. Finally, the Bayesian optimization algorithm was employed to tune the hyperparameters of the model to obtain the model parameter set that can achieve the highest accuracy of the test set. Through the analysis of the transformer fault case library, the model proposed in this paper saved nearly half the time compared with traditional machine learning diagnosis models. Furthermore, the diagnosis accuracy of the minority sample classes was improved by at least 15% compared with other models.

    Keywords: Fault diagnosis, transformer, K-nearest neighbor learning, machine learning, Imbalanced samples

    Received: 23 Sep 2024; Accepted: 04 Dec 2024.

    Copyright: © 2024 Chen, Wang, Kong, Chen, Chen, Cai and Sheng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Qian Cai, Shanghai Jiao Tong University, Shanghai, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.