• CSCD核心库收录期刊
  • 中文核心期刊
  • 中国科技核心期刊

ELECTRIC POWER CONSTRUCTION ›› 2021, Vol. 42 ›› Issue (5): 9-15.doi: 10.12204/j.issn.1000-7229.2021.05.002

• Key Technologies and Applications of Artificial Intelligence in Internet of Energy·Hosted by Associate Professor LIU Youbo, Associate Professor HU Wei, Dean WANG Yingxin and Senior Engineer GU Yujia· • Previous Articles     Next Articles

An Abnormal Data Identification Method Based on Improved Generative Adversarial Network

CHEN Jie1, ZHANG Haotian2, TANG Yi3   

  1. 1. State Grid Liyang County Electric Power Supply Company, Liyang 213300, Jiangsu Province, China
    2. School of Cyber Science and Engineering, Southeast University, Nanjing 210096, China
    3. School of Electrical Engineering, Southeast University, Nanjing 210096, China
  • Received:2020-08-09 Online:2021-05-01 Published:2021-05-06
  • Contact: ZHANG Haotian
  • Supported by:
    National Key Research and Development Program of China(2018YFB0904501)

Abstract:

The identification method based on data driving for identifying abnormal data in power grid has become the focus of research in the field of power grid security. Due to the small number of abnormal data in the actual statistical data of power generation, it is extremely difficult to identify abnormal data through data mining. This paper proposes an improved generative adversarial network and isolated forests based abnormal data identification method. Firstly, by using WGAN alternating training generator and discriminator, the distribution characteristics of power generation statistical data are learned and samples are generated, which will generate abnormal samples to enhance the original abnormal samples. According to the accuracy of outlier data identification, the expansion ratio of outlier samples is determined. Then the isolated forest algorithm is used to identify the abnormal data on the expanded balanced dataset. Finally, an evaluation index system consisting of accuracy, recall and precision is selected to evaluate and compare the identification effects of models before and after category equalization. The results show that the proposed method can effectively improve the classification preference of the identification model for most classes and improve the overall identification accuracy.

Key words: generative adversarial networks, Wasserstein distance, sample generating, unbanced data, abnormal data identification

CLC Number: