• CSCD核心库收录期刊
  • 中文核心期刊
  • 中国科技核心期刊

Electric Power Construction ›› 2020, Vol. 41 ›› Issue (1): 32-38.doi: 10.3969/j.issn.1000-7229.2020.01.004

Previous Articles     Next Articles

Missing Load Data Completion Based on Curve Similarity and Low-Rank Matrix

QIAO Wenyu1, LI Ye2, LIU Haoyu2, LI Yang3, YANG Ting3   

  1. 1. China Electric Power Research Institute, Beijing 100192, China;2. Electric Power Research Institute of State Grid Tianjin Electric Power Corporation, Tianjin 300384, China;3. School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
  • Online:2020-01-01
  • Supported by:
    This work is supported by National Key Research and Development Program of China (No. 2017YFE0132100) , National Natural Science Foundation of China (No. 61571324) and State Grid Corporation of China Research Program (No. KJ18-1-39).

Abstract: Many advanced applications in the power grid are based on load data. However, in the process of data collection, transmission and storage, there will inevitably meet with data missing, which may bring bad effect to data analysis. In allusion to the above problem, this paper proposes a missing data completion strategy combining load curve similarity and singular value threshold algorithm. Firstly, through analyzing the distribution of singular values in electric data matrix, a low rank characteristic is revealed. On the basis of that, considering the differences between power consumption curves, a data recovery method of pre-filling, curve similarity classification and secondary filling is proposed. After the pre-filling, for each power curve to be recovered, the k curves which are most similar to it are found on the basis of the improved similarity index to form the data matrix, and then the low-rank matrix completion is applied again. Through the above steps, the missing recovery is supposed to be improved. The load data from the power grid in north China is used for test, and the method proposed is compared with the classical interpolation method to verify the effectiveness. Results prove that the combination of load curve similarity and singular value thresholding (SVT) can effectively complete the missing data and provide strong support for load data repair.

Key words: load data, missing data, curve similarity, low-rank matrix completion

CLC Number: