PDF(1822 KB)
基于集成图卷积变分变换器的电力负荷数据补全方法
严莉, 呼海林, 史磊, 吴钦政, 吕天光, 徐英东, 张闻彬, 王高洲
电力建设 ›› 2025, Vol. 46 ›› Issue (4) : 49-57.
PDF(1822 KB)
PDF(1822 KB)
基于集成图卷积变分变换器的电力负荷数据补全方法
Power Load Data Completion Method Based on Integrated Graph Convolutional Variational Transformer
【目的】随着电力系统的发展和能源系统规模的不断扩大,产生了海量的负荷功率数据,但在数据采集、传输中不可避免会发生数据缺失现象,这极大制约了系统协调优化和高级数据应用的发展。【方法】为此,提出了一种基于集成图卷积变分变换器(integrated graph convolutional variational transformer, IGCVT)网络的新型电力负荷缺失数据补全模型。IGCVT模型将改进的图卷积网络(graph convolutional network, GCN)和Transformer模型利用变分自编码器(variational auto-encoder, VAE)的架构进行聚合。通过GCN对原始数据进行处理,以学习空间特征并深度挖掘空间依赖关系;利用VAE对隐藏层数据进行重构,更为有效地还原数据的分布特性;基于Transformer模型对序列时间自相关信息进行挖掘。此外,引入了改进的鲸鱼优化算法(whale optimization algorithm, WOA)以优化网络模型超参数,以提高补全精度和模型的适用性。同时,针对电力负荷数据极端变化点补全误差较大的问题,采用了数据双向补全方法,充分利用缺失点前后的数据信息。【结果】实验结果表明,与基准模型相比,均方根误差(root mean square error, RMSE)指标分别提升了24.3%、44.0%和47.9%,验证了所提方法的优越性。【结论】文章所提方法为解决电力负荷数据缺失问题提供了可行的解决方案,并有望进一步扩展该模型的应用范围。
[Objective] With the development of power systems and continuous expansion of energy systems, massive load power data have been generated. However, missing data are inevitable in the collection and transmission of power data, which greatly restricts the development of system-coordination optimization and advanced data applications. [Methods] To this end, this paper proposes a new power load missing data completion model based on an integrated graph convolutional variational transformer (IGCVT) network. The IGCVT model aggregates an improved graph convolutional network (GCN) and Transformer model using the variational auto-encoder (VAE) architecture. The raw data are processed by the GCN to learn spatial features and deeply mine spatial dependencies; the hidden layer data are reconstructed by the VAE to more effectively restore data distribution characteristics; and the temporal autocorrelation information of the sequence is mined based on the Transformer model. In addition, an improved whale optimization algorithm (WOA) is introduced to optimize the network model hyperparameters and improve the completion accuracy and applicability of the model. Simultaneously, to solve the problem of large errors in the completion of extreme change points of power load data, a two-way data completion method is adopted to make full use of the data information before and after the missing points. [Results] Experimental results show that, compared with the baseline model, the RMSE index is improved by 24.3%, 44.0%, and 47.9%, which verifies the superiority of the proposed method. [Conclusions] The results show that the proposed method provides a feasible solution to the problem of missing power load data and is expected to further expand the application scope of the model.
数据补全 / 图卷积网络 / Transformer模型 / 电力负荷数据
data imputation / graph convolutional networks / transformer model / power load data
| [1] |
杨玉莲, 齐林海, 王红, 等. 基于生成对抗和双重语义感知的配电网量测数据缺失重构[J]. 电力系统自动化, 2020, 44(18): 46-54.
|
| [2] |
张东英, 李伟花, 刘燕华, 等. 风电场有功功率异常运行数据重构方法[J]. 电力系统自动化, 2014, 38(5): 14-18, 24.
|
| [3] |
张科鑫, 窦晓波, 李炜祺, 等. 基于量测数据补全的有源配电网电压优化技术[J]. 电力自动化设备, 2023, 43(11): 67-74.
|
| [4] |
李培冠, 於志勇, 黄昉菀. 基于稀疏表示的电力负荷数据补全[J]. 计算机科学, 2021, 48(2): 128-133.
数据缺失在电力负荷数据采集过程中经常发生,对提高算法的预测精确度带来了不利影响。现有的缺失数据补全算法只适用于缺失数据量较少的情况,而对于缺失数据较多的情况表现不佳。面对严重数据缺失的挑战,文中提出了一种基于稀疏表示的电力负荷缺失数据补全方法。首先以数据随机缺失为前提,将训练数据中假定缺失后的数据与完整的训练数据上下拼接构成训练矩阵;其次,利用离散余弦变换(Discrete Cosine Transform,DCT)生成一个过完备字典,并根据训练矩阵对其进行学习,旨在通过调优得到一个合适的字典,能对训练矩阵中的样本进行最好的稀疏表示。最后,在测试阶段,先利用学习后字典的上半部分获得测试集缺失数据的稀疏表示,然后利用稀疏表示和学习后字典的下半部分重构出无缺失的完整数据。实验结果表明,使用该方法对电力负荷数据缺失值进行补全,可以获得比传统插值方法、基于相关性的KNN算法、时空压缩感知估计算法以及时序压缩感知预测算法更高的精度。即使数据缺失率高达95%,该方法依然可以有效地补全缺失数据。
Data loss often occurs in the process of power load data collection,which adversely affects the accuracy of algorithm prediction.The existing missing data completion algorithm is only suitable for the case with less missing data,but performs poorly for the case with more missing data.Faced with the challenge of severe data loss,a method for power load missing data completion based on sparse representation is proposed.First of all,we assume that the data is randomly missing,and stitch the assumed missing data in the training data and the complete training data to form a training matrix.Secondly,an over-complete dictionary is generated by discrete cosine transform (DCT),and is learned according to the training matrix,aims to obtain a suitable dictionary for the best sparse representations of the samples in the training matrix.Finally,in the test phase,the upper part of the learned dictionary is used to obtain sparse representations of the missing data in the test set,and then the sparse representations and the lower part of the learned dictionary are used to reconstruct the complete data without missing.Experimental results show that using this method to complete missing values of power load data can achieve higher accuracy than traditional interpolation me-thods,correlation-based KNN algorithm,spatiotemporal compressed sensing estimation algorithm and time-series compressed sen-sing prediction algorithm.Even if the data miss rate is as high as 95%,this method can still effectively complete the missing data.
|
| [5] |
刘佳星, 张宏烈, 刘艳菊, 等. 基于缺失率的不完整数据填补算法[J]. 统计与决策, 2021, 37(2): 39-41.
|
| [6] |
|
| [7] |
崔阳阳, 赵洪山, 曲岳晗, 等. 基于残差U型网络的低压台区电力缺失数据补全方法[J]. 电力系统自动化, 2022, 46(9): 83-90.
|
| [8] |
|
| [9] |
费思源. 大数据技术在配电网中的应用综述[J]. 中国电机工程学报, 2018, 38(1): 85-96, 345.
|
| [10] |
赵龙, 杨波, 卢志鹏, 等. 基于谱聚类和AM-LSTM的分布式光伏集群超短期预测方法[J]. 供用电, 2023, 40(7): 10-17.
|
| [11] |
慕福林, 陈静, 骆凯波, 等. 电力营销缺失数据补全研究[J]. 电气应用, 2015, 34(S1): 147-150, 154.
|
| [12] |
何宁辉, 吴旭涛, 沙伟燕, 等. 电力变压器油中溶解气体在线监测数据修复方法[J]. 高压电器, 2024, 60(11): 37-48.
|
| [13] |
陈晓华, 吴杰康, 蔡锦健, 等. 基于猎人猎物优化算法优化BiLSTM的电力负荷短期预测[J]. 山东电力技术, 2024, 51(4): 64-71.
|
| [14] |
高盛, 许沛华, 陈正洪, 等. 基于改进LSTM神经网络的风电功率短期预报算法[J]. 南方能源建设, 2024, 11(1): 112-121.
|
| [15] |
申洪涛, 李飞, 史轮, 等. 基于气象数据降维与混合深度学习的短期电力负荷预测[J]. 电力建设, 2024, 45(1): 13-21.
精确的短期负荷预测能通过预知所需用电负荷来指导电网的调度运行。但电力负荷不仅与用户用电习惯相关,还容易受到温度、湿度等气象因素的影响。因此,文章在现有的使用负荷历史数据基础上,增加了影响区域型电力负荷的气象数据,并考虑高维气象参量数据对预测算法的过拟合问题,提出了基于稀疏核主成分分析(sparse kernel principal component analysis, SKPCA)的高维气象数据降维方法。进而以历史负荷功率和SKPCA重构后主成分为输入,构建了基于卷积神经网络(convolutional neural network, CNN)和长短时记忆(long-short-term memory, LSTM)神经网络的混合深度学习预测模型。CNN-LSTM模型可以同时提取负荷功率及气象数据的空间和时间相关特征,从而全面利用数据的时-空相关性特征,提升负荷功率的短期预测精度。相比于常见的数据降维和负荷预测方法,文章所提方法的数据维度降低了71.43%,预测精度达到了98.92%。结果表明,所提算法融合通过SKPCA降维后的气象数据能够显著提升区域型电力短期负荷预测准确度。
Accurate short-term load forecasting can provide guidance for the dispatching operation of power grids by predicting the required power loads. However, the power load is not only related to the user’s electricity consumption habits but is also easily affected by meteorological factors such as temperature and humidity, Therefore, based on existing historical load data, this paper incorporates meteorological data that affect regional power loads, considers the overfitting problem of high-dimensional meteorological parameter data to the prediction algorithm, and proposes a dimensionality reduction method for high-dimensional meteorological data based on sparse kernel principal component analysis (SKPCA). Subsequently, taking the historical load power and principal components reconstructed by SKPCA as the input, we construct a hybrid deep learning prediction model based on a convolutional neural network (CNN) and a long short-term memory (LSTM) neural network. The CNN-LSTM model can extract the spatial and temporal correlation characteristics of the load power and meteorological data simultaneously to fully utilize the temporal-spatial correlation characteristics of the data and improve the short-term prediction accuracy of the load power. Compared with common methods of data dimension reduction and load forecasting, the data dimensions of this method decrease by 71.43%, and the prediction accuracy reaches 98.92%. The results show that the proposed algorithm can significantly improve the accuracy of regional power short-term load forecasting by fusing meteorological data after dimensionality reduction using SKPCA. |
| [16] |
宋铁维, 施伟锋, 毕宗. 基于ST-SSIM的电力系统缺失数据重建方法[J]. 电力建设, 2022, 43(7): 103-112.
电力系统数据采集、测量、传输和存储等过程均可能出现数据缺失问题,威胁电网安全。针对传统电力系统缺失数据重建方法仅考虑数据分布规律,忽略了数据时序与空间特性的问题,提出一种考虑时空特性的电力系统缺失数据重建模型(spatial-temporal seq2seq imputation model, ST-SSIM)。ST-SSIM具备编码-解码结构,编码器由基于图卷积层与长短时记忆单元构造的时空信息提取单元组成,用于提取数据高维时空特征,解码器由长短时记忆单元与全连接层组成,用于解码高维特征,生成电力系统数据。所提模型的输入包括电力系统数据时间序列与电网拓扑邻接矩阵,因此ST-SSIM可实现对电力系统数据复杂时空关系的自动学习。算例中,将所提方法与现有方法在不同规模电网下比较,ST-SSIM具有最高的重建精度,证明了ST-SSIM能有效地学习到电力系统数据的时空特性。通过讨论重建误差与数据缺失节点数以及缺失时间跨度的关系,验证了所提模型重建效果较稳定。
Power system data may miss during acquisition, measurement, transmission and storage, which threatens the security of power grid. Since traditional missing data reconstruction methods only consider the data distribution and ignore the spatio-temporal characteristics, a power system missing data reconstruction model called ST-SSIM (spatio-temporal seq2seq imputation model) is proposed in this paper. ST-SSIM has an encoder-decoder structure. The encoder is composed of a spatio-temporal information extraction unit which is constructed by graph convolution layer and long short-term memory cell. The decoder is composed of long short-term memory cell and full connection layer. The input of the proposed model includes power system timeseries and adjacency matrix, so ST-SSIM can realize the automatic learning of complex time-space relationship of data. In experiment, compared the proposed method with the existing methods in power grids of different scales, ST-SSIM has the highest reconstruction accuracy, which proves that ST-SSIM can effectively learn the spatio-temporal characteristics of power system data. By discussing the relationship between reconstruction error and the number of missing nodes and time span, it is verified that the reconstruction effect of the proposed model is stable. |
| [17] |
徐任超, 阎威武, 王国良, 等. 基于周期性建模的时间序列预测方法及电价预测研究[J]. 自动化学报, 2020, 46(6): 1136-1144.
|
| [18] |
骆钊, 吴谕侯, 朱家祥, 等. 基于多尺度时间序列块自编码Transformer神经网络模型的风电超短期功率预测[J]. 电网技术, 2023, 47(9): 3527-3537.
|
| [19] |
黄亮. 基于改进粒子群算法的LSTM结构优化研究及应用[D]. 哈尔滨: 哈尔滨师范大学, 2022.
|
| [20] |
姚芳, 汤俊豪, 陈盛华, 等. 基于ISSA-CNN-GRU模型的电动汽车充电负荷预测方法[J]. 电力系统保护与控制, 2023, 51(16): 158-167.
|
| [21] |
|
| [22] |
龙文, 蔡绍洪, 焦建军, 等. 求解大规模优化问题的改进鲸鱼优化算法[J]. 系统工程理论与实践, 2017, 37(11): 2983-2994.
提出一种基于非线性收敛因子的改进鲸鱼优化算法(简记为IWOA)用于求解大规模复杂优化问题.为算法全局搜索奠定基础,在搜索空间中利用对立学习策略进行初始化鲸鱼个体位置;设计一种随进化迭代次数非线性变化的收敛因子更新公式以协调WOA算法的探索和开发能力;对当前最优鲸鱼个体执行多样性变异操作以减少算法陷入局部最优的概率.选取15个大规模(200维、500维和1000维)标准测试函数进行数值实验,结果表明,IWOA在求解精度和收敛速度方面明显优于其他对比算法.
|
| [23] |
尹元亚, 潘文虎, 赵文广, 等. 基于CEEMDAN和BiLSTM-AM的超短期风速预测方法[J]. 电测与仪表, 2024, 61(9): 77-84.
|
| [24] |
|
| [25] |
谢宏, 张华赢, 梁晓锐, 等. 基于关系图卷积神经网络的新能源配电台区拓扑识别方法[J]. 电测与仪表, 2024, 61(7): 94-102.
|
| [26] |
徐博, 钱成功, 牛军伟, 等. 基于深度自编码器的分钟级负荷数据聚类分析[J]. 广东电力, 2023, 36(3): 57-67.
|
| [27] |
郭铁峰, 贺建军, 申帅, 等. 基于动态规整与改进变分自编码器的异常电池在线检测方法[J]. 电子与信息学报, 2024, 46(2): 738-747.
|
| [28] |
|
| [29] |
桂芳旭, 陈衡, 赵欣悦, 等. 变压器全生命周期成本分析及优化[J]. 广东电力, 2024, 37(3): 44-53.
|
| [30] |
王晟睿. 基于改进鲸鱼优化算法的Stanley算法研究[D]. 长春: 吉林大学, 2022.
|
| [31] |
洪德华, 张翠翠, 宫政, 等. 基于改进双向GABP神经网络的电力负荷缺失数据补全方法[J]. 电气自动化, 2022, 44(5): 41-45.
|
| [32] |
|
AI小编
/
| 〈 |
|
〉 |