Intelligent Diagnosis of Hydroelectric Power Station Alarm Based on Hybrid-Augmentation of Expert Knowledge and Large Language Model

SUN Guoqiang, SHI Xueheng, LUO Zheqing, CHENG Lilin, ZANG Haixiang, WEI Zhinong

Electric Power Construction ›› 2025, Vol. 46 ›› Issue (10) : 44-57.

PDF(2025 KB)
PDF(2025 KB)
Electric Power Construction ›› 2025, Vol. 46 ›› Issue (10) : 44-57. DOI: 10.12204/j.issn.1000-7229.2025.10.005
Planning & Construction

Intelligent Diagnosis of Hydroelectric Power Station Alarm Based on Hybrid-Augmentation of Expert Knowledge and Large Language Model

Author information +
History +

Abstract

[Objective] To address the problems of large sample imbalance and low diagnostic efficiency in traditional hydropower station monitoring and alarm event diagnosis, as well as the lack of interpretability and difficulty in ensuring diagnostic accuracy in artificial intelligence-based diagnostic methods, a deep learning framework based on a large language model (LLM) for hydropower station alarm event sample enhancement and fault diagnosis is designed. Moreover, regular expressions based on expert knowledge are introduced to verify the alarm diagnosis results and improve their credibility. [Methods] First, considering the issue of imbalanced monitoring of event samples in a hydropower station, the SimBERT model is used to increase the amount of data for types with fewer samples. The enhanced data are then mixed with the original data to form the input samples of the deep learning network. Second, based on power grid monitoring alarm rules, regular expressions for typical faults are constructed to achieve accurate discrimination of typical faults. Finally, based on the ERNIE LLM, word vector embeddings of hydropower station monitoring events are constructed and input into the hierarchical attention network to learn the features of each event. The classification results are then produced. The final fault diagnosis results are then obtained by verifying the results produced using regular expressions. [Results] The test results show that, compared with traditional monitoring alarm event diagnosis methods for hydropower stations, this model can improve the diagnostic accuracy by more than 2%, reduce the model training time by 30%, and achieve diagnosis within 0.1 seconds. [Conclusions] The proposed deep learning framework based on an LLM for enhancing alarm event samples and fault diagnosis of hydropower stations can achieve fast and accurate results during system failures, which is conducive to ensuring long-term stable and safe operation of the power system.

Key words

SimBERT / regular expression / ERNIE / hierarchical attention network / hydropower station intelligent alarm

Cite this article

Download Citations
SUN Guoqiang , SHI Xueheng , LUO Zheqing , et al . Intelligent Diagnosis of Hydroelectric Power Station Alarm Based on Hybrid-Augmentation of Expert Knowledge and Large Language Model[J]. Electric Power Construction. 2025, 46(10): 44-57 https://doi.org/10.12204/j.issn.1000-7229.2025.10.005

References

[1]
闪鑫, 戴则梅, 张哲, 等. 智能电网调度控制系统综合智能告警研究及应用[J]. 电力系统自动化, 2015, 39(1): 65-72.
SHAN Xin, DAI Zemei, ZHANG Zhe, et al. Research on and application of integrated smart alarm based on smart grid dispatching and control systems[J]. Automation of Electric Power Systems, 2015, 39(1): 65-72.
[2]
王静智, 赵磊, 邓方明, 等. 居民用户参与电网调峰激励智慧用能策略研究[J]. 电力科学与技术学报, 2024, 39(4): 121-127.
WANG Jingzhi, ZHAO Lei, DENG Fangming, et al. Research on smart energy consumption strategy of residents participating in peak load regulation[J]. Journal of Electric Power Science and Technology, 2024, 39(4): 121-127.
[3]
赵峰, 李炳森, 粘为帆. 基于人工智能技术的数字化供电所设计与应用[J]. 山东电力技术, 2023, 50 (10): 50-57.
ZHAO Feng, LI Bingsen, NIAN Weifan. Design and application of digital power supply station based on artificial intelligence technology[J]. 山东电力技术, 2023, 50(10):50-57.
[4]
汪崔洋, 江全元, 唐雅洁, 等. 基于告警信号文本挖掘的电力调度故障诊断[J]. 电力自动化设备, 2019, 39(4): 126-132.
WANG Cuiyang, JIANG Quanyuan, TANG Yajie, et al. Fault diagnosis of power dispatching based on alarm signal text mining[J]. Electric Power Automation Equipment, 2019, 39(4): 126-132.
[5]
刘梓权, 王慧芳, 曹靖, 等. 基于卷积神经网络的电力设备缺陷文本分类模型研究[J]. 电网技术, 2018, 42(2): 644-651.
LIU Ziquan, WANG Huifang, CAO Jing, et al. A classification model of power equipment defect texts based on convolutional neural network[J]. Power System Technology, 2018, 42(2): 644-651.
[6]
BAI Z Y, SUN G Q, ZANG H X, et al. Identification technology of grid monitoring alarm event based on natural language processing and deep learning in China[J]. Energies, 2019, 12(17): 3258.
[7]
康佳宇, 张沈习, 张庆平, 等. 基于ANOVA和BO-SVM的变压器故障诊断方法[J]. 高电压技术, 2023, 49(5): 1882-1891.
KANG Jiayu, ZHANG Shenxi, ZHANG Qingping, et al. Fault diagnosis method of transformer based on ANOVA and BO-SVM[J]. High Voltage Engineering, 2023, 49(5): 1882-1891.
[8]
冯斌, 张又文, 唐昕, 等. 基于BiLSTM-Attention神经网络的电力设备缺陷文本挖掘[J]. 中国电机工程学报, 2020, 40(增刊1): 1-10.
FENG Bin, ZHANG Youwen, TANG Xin, et al. Text mining of power equipment defects based on BiLSTM-attention neural network[J]. Proceedings of the CSEE, 2020, 40(S1): 1-10.
[9]
汪洋, 杨仕伟, 王宝华, 等. 基于深度置信网络的交直流配电网直流故障检测技术[J]. 电力工程技术, 2023, 42(1): 251-259.
WANG Yang, YANG Shiwei, WANG Baohua, et al. DC fault detection technology for AC/DC distribution network based on DBN[J]. Electric Power Engineering Technology, 2023, 42(1): 251-259.
[10]
赵妍, 孙延, 聂永辉. 基于格拉姆角差场和迁移残差网络的HVDC线路故障识别[J]. 电力建设, 2024, 45(8): 118-127.
Abstract
为了提高高压直流(high voltage direct current,HVDC)输电线路在样本数量不足和高阻抗条件下的识别准确率,提出了一种基于格拉姆角差场(Gramian angular difference field,GADF)和迁移残差网络(ResNet18)结合的高压直流输电线路故障识别方法。首先利用格拉姆角差场将一维时序信号转化为二维角差场图,然后将在源域ImageNet-1K数据集上训练好的ResNet18模型的权重参数迁移至以角场图为目标域的ResNet18模型中,自适应提取故障相关特征,进行故障类型识别。实验结果证明:相较于其他深度学习方法,所提方法在小样本条件下能够正确识别区内正极接地故障、区内负极接地故障、区内双极短路故障和区外故障,识别准确率达到99.67%,并且具有较强的耐受过渡电阻能力、抗噪性和泛化性。
ZHAO Yan, SUN Yan, NIE Yonghui. HVDC line fault identification based on the gram angle difference field and transfer residual network[J]. Electric Power Construction, 2024, 45(8): 118-127.
To improve the identification accuracy of high-voltage direct current (HVDC) transmission line faults under conditions of limited sample size and high impedance, a fault identification method for high-voltage direct current transmission lines that combines the gram angle difference field (GADF) and transfer learning using Residual Network 18 (ResNet18-TL) is proposed. First, one-dimensional time-domain signals were transformed into two-dimensional angle-difference field maps using GADF. Subsequently, the weight parameters of a ResNet18 model pre-trained on the source domain ImageNet-1K dataset were transferred to a ResNet18 model with angle-field maps as the target domain, enabling the adaptive extraction of fault-related features for fault-type recognition. Experimental results demonstrate that, compared with other deep learning methods, the proposed approach can correctly identify internal positive-polarity ground faults, internal negative-polarity ground faults, internal bipolar short-circuit faults, and external faults under small-sample conditions, achieving an accuracy of 99.67%. Additionally, it exhibits a strong tolerance to transient resistance, noise resistance, and generalization capabilities. 
[11]
曹祎, 张莉, 郭静, 等. 基于大语言模型的低碳电力市场发展应用前景[J]. 智慧电力, 2024, 52 (2): 8-16.
CAO Yi, ZHANG Li, GUO Jing, et al. Prospects for development of low-carbon electricity markets based on large language models[J]. Smart Power, 2024, 52(2):8-16.
[12]
DEVLIN J, CHANG M, LEE K, et al. BERT: Pre-training of deep bidirectional transformers for language understanding[DB/OL]. (2019-05-24) [2023-11-17]. https://arxiv.org/abs/1810.04805.
[13]
LAN Z Z, CHEN M D, GOODMAN S, et al. ALBERT: a lite BERT for self-supervised learning of language representations[DB/OL]. 2019: 1909. 11942. (2020-02-09) [2024-09-12]. https://arxiv.org/abs/1909.11942v6.
[14]
LIU Y H, OTT M, GOYAL N, et al. RoBERTa: a robustly optimized BERT pretraining approach[DB/OL]. 2019: 1907. 11692. (2019-07-26) [2024-09-12]. https://arxiv.org/abs/1907.11692v1.
[15]
田雪涵, 董坤, 赵剑锋, 等. 基于增强优化预训练语言模型的电力数据实体识别方法[J]. 智慧电力, 2024, 52 (6): 100-107.
TIAN Xuehan, DONG Kun, ZHAO Jianfeng, et al. Entity Recognition method for power data based on enhanced optimization pre-trained language model[J]. Smart Power, 2024, 52(6):100-107.
[16]
REIMERS N, GUREVYCH I. Sentence-BERT:sentence embeddings using Siamese BERT-networks[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Stroudsburg, PA, USA: ACL, 2019: 3980-3990.
[17]
SUN Z J, LI X Y, SUN X F, et al. ChineseBERT: Chinese pretraining enhanced by glyph and pinyin information[DB/OL]. 2021: 2106. 16038. (2021-06-30) [2024-09-12]. https://arxiv.org/abs/2106.16038v1.
[18]
贾骏, 杨强, 付慧, 等. 基于电力设备大数据的预训练语言模型构建和文本语义分析[J]. 中国电机工程学报, 2023, 43(3): 1027-1037.
JIA Jun, YANG Qiang, FU Hui, et al. Research on pre-training language model construction and text semantic analysis based on power equipment big data[J]. Proceedings of the CSEE, 2023, 43(3): 1027-1037.
[19]
SUN Y, WANG S H, LI Y K, et al. ERNIE: enhanced representation through knowledge integration[DB/OL]. 2019: 1904. 09223. (2019-04-19) [2024-09-12]. https://arxiv.org/abs/1904.09223v1.
[20]
LI J, ZHANG D Z, WULAMU A. Chinese text classification based on ERNIE-RNN[C]// 2021 2nd International Conference on Electronics, Communications and Information Technology (CECIT). IEEE, 2021: 368-372.
[21]
李刚, 方鸿, 刘云鹏, 等. 新型电力系统中的大模型驱动技术: 现状、机遇与挑战[J]. 高电压技术, 2024, 50(7): 2864-2878.
LI Gang, FANG Hong, LIU Yunpeng, et al. Large-model drive technology in new power system: status, challenges and prospects[J]. High Voltage Engineering, 2024, 50(7): 2864-2878.
[22]
ISSIFU A M, GANIZ M C. A simple data augmentation method to improve the performance of named entity recognition models in medical domain[C]//2021 6th International Conference on Computer Science and Engineering (UBMK). IEEE, 2021: 763-768.
[23]
唐子卓, 刘洋, 许立雄, 等. 基于负荷数据频域特征和LSTM网络的类别不平衡负荷典型用电模式提取方法[J]. 电力建设, 2020, 41(8): 17-24.
Abstract
现有用户用电模式提取技术主要基于负荷数据时域特征提取,无法准确分辨时域上欧式距离接近但频域上波动特性差异较大的负荷数据,且对类别不平衡负荷数据的分类准确率较低。为解决上述问题,文章首先通过基于样本支持向量的过采样方法(support vector machines-synthetic minority over-sampling technique, SVM-SMOTE)对存在类别不平衡问题的负荷数据进行处理;然后,通过极大重叠离散小波变换(maximal overlap discrete wavelet transform, MODWT)对负荷数据进行分解,并将分解后的尺度系数和细节系数组成频域的特征矩阵;最后将频域特征矩阵输入深度长短期记忆(long short-term memory, LSTM)神经网络进行负荷分类并通过求取各个类别质心来获取典型用电模式。实验结果表明,该方法具有良好的类别不平衡数据处理能力和负荷分类效果。
TANG Zizhuo, LIU Yang, XU Lixiong, et al. Imbalanced-load pattern extraction method based on frequency domain characteristics of load data and LSTM network[J]. Electric Power Construction, 2020, 41(8): 17-24.
The current user mode extraction technology is mainly based on the time-domain characteristics of load data. It cannot accurately distinguish load data with close Euclidean distance in time domain and different fluctuation characteristics in frequency domain, and the classification accuracy of imbalanced data is low. This paper proposes a user mode extraction model to solve the above problems. The model firstly processes the data set with class imbalance by SVM-SMOTE over-sampling method, secondly obtains the scale and wavelet coefficients which is used to form the characteristic matrix in frequency domain through overlapping discrete wavelet transform, finally inputs the characteristic matrix into the deep LSTM network for classification and obtains the typical user mode by calculating the centroid of each category. The experimental results show that the method can effectively process imbalanced data and increase classification accuracy.
[24]
游文霞, 梁皓, 杨楠, 等. 基于重采样和混合集成学习的不平衡窃电检测[J]. 电网技术, 2024, 48(2): 730-742.
YOU Wenxia, LIANG Hao, YANG Nan, et al. Class imbalanced electricity theft detection based on resampling and hybrid ensemble learning[J]. Power System Technology, 2024, 48(2): 730-742.
[25]
王绪亮, 顾媛丽, 张鸿儒, 等. 基于知识集成流形的电力设备缺陷文本数据增强方法与应用研究[J]. 电网技术, 2024, 48(4): 1690-1702.
WANG Xuliang, GU Yuanli, ZHANG Hongru, et al. Data augmentation and application of defect texts for power equipment based on knowledge integration manifold[J]. Power System Technology, 2024, 48(4): 1690-1702.
[26]
卫志农, 石东明, 张明, 等. 考虑样本类别不平衡的电网故障事件智能识别方法[J]. 电力自动化设备, 2021, 41(11): 93-99.
WEI Zhinong, SHI Dongming, ZHANG Ming, et al. Intelligent identification method of power grid fault events considering sample classification imbalance[J]. Electric Power Automation Equipment, 2021, 41(11): 93-99.
[27]
WEI Z L, WANG C T, YANG X X, et al. Imbalanced sentiment classification of online reviews based on SimBERT[J]. Journal of Intelligent & Fuzzy Systems, 2023, 45(5): 8015-8025.
[28]
YANG Z C, YANG D Y, DYER C, et al. Hierarchical attention networks for document classification[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. Stroudsburg, PA, USA: ACL, 2016: 1480-1489.
[29]
VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[J]. Advances in Neural Information Processing Systems, 2017, 30:20-26.
[30]
LAMPLE G, CONNEAU A. Cross-lingual language model pretraining[DB/OL]. 2019: 1901. 07291. (2019-01-22)[2024-09-12]. https://arxiv.org/abs/1901.07291v1.
[31]
孙国强, 章逸舟, 唐杰阳, 等. 基于数据增强和深度学习的水电站告警事件诊断[J]. 电力自动化设备, 2023, 43(8): 88-95.
SUN Guoqiang, ZHANG Yizhou, TANG Jieyang, et al. Diagnosis method of hydropower alarm events based on data augmentation and deep learning[J]. Electric Power Automation Equipment, 2023, 43(8): 88-95.

Funding

National Natural Science Foundation of China(U24B2088)
PDF(2025 KB)

Accesses

Citation

Detail

Sections
Recommended

/