• CSCD核心库收录期刊
  • 中文核心期刊
  • 中国科技核心期刊

电力建设 ›› 2021, Vol. 42 ›› Issue (7): 101-109.doi: 10.12204/j.issn.1000-7229.2021.07.012

• 电力规划 • 上一篇    下一篇

基于深度强化学习的输电网网架规划方法

刘帅1, 孔亮1, 刘自发2, 李玉文1, 陈逸轩2   

  1. 1.国网山东省电力公司威海供电公司,山东省威海市 264200
    2.华北电力大学电气与电子工程学院,北京市 102206
  • 收稿日期:2020-10-20 出版日期:2021-07-01 发布日期:2021-07-09
  • 通讯作者: 刘帅
  • 作者简介:孔亮(1986),男,本科,工程师,研究方向为电气工程、电力营销等;|刘自发(1973),男,博士,教授,研究方向为电网规划;|李玉文(1986),男,本科,工程师,研究方向为电网规划;|陈逸轩(1996),男,硕士研究生,研究方向为配电网规划。
  • 基金资助:
    国网山东省电力公司科技项目(52061318006P)

Transmission Network Planning Method Based on Deep Reinforcement Learning

LIU Shuai1, KONG Liang1, LIU Zifa2, LI Yuwen1, CHEN Yixuan2   

  1. 1. State Grid Weihai Power Supply Company, Weihai 264200, Shandong Province, China
    2. School of Electrical and Electronic Engineering, North China Electric Power University, Beijing 102206, China
  • Received:2020-10-20 Online:2021-07-01 Published:2021-07-09
  • Contact: LIU Shuai
  • Supported by:
    State Grid Shandong Electric Power Company Research Program(52061318006P)

摘要:

为解决现有输电网规划方法在多场景情况下存在的灵活性不足的问题,同时进一步提高规划方法的运算效率,文章提出一种基于深度强化学习的输电网规划方法。首先,通过聚类方法,以系统信息熵最小为目标,生成用于规划的电网典型场景,并建立适用于多场景的输电网灵活规划模型。其次,综合运用深度强化学习方法及Actor-Critic方法,提出适用于输电网规划的改进指针网络模型,并采用改进指针网络与Actor-Critic结合的方法(revised pointer network with Actor-Critic, RPNAC)对规划模型进行求解。最后,基于IEEE标准算例进行计算及数据分析,验证了文章所提方法的科学性和高效性。

关键词: 输电网网架规划, 信息熵, 多场景, 指针网络, 深度强化学习

Abstract:

In order to solve the problem that the existing transmission network planning methods are not flexible enough in multiple scenarios, and to improve the operation efficiency of the planning methods, this paper proposes a transmission network planning method based on deep reinforcement learning. Firstly, this paper uses scenario information entropy to generate a variety of typical scenarios, and establishes a flexible transmission network planning model suitable for multiple scenarios. Secondly, an improved pointer network model suitable for transmission network planning is proposed by using the deep reinforcement learning method and the actor critical method, and the revised pointer network with Actor-Critic (RPNAC) method is used to solve the planning model. Finally, an example verifies the effectiveness and feasibility of the proposed method.

Key words: transmission network planning, information entropy, multiple scenarios, pointer network, deep reinforcement learning

中图分类号: