• CSCD核心库收录期刊
  • 中文核心期刊
  • 中国科技核心期刊

电力建设 ›› 2024, Vol. 45 ›› Issue (5): 59-70.doi: 10.12204/j.issn.1000-7229.2024.05.007

• 大模型小样本条件下新能源规划设计与优化运行技术·栏目主持 葛磊蛟副教授、孙铭阳教授、郑锋副教授、黄文焘副教授· • 上一篇    下一篇

多智能体深度强化学习驱动的跨园区能源交互优化调度

李扬1(), 马文捷1, 卜凡金2, 杨震3, 王彬4, 韩猛2   

  1. 1.东北电力大学电气工程学院,吉林省吉林市 132012
    2.国网淄博供电公司,山东省淄博市 255022
    3.国网北京市电力公司,北京市 100032
    4.国网济宁供电公司,山东省济宁市 272000
  • 收稿日期:2023-07-19 出版日期:2024-05-01 发布日期:2024-04-29
  • 通讯作者: 李扬(1980),男,教授,博士生导师,主要研究方向为综合能源系统优化调度、电力系统稳定评估,E-mail:liyang@neepu.edu.cn
  • 作者简介:马文捷(1999),男,硕士研究生,主要研究方向为人工智能在综合能源系统中的应用;
    卜凡金(1996),男,硕士,主要研究方向为人工智能在综合能源系统中的应用;
    杨震(1992),男,硕士,工程师,主要研究方向为电力系统优化运行与控制;
    王彬(1995),男,硕士,工程师,主要研究方向为电力系统优化运行与控制;
    韩猛(1996),男,硕士,工程师,主要研究方向为电力系统优化运行与控制。
  • 基金资助:
    吉林省自然科学基金项目(YDZJ202101ZYTS149)

Deep Reinforcement Learning-driven Cross-Community Energy Interaction Optimal Scheduling

LI Yang1(), MA Wenjie1, BU Fanjin2, YANG Zhen3, WANG Bin4, HAN Meng2   

  1. 1. School of Electrical Engineering, Northeast Electric Power University, Jilin 132012, Jilin Province, China
    2. State Grid Zibo Power Supply Company, Zibo 255022, Shandong Province, China
    3. State Grid Beijing Electric Power Company, Beijing 100032, China
    4. State Grid Jining Power Supply Company, Jining 272000, Shandong Province, China
  • Received:2023-07-19 Published:2024-05-01 Online:2024-04-29
  • Supported by:
    Natural Science Foundation of Jilin Province(YDZJ202101ZYTS149)

摘要:

为协调多园区综合能源系统各个园区之间的能量交互,多能源子系统之间的能源转换,实现综合能源系统整体优化调度,提出一种利用多智能体深度强化学习算法学习不同园区的负荷特征,并在此基础上进行决策的综合调度模型。该模型将多园区综合能源系统的调度问题转化为马尔科夫决策过程,并利用深度强化学习算法进行求解,避免了对多园区、多能源子系统之间复杂的能量耦合关系进行建模。仿真结果表明,所提方法可以很好地捕捉到不同园区的负荷特性,并利用其中的互补特性协调不同园区之间进行合理的能量交互,可以实现弃风率由16.3%降低至0,并可以使总运行成本降低5 445.6元,具有良好的经济效益和环保效益。

关键词: 多智能体深度强化学习, 综合能源系统, 优化调度, 可再生能源消纳, 负荷特征学习, 多园区能量交互

Abstract:

In order to coordinate energy interactions among various communities and energy conversions among multi-energy subsystems within the multi-community integrated energy system under uncertain conditions, and achieve overall optimization and scheduling of the comprehensive energy system, this paper proposes a comprehensive scheduling model that utilizes a multi-agent deep reinforcement learning algorithm to learn load characteristics of different communities and make decisions based on this knowledge. In this model, the scheduling problem of the integrated energy system is transformed into a Markov decision process and solved using a data-driven deep reinforcement learning algorithm, which avoids the need for modeling complex energy coupling relationships between multi-communities and multi-energy subsystems. The simulation results show that the proposed method effectively captures the load characteristics of different communities and utilizes their complementary features to coordinate reasonable energy interactions among them. This leads to a reduction in wind curtailment rate from 16.3% to 0% and lowers the overall operating cost by 5445.6 Yuan, demonstrating significant economic and environmental benefits.

Key words: multi-agent deep reinforcement learning, integrated energy system, optimal scheduling, renewable energy consumption, load characteristic learning, energy interaction among communities

中图分类号: