• CSCD核心库收录期刊
  • 中文核心期刊
  • 中国科技核心期刊

电力建设 ›› 2016, Vol. 37 ›› Issue (11): 48-.doi: 10.3969/j.issn.1000-7229.2016.11.008

• 电力大数据 • 上一篇    下一篇

 基于Spark的大电网广域时空序列分析平台构建

 袁宝超1,刘道伟2,刘丽平2,王泽忠1   

  1.  1.华北电力大学电气与电子工程学院,北京市 102206;
    2.中国电力科学研究院,北京市 100192
  • 出版日期:2016-11-01
  • 作者简介:袁宝超(1990),男,硕士研究生,研究方向为基于广域信息的电网扰动特性及大数据技术; 刘道伟(1977),男,博士,高级工程师,主要研究方向为响应式大电网稳定态势量化评估与自适应控制; 刘丽平(1964),女,硕士研究生,教授级高工,主要研究方向为电力系统自动化; 王泽忠(1960),男,教授,博士生导师,研究方向为电力系统电磁兼容和电磁场数值计算。
  • 基金资助:
     国家自然科学基金项目(51207143);国家电网公司科技项目(XT71-15-056)

 Platform Building for Wide-Area Spatiotemporal Sequences Analysis of Large-Scale Power Grid Based on Spark

 YUAN Baochao1, LIU Daowei2, LIU Liping2, WANG Zezhong1   

  1.  1. North China Electric Power University, Beijing 102206, China;
    2. China Electric Power Research Institute, Beijing 100192, China
  • Online:2016-11-01
  • Supported by:
     Project supported by National Natural Science Foundation of China(51207143)

摘要:  为了适应能源互联网发展趋势及日益复杂的运行环境,亟需依托大数据技术,提升能源互联网多源大数据的挖掘深度及应用效率。首先,针对大电网广域时空序列数据,阐述了Spark在分布式计算中的优势,阐明大数据平台建设目标,设计了基于Spark的电力大数据平台架构,并对平台各个层次进行详细的论述。其次,描述了Spark针对电网时空序列数据的处理过程。最后,在搭建的Spark和Hadoop实验环境基础上,对典型聚类算法进行性能对比测试,验证了Spark相对于Hadoop的MapReduce计算模型数据处理的优势,为下一步研究工作奠定了基础。

关键词:  能源互联网, Spark, 时空序列, 流计算, 聚类

Abstract:   To address the energy internet trends and increasingly complex operating environment, we need to enhance the mining depth and utilization capability of energy internet multi-source data relying on big data technology. First, in the view of the wide-area spatiotemporal sequences data of large power grid, this paper expounds the Sparks advantages in distributed computing and the goal of big data platform, designs the big data platform architecture of power grid based on Spark, and describes each level of the platform in detail. Secondly, this paper describes the Sparks advantage in processing the spatiotemporal sequences data. Finally, on the basis of Spark and Hadoop experiment environment, this paper carries out typical clustering algorithm to compare the performance between Spark and Hadoop. The results verifies that Spark has a great advantage in data processing comparing with Hadoop MapReduce, which lays the foundation for the next step research.

Key words:  energy internet, Spark, spatiotemporal sequences, streaming computing, cluster

中图分类号: