
Discussion of Training Samples-Oriented Sampling Methods for Data-driven Power Systems Analysis
Author(s) -
Lei Jiang-long,
Jie Yu,
Yan Yang,
Mingxu Xiang,
Hongxin Yu,
Hongbin Wang
Publication year - 2021
Publication title -
IOP Conference Series: Earth and Environmental Science
Language(s) - English
Resource type - Journals
eISSN - 1755-1307
pISSN - 1755-1315
DOI - 10.1088/1755-1315/714/4/042049
Subject(s) - computer science, sampling (signal processing), latin hypercube sampling, scale (ratio), data mining, sample (material), power (physics), adaptive sampling, training (meteorology), artificial neural network, machine learning, monte carlo method, statistics, mathematics, physics, chemistry, filter (signal processing), meteorology, computer vision, chromatography, quantum mechanics
Data-driven methods such as deep neural networks (DNNs) and Gaussian processes (GPs) have become a promising way to balance calculation accuracy and efficiency in power system analysis. For instance, several studies use DNN-based methods to achieve fast and accurate calculation of probabilistic power flow. These methods rely on sample data for training, which is mainly obtained by sampling. However, there is still no guidance on how to select a suitable sampling method to effectively generate representative training samples. To address this problem, this paper explores the impact of different sampling methods on data-driven methods through theoretical analysis and simulation. The following conclusions are drawn: i) Random sampling (RS) is suitable for data-driven methods that require large-scale training samples. ii) Latin hypercube sampling (LHS) is suitable for data-driven methods that only work well with small-scale training samples. iii) The assumption that training samples generated by uniform sampling (US), which covers all sampling intervals, lead to good performance does not hold. Finally, simulations are implemented on the IEEE 39-bus system, and the results support these conclusions.
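The three sampling methods compared in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the uncertain inputs (for example, bus loads) have been normalized to the unit hypercube, and it interprets US as an equally spaced grid of interval midpoints, which is only one possible reading of "covers all sampling intervals". All function names are hypothetical.

```python
import itertools
import random


def random_sampling(n, d, rng):
    """RS: draw every coordinate independently from U[0, 1)."""
    return [[rng.random() for _ in range(d)] for _ in range(n)]


def latin_hypercube_sampling(n, d, rng):
    """LHS: split [0, 1) into n equal strata per dimension and
    use each stratum exactly once in every dimension."""
    columns = []
    for _ in range(d):
        # One random point inside each of the n strata.
        col = [(k + rng.random()) / n for k in range(n)]
        # Shuffle so strata are paired randomly across dimensions.
        rng.shuffle(col)
        columns.append(col)
    # Transpose the per-dimension columns into n sample points.
    return [[columns[j][i] for j in range(d)] for i in range(n)]


def uniform_grid_sampling(levels, d):
    """US (assumed interpretation): full grid of interval midpoints.
    Produces levels**d points, so it grows quickly with d."""
    mids = [(k + 0.5) / levels for k in range(levels)]
    return [list(p) for p in itertools.product(mids, repeat=d)]


def strata_covered(samples, dim, n_strata):
    """Count how many of n_strata equal intervals are hit in one dimension."""
    hit = {min(int(s[dim] * n_strata), n_strata - 1) for s in samples}
    return len(hit)


if __name__ == "__main__":
    rng = random.Random(0)
    n, d = 10, 3
    rs = random_sampling(n, d, rng)
    lhs = latin_hypercube_sampling(n, d, rng)
    grid = uniform_grid_sampling(3, d)
    print("RS strata covered in dim 0: ", strata_covered(rs, 0, n))
    print("LHS strata covered in dim 0:", strata_covered(lhs, 0, n))
    print("US grid size for 3 levels:  ", len(grid))
```

With a small sample budget, LHS hits every stratum in each dimension by construction, while RS typically leaves some strata empty. This matches the intuition behind conclusion ii). The grid size of the US sketch grows as `levels**d`, so a full grid is feasible only for few input dimensions.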