PDF(2221 KB)
PDF(2221 KB)
PDF(2221 KB)
基于流形学习的风电机组异常数据识别方法
Anomaly Detection Method for Wind Turbine Data Based on Manifold Learning
为有效识别和剔除风电机组实测数据中的异常数据,通过分析风电机组实测数据的高维特征,提出一种基于流形学习的异常数据识别算法。首先,采用k-近邻互信息算法实现风电机组特征变量选择;随后,使用将样本间距离度量替换为欧几里得度量和局部主成分分析(local principal component analysis, LPCA)差别加权和的优化t-分布随机近邻嵌入(t-distributed stochastic neighbor embedding, t-SNE)算法挖掘出高维流形数据中具有内在规律的低维特征,使得具有不同分布特征的数据在可视化二维空间中显著分离;最后,采用基于密度的噪声空间聚类(density-based spatial clustering of applications with noise, DBSCAN)算法对二维空间中的数据进行聚类。结果表明,与主成分分析(principal component analysis, PCA)算法、局部线性嵌入(locally linear embedding, LLE)算法和原t-SNE算法相比,所提方法能够对各种复杂工况数据进行可视化分离聚类,并对异常数据进行识别和剔除。
To effectively identify and eliminate abnormal data in the measured data of wind turbines, an anomaly detection algorithm based on manifold learning is proposed through the analysis of high-dimensional measured data from wind turbines. Firstly, the k-nearest neighbor mutual information algorithm is employed to select feature variables for the wind turbine. Subsequently, an optimized t-distributed stochastic neighbor embedding (t-SNE) algorithm is utilized. This optimized algorithm replaces the sample distance metric with a weighted sum of the Euclidean distance and the local principal component analysis (LPCA) difference, enabling the extraction of low-dimensional features with inherent patterns from the high-dimensional manifold data. This facilitates the distinct separation of data with different distribution characteristics in a visualized two-dimensional space. Furthermore, the density-based spatial clustering of applications with noise (DBSCAN) algorithm is applied to cluster the data within this two-dimensional space. The results demonstrate that, compared to the principal component analysis (PCA) algorithm, locally linear embedding (LLE) algorithm, and the original t-SNE algorithm, the proposed method can effectively achieve visual separation and clustering for data under various complex operating conditions, successfully identifying and eliminating abnormal data.
风电机组 / 异常数据 / 流形学习 / 降维 / 基于密度的噪声空间聚类(DBSCAN)算法
wind turbines / anomalous data / manifold learning / dimensionality reduction / density-based spatial clustering of applications with noise (DBSCAN) algorithm
| [1] |
俞颖. 全球电力发展现状[J]. 电力科技与环保, 2025, 41(1): 173.
YU Ying. Current status of global power development[J]. Electric Power Technology and Environmental Protection, 2025, 41(1): 173.
|
| [2] |
张旻. 风力发电机组现状与发展趋势探析[J]. 中国机械, 2024(3): 58-61.
ZHANG Min. Exploration of the current situation and development trend of wind turbines[J]. Machine China, 2024(3): 58-61.
|
| [3] |
严新荣, 张宁宁, 马奎超, 等. 我国海上风电发展现状与趋势综述[J]. 发电技术, 2024, 45(1): 1-12.
YAN Xinrong, ZHANG Ningning, MA Kuichao, et al. Overview of current situation and trend of offshore wind power development in China[J]. Power Generation Technology, 2024, 45(1): 1-12.
|
| [4] |
周吉, 汤亚芳, 王金埭, 等. 基于直流母线电压动态响应特性的双馈风电场故障暂态等值建模方法[J]. 智慧电力, 2025, 53(7): 84-91.
ZHOU Ji, TANG Yafang, WANG Jindai, et al. Fault transient equivalent modeling method for doubly-fed wind farms based on dynamic response characteristics of DC bus voltage[J]. Smart Power, 2025, 53(7): 84-91.
|
| [5] |
ZHANG G Y, LI Y T, JIANG W B, et al. A fault diagnosis method for wind turbines with limited labeled data based on balanced joint adaptive network[J]. Neurocomputing, 2022, 481: 133-153.
|
| [6] |
张晓英, 段赛赛, 张兴平, 等. 融合时空相关性和权重系数分配的风电机组出力异常值处理方法[J]. 电力建设, 2024, 45(6): 100-110.
ZHANG Xiaoying, DUAN Saisai, ZHANG Xingping, et al. Outlier processing method for wind turbine output integrated with spatiotemporal correlation and weight coefficient distribution[J]. Electric Power Construction, 2024, 45(6): 100-110.
|
| [7] |
陈小乾, 尹亮, 展宗辉, 等. 基于注意力机制和RCN-BiLSTM融合的风电机组故障识别[J]. 中国电力, 2025, 58(8): 94-102.
CHEN Xiaoqian, YIN Liang, ZHAN Zonghui, et al. Fault identification for wind turbine based on attention mechanism and RCN-BiLSTM fusion[J]. Electric Power, 2025, 58(8): 94-102.
|
| [8] |
刘朋义. 基于SCADA系统的风力发电场远程监控方法[J]. 电器工业, 2025(2): 35-38, 43.
LIU Pengyi. Remote monitoring methods for wind farms based on SCADA systems[J]. China Electrical Equipment Industry, 2025(2): 35-38, 43.
|
| [9] |
肖碧涛, 赖晓路, 郭鹏, 等. 高斯混合模型风电机组异常数据识别方法研究[J]. 可再生能源, 2023, 41(8): 1051-1056.
XIAO Bitao, LAI Xiaolu, GUO Peng, et al. Abnormal data identification for wind turbine based on Gaussian Mixture Model[J]. Renewable Energy Resources, 2023, 41(8): 1051-1056.
|
| [10] |
雷萌, 郭鹏, 刘博嵩. 基于自适应DBSCAN算法的风电机组异常数据识别研究[J]. 动力工程学报, 2021, 41(10): 859-865.
LEI Meng, GUO Peng, LIU Bosong. Study on abnormal data recognition of wind turbines based on adaptive DBSCAN algorithm[J]. Journal of Chinese Society of Power Engineering, 2021, 41(10): 859-865.
|
| [11] |
LUO Z H, FANG C Y, LIU C L, et al. Method for cleaning abnormal data of wind turbine power curve based on density clustering and boundary extraction[J]. IEEE Transactions on Sustainable Energy, 2022, 13(2): 1147-1159.
|
| [12] |
莫丰源, 王卫华, 郭前. 基于MADM-QM的风电机组风功率异常数据处理方法[J]. 可再生能源, 2025, 43(3): 339-345.
MO Fengyuan, WANG Weihua, GUO Qian. Wind speed-power abnormal data processing method of wind turbine based on MADM-QM[J]. Renewable Energy Resources, 2025, 43(3): 339-345.
|
| [13] |
LIANG G Y, SU Y H, CHEN F, et al. Wind power curve data cleaning by image thresholding based on class uncertainty and shape dissimilarity[J]. IEEE Transactions on Sustainable Energy, 2021, 12(2): 1383-1393.
|
| [14] |
ZHU A F, XIAO Z, ZHAO Q C. Power data preprocessing method of mountain wind farm based on POT-DBSCAN[J]. Energy Engineering, 2021, 118(3): 549-563.
|
| [15] |
孔维胜, 朱海鹏, 王晓东, 等. 基于DBSCAN的风机功率异常数据清洗[J]. 计算机科学与应用, 2021, 11(10): 2517-2528.
KONG Weisheng, ZHU Haipeng, WANG Xiaodong, et al. Cleaning of abnormal data of wind turbine power based on DBSCAN[J]. Computer Science and Application, 2021, 11(10): 2517-2528.
|
| [16] |
魏泰, 贺少雄, 胡子武, 等. 基于改进孤立森林算法的风电机组异常数据清洗[J]. 科学技术与工程, 2024, 24(9): 3691-3699.
WEI Tai, HE Shaoxiong, HU Ziwu, et al. Wind turbine abnormal data cleaning based on an improved isolation forest algorithm[J]. Science Technology and Engineering, 2024, 24(9): 3691-3699.
|
| [17] |
郭慧军, 李永亭, 齐咏生, 等. 两阶段CP-Copula的风电机组异常数据清洗算法[J]. 计算机仿真, 2022, 39(11): 85-91.
GUO Huijun, LI Yongting, QI Yongsheng, et al. A two-stage CP-Copula algorithm for clearing abnormal data of wind turbine[J]. Computer Simulation, 2022, 39(11): 85-91.
|
| [18] |
刘伟民. 风电功率曲线异常数据清洗及考虑风速风向的风能资源评估[D]. 兰州: 兰州理工大学, 2022.
LIU Weimin. Abnormal data cleaning of wind power curve and wind energy assessment considering wind speed and direction[D]. Lanzhou: Lanzhou University of Technology, 2022.
|
| [19] |
刘启鹏. 基于多元数据重构的风电机组异常变量识别分析[J]. 电力设备管理, 2025(9): 203-205.
LIU Qipeng. Identification and analysis of abnormal variables of wind turbines based on multivariate data reconstruction[J]. Electric Power Equipment Management, 2025(9): 203-205.
|
| [20] |
李特, 王荣喜, 高建民. 风电机组数据采集与监控系统异常数据识别方法[J]. 西安交通大学学报, 2024, 58(3): 106-116.
LI Te, WANG Rongxi, GAO Jianmin. A method for abnormal data recognition of wind turbine supervisory control and data acquisition systems[J]. Journal of Xi'an Jiaotong University, 2024, 58(3): 106-116.
|
| [21] |
龚亚雄, 高佳浩, 李逸, 等. 基于CNN-BiLSTM的风功率异常数据识别和清洗方法研究[J]. 能源与环境, 2023(5): 60-63.
GONG Yaxiong, GAO Jiahao, LI Yi, et al. Research on methods for identifying and cleaning abnormal wind power data based on CNN-BiLSTM[J]. Energy and Environment, 2023(5): 60-63.
|
| [22] |
王晓宇, 阿木古楞, 李关平, 等. 基于改进组内方差算法识别风电机组异常数据[J]. 风机技术, 2023, 65(4): 80-84.
WANG Xiaoyu, AMU Guleng, LI Guanping, et al. Abnormal data of wind turbine identification based on improved group internal variance algorithm[J]. Chinese Journal of Turbomachinery, 2023, 65(4): 80-84.
|
| [23] |
PEI F Q, ZHANG B W, LONG S Y, et al. BiLSTM-PINN-based wind turbine power prediction architecture and anomaly data identification[J]. Renewable Energy, 2026, 261: 125182.
|
| [24] |
杨锡运, 米尔扎提·买合木提, 刘思渠, 等. 基于Vine-Copula贝叶斯网络模型的风机高温降容状态评估方法[J]. 中国电机工程学报, 2020, 40(11): 3583-3591.
YANG Xiyun, MAIHEMUTI M, LIU Siqu, et al. State assessment method of capacity reduction at high temperature for wind turbine based on vine-copula bayesian network model[J]. Proceedings of the CSEE, 2020, 40(11): 3583-3591.
|
| [25] |
杨艳平, 李荣. 基于机器学习的高维数据分类特征选择[J]. 湖南文理学院学报(自然科学版), 2025, 37(1): 23-31.
YANG Yanping, LI Rong. Feature selection of high-dimensional data classification based on machine learning[J]. Journal of Hunan University of Arts and Science (Science and Technology), 2025, 37(1): 23-31.
|
| [26] |
李二超, 张宝新, 贾彬彬. KNN特征增强与互信息特征选择的两阶段多维分类方法[J]. 计算机工程与应用, 2025, 61(15): 167-177.
LI Erchao, ZHANG Baoxin, JIA Binbin. Two-stage multi-dimensional classification method combining KNN feature enhancement and mutual information feature selection[J]. Computer Engineering and Applications, 2025, 61(15): 167-177.
|
| [27] |
刘衍平, 奚金明, 郑荣艳, 等. 解决高维优化和特征选择问题的多策略改进麻雀搜索算法[J]. 科学技术与工程, 2024, 24(31): 13450-13466.
LIU Yanping, XI Jinming, ZHENG Rongyan, et al. Multi-strategy improved sparrow search algorithm for solving high dimensional optimization and feature selection problems[J]. Science Technology and Engineering, 2024, 24(31): 13450-13466.
|
| [28] |
王振飞, 袁佩瑶, 曹中亚, 等. 面向高维不平衡数据的特征选择算法[J]. 小型微型计算机系统, 2024, 45(8): 1839-1846.
WANG Zhenfei, YUAN Peiyao, CAO Zhongya, et al. Feature selection algorithm for high dimensional unbalanced data[J]. Journal of Chinese Computer Systems, 2024, 45(8): 1839-1846.
|
| [29] |
高永杰, 党建武, 张希权, 等. 基于互信息自适应的多模态实体对齐方法[J]. 计算机应用研究, 2025, 42(1): 106-110.
GAO Yongjie, DANG Jianwu, ZHANG Xiquan, et al. Multi-modal entity alignment method based on adaptive mutual information[J]. Application Research of Computers, 2025, 42(1): 106-110.
|
| [30] |
王子顺, 赵东辉, 苑骁明, 等. 基于互信息与CNN的风电机组齿轮箱故障智能诊断方法[J]. 风机技术, 2025, 67(5): 78-84.
WANG Zishun, ZHAO Donghui, YUAN Xiaoming, et al. An intelligent fault diagnosis approach for wind turbine gearboxes using mutual information and convolutional neural networks[J]. Chinese Journal of Turbomachinery, 2025, 67(5): 78-84.
|
| [31] |
王威, 阳春华, 韩洁, 等. 基于k-近邻互信息的前向式变量选择方法及在水质参数软测量中的应用[J]. 系统工程理论与实践, 2022, 42(1): 253-261.
WANG Wei, YANG Chunhua, HAN Jie, et al. Forward variable selection method based on k-nearest neighbor mutual information and its application in soft sensor modeling of water quality parameters[J]. Systems Engineering-Theory & Practice, 2022, 42(1): 253-261.
|
| [32] |
冯琳琳, 王长鹏, 吴田军, 等. 基于变分自编码器的流形学习降维方法[J]. 计算机辅助设计与图形学学报, 2025, 37(3): 439-445.
FENG Linlin, WANG Changpeng, WU Tianjun, et al. Dimensionality reduction method for manifold learning based on variational autoencoder[J]. Journal of Computer-Aided Design & Computer Graphics, 2025, 37(3): 439-445.
|
| [33] |
VAN DER MAATEN L, HINTON G. Visualizing data using t-SNE[J]. Journal of Machine Learning Research, 2008, 9(86): 2579-2605.
|
| [34] |
边荣正, 张鉴, 周亮, 等. 面向复杂多流形高维数据的t-SNE降维方法[J]. 计算机辅助设计与图形学学报, 2021, 33(11): 1746-1754.
BIAN Rongzheng, ZHANG Jian, ZHOU Liang, et al. t-SNE for complex multi-manifold high-dimensional data[J]. Journal of Computer-Aided Design & Computer Graphics, 2021, 33(11): 1746-1754.
|
/
| 〈 |
|
〉 |