Acessibilidade / Reportar erro

Short-Term Wind Power Interval Forecasting Based on Hybrid Modal Decomposition and Improved Optimization

Abstract

Accurate wind power prediction can effectively alleviate the pressure of the power system peak frequency regulation, and is more conducive to the economic dispatch of the power system. To enhance wind power forecasting accuracy, a hybrid approach for wind power interval prediction is proposes in this study. Firstly, an Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN) is applied to decompose the initial wind power sequence into multiple modes, and Variational Mode Decomposition is used to further decompose the high-frequency non-stationary components. Next, Fuzzy Entropy (FE) is utilized to assess the complexity of the post-decomposed Intrinsic Mode Functions (IMFs), and different forecasting methods are employed accordingly, the point predictions were obtained by linearly summing the component predictions.Additionally, an improved sparrow search algorithm (ISSA) is used to seek the optimal hyperparameters of the prediction algorithm. Finally, the prediction intervals are constructed using the point prediction results based on kernel density estimation (KDE). The root mean square errors (RMSE) of deterministic predictions are 2.8458 MW and 1.8605 MW, with uncertainty coverage rates of 95.83% and 97.92% at a 95% confidence level.

Key words
uncertainty forecasts; wind power; combinatorial decomposition; ISSA; KDE

Introduction

Due to the continuous development of economy and society, the environmental pollution caused by fossil energy is becoming more and more serious, and non-fossil energy is gradually becoming the main force of energy (Yan et al. 2022YAN Y, WANG X, REN F, SHAO Z & TIAN C. 2022. Wind speed prediction using a hybrid model of eemd and lstm considering seasonal features. Energy Rep 8: 8965-8980.). Wind energy as one of the non-fossil energy sources, due to its clean and non-polluting characteristics of the wind power industry to promote the rapid development of wind power, but the wind power generation of its own intermittency and randomness, on the reliability of the power system operation has brought serious challenges. Therefore, the establishment of a stable and reliable wind power prediction model can effectively improve the power quality and enhance the reliability of the power system (Suo et al. 2023SUO L, PENG T, SONG S, ZHANG C, WANG Y, FU Y & NAZIR MS. 2023. Wind speed prediction by a swarm intelligence based deep learning model via signal decomposition and parameter optimization using improved chimp optimization algorithm. Energy 276: 127526., Zhang et al. 2022ZHANG W & LIU S. 2022. Improved sparrow search algorithm based on adaptive t-distribution and golden sine and its application. Microelectronics & Computer 39(3): 17-24.). The types of wind power prediction are categorized as physical, statistical and artificial intelligence models. Physical models are based on numerical weather predictions (NWP) considering factors such as topography, climate, and season to establish wind power prediction models. However, these models are computationally intensive and complex (Castorrini et al. 2022CASTORRINI A, GENTILE S, GERALDI E & BONFFGLIOLI A. 2022. Increasing spatial resolution of wind resource prediction using nwp and rans simulation. J Wind Eng Ind Aerod 210: 104499.). Statistical modeling combines statistical methods to create a prediction of the wind power value at the next moment in time, without taking into account physical factors such as climate and topography, and is therefore widely used for short-term forecasts (Chen 2022CHEN H. 2022. A comprehensive statistical analysis for residuals of wind speed and direction from numerical weather prediction for wind energy. Energy Rep 8: 618–626.). Statistical models include grey prediction models, time series forecasting models, etc. Nevertheless, relying on local patterns, statistical models can lead to long-term forecasting errors.

Limited by the high computational complexity and intricate processes of physical models and the potential for long-term errors in statistical models, artificial intelligence models have been widely applied in wind power forecasting. Artificial intelligence models include machine learning models such as Support Vector Machine (SVM) (Bi & Qiu 2019BI K & QIU T. 2019. An intelligent svm modeling process for crude oil properties prediction based on a hybrid ga-pso method. Chinese J Chem Eng 27(8): 1888–1894.), Least-Squares SVM (LSSVM) (Li & Jin 2018LI R & JIN Y. 2018. A wind speed interval prediction system based on multi-objective optimization for machine learning method. Appl Energ 228: 2207–2220.), as well as neural network models like Backpropagation (BP) (Feng & Wencheng 2023FENG R & WENCHENG L. 2023. Lssa-bp-based cost forecasting for onshore wind power. Energy Rep 9: 362–370.), Elman Neural Network (Shi et al. 2015SHI Y, YU DL, TIAN Y & SHI Y. 2015. Air–fuel ratio prediction and nmpc for si engines with modified volterra model and rbf network. Engineering Applications of Artiffcial Intelligence 45: 313–324.), Long Short-Term Memory (LSTM) (Bao et al. 2022BAO R, HE Z & ZHANG Z. 2022. Application of lightning spatio-temporal localization method based on deep lstm and interpolation. Measurement 189: 110549., Gao et al. 2023GAO X, GUO W, MEI C, SHA J, GUO Y & SUN H. 2023. Short-term wind power forecasting based on ssa-vmd-lstm. Energy Rep 9: 335–344.), Bidirectional LSTM (Bilstm) (Li et al. 2022LI LL, CHANG YB, TSENG ML, LIU JQ & LIM MK. 2020. Wind power prediction using a novel model on wavelet decomposition-support vector machines-improved atomic search algorithm. J Clean Prod 270: 121817.), and Gated Recurrent Unit (GRU) (Xiao et al. 2023XIAO B & ZHANG B. 2023. Short-term wind power interval prediction based on combined modal decomposition and deep learning. Power System Automation 47: 110–117., Zhao et al. 2023ZHAO Z & WANG X. 2020. Multi-step forecasting of ultra-short-term wind power based on CEEMD and improved time series model. J Sol Energ-T Asme 41(07): 352–358.). Combining signal processing with artificial neural networks has been shown to achieve better forecasting results for wind power. Widely used signal processing methods include Wavelet Decomposition (WD) (Motlagh et al. 2021MOTLAGH MM, BAHAR A & BAHAR O. 2021. Damage detection in a 3d wind turbine tower by using extensive multilevel 2d wavelet decomposition and heat map, including soil-structure interaction. Structures 31: 842–861.), Empirical Mode Decomposition (EMD), Ensemble Empirical Mode Decomposition (EEMD) (Wu et al. 2023WU H, GUO C & SU C. 2023. Short-term wind power combination prediction method based on EEMD-GRU-MC. Southern Power Grid Technology 17(02): 66–73.), Complementary Empirical Mode Decomposition (CEEMD) (Zhao & Wang 2020ZHAO Z, YUN S, JIA L, GUO J, MENG Y, HE N, LI X, SHI J & YANG L. 2023. Hybrid vmd-cnn-gru-based model for short-term forecasting of wind power considering spatiotemporal features. Eng Appl Artif Intel 121: 105982.), Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) (Zhang et al. 2020ZHANG J, LU J, PAN J, TAN Y, CHENG X & LI Y. 2022. Implications of the development and evolution of global wind power industry for china—an empirical analysis is based on public policy. Energy Rep 8: 205–219.), Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN) (Xiao & Zhang 2023XIAO Y, ZOU C, CHI H & FANG R. 2023. Boosted gru model for short-term forecasting of wind power with feature-weighted principal component analysis. Energy 267: 126503.), and Variational Mode Decomposition (VMD) (Liu 2020LIU H. 2020. Wind power prediction based on vmd decomposition and mrmr feature information selection. Master’s thesis. Xi’an University of Technology (Unpublished).). However, WD requires the prior determination of the mother wavelet (Li et al. 2020LI J, ZHANG S & YANG Z. 2022. A wind power forecasting method based on optimized decomposition prediction and error correction. Electr Pow Syst Res 208: 107886.), EMD is prone to endpoint effects and mode mixing issues (Nguyen & Phan 2023NGUYEN THT & PHAN QB. 2023. Hourly day ahead wind speed forecasting based on a hybrid model of eemd, cnn-bi-lstm embedded with ga optimization. Energy Rep 8: 53–60.), EEMD and CEEMD may have some residual noise (Wu et al. 2023WU H, GUO C & SU C. 2023. Short-term wind power combination prediction method based on EEMD-GRU-MC. Southern Power Grid Technology 17(02): 66–73., Zhang et al. 2020ZHANG J, LU J, PAN J, TAN Y, CHENG X & LI Y. 2022. Implications of the development and evolution of global wind power industry for china—an empirical analysis is based on public policy. Energy Rep 8: 205–219., Zhao & Wang 2020ZHAO Z, YUN S, JIA L, GUO J, MENG Y, HE N, LI X, SHI J & YANG L. 2023. Hybrid vmd-cnn-gru-based model for short-term forecasting of wind power considering spatiotemporal features. Eng Appl Artif Intel 121: 105982.). CEEMDAN suffers from residual noise and pseudo-mode problems, while ICEEMDAN effectively addresses these issues (Bommidi et al. 2023BOMMIDI BS, TEEPARTHI K & KOSANA V. 2023. Hybrid wind speed forecasting using iceemdan and transformer model with novel loss function. Energy 265: 126383.). Wang et al. (Wang et al. 2022WANG F, WANG S & ZHANG L. 2022. Ultra short term power prediction of photovoltaic power generation based on vmd-lstm and error compensation. J Sol Energ-T Asme 43(8): 96.) decomposed the wind power sequence into multiple sub-sequences using VMD and then input them into an LSTM network for prediction, but the forecasting results depended on hyperparameter selection. He et al. (He & Tsang 2021HE Y & TSANG KF. 2021. Universities power energy management: A novel hybrid model based on iceemdan and bayesian optimized lstm. Energy Rep 7: 6473–6488.) applied ICCEMDAN to decompose university electricity consumption data into multiple sub-sequences, which were individually predicted using a LSTM optimized by Bayesian optimization, and finally reconstructed to obtain the final prediction. Yang et al. (Yang et al. 2023YANG Q, HUANG G, LI T, XU Y & PAN J. 2023. A novel short-term wind speed prediction method based on hybrid statistical-artificial intelligence model with empirical wavelet transform and hyperparameter optimization. J Wind Eng Ind Aerod 240: 105499.) used EWT to decompose wind speed sequences and optimized the hyperparameters of LSSVM and Gaussian Process Regression (GPR) models using the Differential Evolution Grey Wolf Optimization (DE-GWO) algorithm. They sequentially predicted the decomposed components from low to high frequencies using ARIMA, LSSVM, and GPR models. However, the high-frequency components generated after decomposition suffered from spectral aliasing, leading to significant prediction errors. Liu et al. (Liu et al. 2018LIU H, MI X & LI Y. 2018. Smart multi-step deep learning model for wind speed forecasting based on variational mode decomposition, singular spectrum analysis, lstm network and elm. Energ Convers Manage 159: 54–64.) used VMD for a first-round decomposition of wind speed data and applied Singular Spectrum Analysis (SSA) to further extract the trend information of the data, resulting in improved prediction accuracy. Xiang et al. (Xiang et al. 2020XIANG L, LIU J, SU H, HU A & NING Z. 2020. Study on multi-step prediction of wind speed based on CEEMDAN quadratic decomposition and LSTM. J Sol Energ-T Asme 43(08): 334-339.) used CEEMDAN for a second-round decomposition and fed all the sub-sequences into an LSTM network to obtain wind speed predictions with higher accuracy. However, the large number of sub-components generated by the second-round decomposition made the computation relatively cumbersome. To address this, aggregation processing was combined to significantly reduce the computational burden and enhance the model’s prediction accuracy and stability (Suo et al. 2023SUO L, PENG T, SONG S, ZHANG C, WANG Y, FU Y & NAZIR MS. 2023. Wind speed prediction by a swarm intelligence based deep learning model via signal decomposition and parameter optimization using improved chimp optimization algorithm. Energy 276: 127526.). Xiong et al. (Xiong et al. 2023XIONG J, PENG T, TAO Z, ZHANG C, SONG S & NAZIR MS. 2023. A dual-scale deep learning model based on elm-bilstm and improved reptile search algorithm for wind power prediction. Energy 266: 126419.) employed CEEMD for a first-round decomposition, used Sample Entropy (SE) to reduce sequence complexity, treated the sequences with Random Forest in the second round, and then applied BiLSTM and ELM models to separately predict high-frequency versus low-frequency sequences. They optimized the model parameters using the Improved Spider Monkey Algorithm (IRSA) and finally aggregated the results to obtain the final wind power prediction. However, CEEMD suffered from noise residue issues and RF’s instability, rendering the overall model unstable. Xiao et al (Xiao et al. 2023XIAO B & ZHANG B. 2023. Short-term wind power interval prediction based on combined modal decomposition and deep learning. Power System Automation 47: 110–117.) who used quadratic decomposition for wind power series, calculated complexity using SE, and aggregated the components before inputting them into a Bayesian optimized BiLSTM network to obtain point predictions with good forecasting performance. However, the model does not treat the modal components of the different features obtained from the decomposition, and the Bayesian optimized BiLSTM network had limited accuracy in predicting low-frequency components, which in turn affected the model’s overall predictive performance.

Wind speed forecasting involves both deterministic and non-deterministic predictions. While deterministic forecasting can closely approximate the true wind speed, it may fail to represent the uncertainty in the wind power sequence. Non-deterministic or interval forecasting, on the other hand, can better quantify the prediction bias caused by the uncertainty in wind power, thus providing a range of possible outcomes (Xiao & Zhang 2023XIAO Y, ZOU C, CHI H & FANG R. 2023. Boosted gru model for short-term forecasting of wind power with feature-weighted principal component analysis. Energy 267: 126503.). Previous literature has applied Kernel Density Estimation (KDE) to interval forecasting (Niu et al. 2022NIU D, SUN L, YU M & WANG K. 2022. Point and interval forecasting of ultra-short-term wind power based on a data-driven method and hybrid deep learning model. Energy 254: 124384., Zhang et al. 2023ZHANG Y, PAN G, ZHAO Y, LI Q & WANG F. 2020. Short-term wind speed interval prediction based on artificial intelligence methods and error probability distribution. Energ Convers Manage 224: 113346.) and achieved promising results by combining point predictions to obtain interval prediction results.

In view of this, this paper comes up with a hybrid wind power interval prediction model. First, considering the stochastic and intermittent nature of wind power sequences, a two-stage decomposition process using ICEEMDAN, FE, and VMD (IVF) is established to address the issue of excessive sub-component decomposition caused by second-round decomposition. FE is used to aggregate the decomposed components, reducing the non-stationarity of the wind power sequence. Next, GRU and ELMAN models are employed to predict different components, leveraging the time characteristics of the wind power sequence. To select the optimal hyperparameters for the GRU network, an Improved Sparrow Search Algorithm (ISSA) is proposed for optimization. Finally, to overcome the limitations of point predictions, a kernel density estimation model is constructed based on the error sequence of the point predictions to perform wind power interval forecasting. The innovative aspects and contributions of this paper are as follows:

  1. Combining the advantages of composite models in wind power forecasting, this study constructs a (IVF-ISSA-GRU/ELMAN) model for wind power prediction, incorporating second-order signal decomposition, deep learning models, evolutionary algorithms, and aggregation framework.

  2. ICEEMDAN is used to decompose wind power data. To minimize the complexity of the high-frequency components after decomposition, VMD is applied to process the high-frequency components obtained from ICEEMDAN, enhancing their trend and stability.

  3. FE is used to analyze the complexity of the decomposed components, aggregating and classifying components with correlation. Different components are predicted using GRU and ELMAN models.

  4. Two improvement strategies are employed to enhance the optimization capability of SSA for optimizing GRU model parameters. Optimizing the position update formulation of the SSA algorithm using the golden sine strategy, and the golden section coefficients c1 and c2 are improved, with the introduction of the Tent chaotic map for population initialization.

  5. To fully exploit wind power forecasting information, this study performs interval forecasting by combining wind power point prediction results with their error sequences, using KDE to construct wind power interval predictions.

Basic Methods

Signal processing methods

ICEEMDAN

ICEEMDAN addresses the issues of residual noise and pseudo-modes by selecting white noise from the EMD-decomposed modal components (Xiao et al. 2023XIAO B & ZHANG B. 2023. Short-term wind power interval prediction based on combined modal decomposition and deep learning. Power System Automation 47: 110–117.). The basic steps of ICEEMDAN decomposition are as follows:

  1. Use the EMD method to decompose the original wind power signal x once, obtaining the first-order residual r1, as shown in Equation (1).

    z(i)=z+βEk(ω(i))(1)
    r1=τ(xi(t))(2)
    Where Ek() represents the kth-order modal component from EMD decomposition; ω(i)represents Gaussian white noise; z(i) is the time series with white noise added in the i-th iteration; β=φ0std(x(t))/std(Ek(ηi(t))), φ0 is the reciprocal of the expected signal-to-noise ratio between the added noise signal and the analysis signal, where i denotes the number of noise additions, std represents the standard deviation, . denotes the overall mean.

  2. When k=1, calculate the modal component obtained from the first-order residual:

    IMF1(t)=x(t)r1(t)(3)

  3. Obtain the second-order residual and the second-order modal component:

    r2(t)=τ(r1(t)+α1E2(ηi(t)))(4)
    IMF2(t)=r1(t)r2(t)(5)

  4. Similarly in Step 3, calculate the kth-order residual and modal component:

    rk(t)=τ(rk1(t)+αk1Ek(ηi(t)))(6)
    IMFk(t)=rk1(t)rk(t)(7)

  5. Repeat Step 4 to obtain all the residuals and IMF components.

Second decomposition stage based on VMD

VMD decomposes time series by iteratively searching for the optimal solution to a variational problem with constraints, making it a fully non-recursive time-frequency domain signal processing method. It performs well in the decomposition and denoising of high-frequency non-stationary sequences, ensuring data reliability while minimizing noise. The basic steps of VMD decomposition are as follows:

  1. Construct the variational problem with the expression shown in Equation (2).

    {min{uk},{ωk}{k=1kt[δ(t)+jπt*uk(t)]ejωkt22}s.tk=1kuk=s(8)
    Where {ωk} denotes the set of central frequencies for each component after decomposition; {uk} represents the set of modes after decomposition; * denotes the convolution process; δ(t) denotes the unit pulse signal; (t) is the Dirac function; k is the number of modal components; s is the original load sequence at moment t; 22and denotes the number of L2 paradigms.

  2. Introduce Lagrange multipliers λ(t) and the second-order penalty factor α to transform the constrained variational problem into an unconstrained one, resulting in the augmented Lagrange expression shown below (Equation (3)):

    L({uk},{ωk},λ)=αk=1kt[(δ(t)+jπt)]ejωkt22+f(t)k=1kuk(t)22+λ(t),f(t)k=1kuk(t)(9)
    Where λ is the Lagrange multiplication operator; αis the quadratic penalty factor.

  3. Use the alternating direction method of multipliers to update the "saddle points" of the Equation (3), namely, ukn+1, ωkn+1, and λkn+1. Obtain the optimal solution and assess the accuracy of each iteration until the desired accuracy is achieved, terminating the iterations. The update formulas are as follows:

    ûkn+1(ω)=f̂(ω)ikûi(ω)+λ̂(ω)21+2α(ωωk)2(10)
    ωkn+1=0ω|ukn+1(ω)|dω0|ukn+1(ω)|dω(11)
    λn+1(ω)=λn(ω)+λ[f(ω)k|ukn+1(ω)|](12)

  4. Repeat updating ukn+1, ωkn+1, λkn+1, and until the termination condition k(ukn+1ukn/ukn2)<ε is satisfied, then terminate the iteration. Where ûkn+1(ω) is the Vienna filter; f̂(ω), ûi(ω), λ̂(ω) are the Fourier transforms of f(t), u(t), and λ(t), respectively.

Wind power modal decomposition reconstruction based on fuzzy entropy (FE)

Fuzzy entropy is an improved version of sample entropy that combines fuzzy set theory and adopts fuzzy membership degree as the threshold criterion in entropy calculation. Compared to sample entropy, it can better measure the complexity of time series. The calculation steps of fuzzy entropy are as follows:

  1. Perform phase space reconstruction on the time series x(i):

    X(i)={x(i),x(i+1),,x(i+m1)}u(i)(13)
    Where X(i) is the reconstructed sequence, and u(i) is the mean of m consecutive x(i) values.

  2. Define the distance between two vectors X(i) and X(j), expressed as:

    dijm=max{|(x(i+k)u(i))(x(j+k)u(i))|}(14)

  3. Introduce the fuzzy membership degree function to define the similarity degree Aijm, as follows:

    Aijm={1,dijm=0exp[ln2(dijmr)2],dijm>0(15)
    Where r is the similarity tolerance parameter, and r=R×δ , where δ is the standard deviation of the original one-dimensional time series.

  4. Define the function Cim(r), as follows:

    Cim(r)=1Nmj=1,i1Nm+1Aijm(16)

    From the above equation, the relationship dimensionality in m-dimension can be obtained as:

    ϕm+1(r)=1Nmj=1,i1Nm+1Cim(r)(17)

  5. Set m=m+1, repeat steps (1)-(4) to obtain ϕm+1(r).

  6. Finally, obtain the expression for FE as follows:

    FE(m,r,N)=lnϕm(r)lnϕm+1(r)(18)

Prediction algorithm

GRU (Gated Recurrent Unit) proposed by Cho et al, and it is an improvement of LSTM (Xiao et al. 2023XIAO B & ZHANG B. 2023. Short-term wind power interval prediction based on combined modal decomposition and deep learning. Power System Automation 47: 110–117.). GRU combines the input and forget gates of LSTM into an update gate, which controls the information retained from the previous time step to the current state. It also replaces the output gate with a reset gate, which controls previous moment’s state information message written to current candidate set. The structure of GRU is given as Figure 1, and the update formula is as follows:

{zt=σ(wz[ht1,xt,1])rt=σ(wr[ht1,xt,1])h̃t=tanh(wh[rtht1,xt,1])ht=(1zt)ht+ztht1(19)
Where wz, wr, wh, and wo represent the weight values; zt, rt is the outputs of the update gate and reset gate, respectively; xt is the input at the current moment; ht1 is the hidden state vector of the previous cell; []denotes vector concatenation; represents Hadamard product.

Figure 1
GRU Network Structure.

Improved sparrow search algorithm (ISSA)

When SSA initializes the population, the location information of the population is randomly generated. There exists a limited range of search, relatively poor biodiversity, and easy to fall into the local optimum, so the following strategy is used to optimize the SSA (Huang 2021HUANG J. 2021. Research on sparrow search algorithm combining T distribution and tent chaotic map. Master’s thesis. Lanzhou University.). Its update steps are as follows:

  1. Tent Chaotic Map for Population Initialization

    In this study, the Tent chaotic map accepted as initialize the population. It exhibits randomness, ergodicity, and sensitivity to initial values, which can enhance the convergence speed and global search capability of the original algorithm (Gao et al. 2022GAO C, CHEN J & SHI M. 2022. Multi-strategy sparrow search algorithm combining golden sine and curve adaptation. Computer Science and Application 39(02): 491-499.). The Tent mapping has a simple structure and presents a relatively uniform distribution of density in its resulting distribution. Its mathematical expression is as follows:

    xn+1={2xn0<xn<0.52(1xn)0.5<xn<1(20)

  2. Position Update Improvement Strategy

    The Golden Sine Strategy (Tanyildizi & Demir 2017TANYILDIZI E & DEMIR G. 2017. Golden sine algorithm: a novel math-inspired algorithm. Adv Electr Comput En 17(2)., Zhang & Liu 2022ZHANG Y, LI C, JIANG Y, ZHAO R, YAN K & WANG W. 2023. A hybrid model combining mode decomposition and deep learning algorithms for detecting TP in urban sewer networks. Appl Energ 333: 120600.)(Golden-SA) is based on the relationship between the sine function unit and the circle. It scans the entire unit circle to enhance its global search capability and introduces the golden section factor in the strategy to narrow the search range and enhance its local search solution space ability. This strategy enhances both the global search ability and the local exploitation capability of the sparrow population . The improved position update formula for the discoverer is:

    Xi,jt={Xi,jt1|sin(R1)|R2sin(R1)|c1×Xbestt1c2×Xbestt1|,R2<STXi,jt1+QL,R2ST(21)
    Where R1rand[0,2π], which is a random value in the range [0,2], R2rand[0,π]. The sparrow individual’s position information xit=[xi,1t,xi,2t,xi,3t,,xi,dt], (i=1,2,3,,n), Pit represents the optimal value of the sparrow’s global position at the tth iteration; c1=π(12τ)2, c2=π(2τ1)2 , and τ is the golden section coefficient τ=(51)/2. When R2<ST, the finder continues to perform the search work; when, the scout in the population discovers the predator and sends an alert signal to the other sparrows in the population, and the finder flies quickly to other areas by performing the position update according to Eq. (4). The joiner update equation is:
    Xi,jt+1={Qexp(XworsttXi,jti2)i>n2Xbestt+1+|Xi,jXbestt+1|A+Lin2(22)
    Where A=AT(AAT)1, A is a column vector of individual sparrows of the same latitude, with each element inside randomly assigned a value of 1 or -1; the scout’s position update formula is:
    Xi,jt+1={Xbestt+β|Xi,jtXbestt|iffi<fgXi,jt+k(|Xi,jtXworstt|(fifw)+ε)iffi=fg(23)
    where k[1,1]; fi, fg, fw are the individual fitness value of the ith sparrow, the current global best fitness value, and the worst fitness value, respectively; is the step control coefficient and ; is an infinitesimal constant set to prevent the denominator from being zero.

Error confidence interval based on KDE

KDE is a non-parametric estimation method. The theory is to use a moving window to estimate the sample density and get the probability density value of the point by weighted average, and finally get the probability density function of the wind power sequence. The construction method of the prediction interval is as follows:

{Ure=yre+|F1α2|Lre=yre+|Fα2|(24)
Where yre represents the point prediction result of the wind power series; F1α2 is the upper α2 percentile of the point prediction result error sequence, Fα2 is the lower α2 percentile, and α is the interval confidence level; Ure is the upper limit of the interval prediction, and Lre is the lower limit of the interval prediction.

Model Establishment

Modeling process

  1. Data Collection: Wind power data is collected every 15 minutes and outliers are removed from the dataset for preprocessing.

  2. Data Decomposition: Apply the ICEEMDAN method to perform one decomposition on the dataset obtained in Step 1, the initial eigenmode components are obtained. Calculate their FE values, and for components with higher FE values, apply VMD for secondary decomposition to reduce the signal-to-noise ratio of the decomposition data.

  3. Modal Aggregation: For all components obtained in Step 2, use FE to analyze the sub-components and aggregate them based on their correlation. Use GRU and ELMAN for prediction based on the characteristics of the restructured IMF component.

  4. Parameter Optimization: To select the optimal hyperparameters for the GRU network, ISSA is used to optimize the four hyperparameters of the GRU network.

  5. Uncertainty Prediction: To accurately model the future data trend and obtain detailed interval information, use the error sequence obtained from the deterministic projection results and KDE for interval prediction.

Performance metrics

This study uses the Coefficient Of Determination (R), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE) as predictive indexes to test the predictive performance of the points. The prediction interval coverage probability (PICP) is used as the evaluation metric for interval prediction to test the performance of each prediction model.

RMSE=1Mi=1M(yiŷi)2(25)
MAE=i=1M|yiŷi|M(26)
R2=1i=1M(yiŷi)2i=1M(yiyi)2(27)
PICP=1ni=1npi×100%(28)
Where yi is the true value, yi is the predicted value, yi is the mean value of the samples, and M is the total number of sample points. If the true value falls within the prediction interval, yi(Li,Ui), pi=1; otherwise, pi=0.

The flowchart of the IVF-ISSA-GRU-ELMAN model is shown in Figure 2.

Figure 2
The structure of IVF-ISSA-GRU/ELMAN model.

Calculus analysis

Data introduction

In this experiment, data was observed from a wind farm in Xinjiang, China, for two months, March and July, of a certain year. Wind power data was collected every 15 minutes.The first 30 days of data (a total of 2880 sampling points) were used as the training set, and the last day’s data (a total of 96 sampling points) were used as the test set for analysis. Table 1 presents basic information about the wind power for the two months, and Figure 3 shows the wind power data for these two months.

Table I
Basic statistical information of two months of wind power data.
Figure 3
Wind speed data of the two months.

Data processing

Due to the stochastic nature and strong fluctuations in the wind power series, ICEEMDAN was used to perform one decomposition on the wind power series, resulting in IMF components and a residue, as shown in Figures 4a and 5a. Fig. 6 shows the response data plots of each modal component, and Tables 2 and 3 show the specific values of FE for each modal obtained from decomposition.

Figure 4
Results of two-stage decomposition. (a) Decomposition of March wind power series using ICEEMDAN; (b) Decomposition of IMF3 using VMD.
Figure 5
Results of two-stage decomposition. (a) Decomposition of July wind power series using ICEEMDAN; (b) Decomposition of IMF3 using VMD.
Figure 6
FE value for each component. (a) March FE values for each component; (b) July FE values for each component.
Table II
Modal FE values of wind power series for March.
Table III
Modal FE values of wind power series for July.

The high-frequency IMF component concentrates the wind power time series details are more complex and more difficult to predict compared to the low-frequency component, and according to Table 2 and Table 3, it is known that the fuzzy entropy value of IMF3 is the highest, so the VMD decomposition is performed for IMF3.Therefore, VMD was applied to perform a second decomposition on IMF3 to reduce its complexity and eliminate noise, as shown in Figures 4b and 5b. To facilitate subsequent predictions, FE was used to quantify the complexity of the sub-components obtained from the two decompositions, and the components were categorized. As shown in Table 2, the components obtained from the ICEEMDAN decomposition for March were labeled as 1-10, and the components obtained from VMD decomposition (IMF3) were labeled as 11-15. The IMF fuzzy entropy obtained from the two decompositions was recombined based on the principle of similar entropy values. R1 was reconstructed from IMF2-5 from the first decomposition and IMF11 from the second decomposition. R2 was reconstructed from IMF6 and IMF12. R3 was composed of IMF7, 13, and 15. R4 was composed of IMF8-10. For the data from July, after ICEEMDAN decomposition, the components were labeled as 1-11, and the components obtained from VMD decomposition (IMF3) were labeled as 12-16. The IMF fuzzy entropy obtained from the two decompositions is shown in Figure 6, and the recombination was based on the principle of similar entropy values. R1 was reconstructed from IMF2-4 from the first decomposition. R2 was composed of IMF5, 12, 13, and 16. R3 was composed of IMF6, 7, 14, and 15. R4 was composed of IMF8-11. The threshold for determining IMF complexity in this study was set at 0.25. A sequence was considered complex when it exceeded 0.25, and simple when it was ≤0.25. Based on the FE results, for March, IMF1, IMF2, and IMF4 were considered complex sequences and were predicted using ISSA-GRU; IMF3 was considered a simple sequence and was predicted using ELMAN. For July, IMF1-3 were considered complex sequences and were predicted using ISSA-GRU; IMF4 was considered a simple sequence and was predicted using ELMAN.

Comparison results

To validate the effectiveness of our proposed model, a total of 8 models were established, including single models, hybrid models, and distribution comparison experimental models. LSTM and GRU served as the single model control group, while two hybrid models mentioned in the literature (VMD-ICEEMDAN-LSTM and CEEMDAN-SE-VMD-LSTM[35]) were used as the hybrid model control group. The models included VMD-GRU, where the original data was decomposed using VMD, and GRU was used to predict each component; VMD-ISSA-GRU, where the components were predicted using ISSA-optimized GRU after VMD decomposition; and VMD-ISSA-GRU/ELMAN, where components were predicted using either ISSA-GRU or ELMAN based on their complexity levels, for comparison testing.

The warning value of sparrow population was set ST=0.8 and the number of discoverers SD=0.4; the four hyperparameters (L1, L2, Lr, and iteration number K) of the LSTM network were in the range of [10,200], [10,200], [0.001,0.02], and [10,500] for the optimization search, respectively; The LSTM and the GRU solver were both Adam, and the loss function was the sigmoid function, and the gradient threshold is set to 1.Three evaluation metrics, RMSE, MAE and R, were used to evaluate the model deterministic results, and PICP was used to evaluate the non-deterministic prediction results. Models 1-8 represented LSTM, GRU, ICEEMDAN-LSTM, CEEMDAN-SE-VMD-LSTM, VMD-GRU, VMD-ISSA-GRU, VMD-ISSA-GRU/ELMAN, and IVF-ISSA-GRU/ELMAN, respectively. Table 4 presented a comparison of wind power prediction indicators for March and July, while Figures 7 and 8 illustrated error analysis graphs for March and July, respectively.

Table IV
Prediction results of each model.
Figure 7
Error analysis of March dataset.
Figure 8
Error analysis of July dataset.

By analyzing Table 4, Figures 7 and 8, it can be obtained that:

  1. The predictions of LSTM and GRU models for the two-month data sets are similar, with significant differences from the actual values in peak and valley sections, having RMSE values between [42.4948, 53.6756] and [27.8759, 33.6682], and MAE values between [30.9671, 443.8006] and [21.3486, 26.0803], respectively. The graphs visually demonstrate that the predictive accuracy of single models is lower than that of hybrid models.

  2. The RMSE and MAE of the VMD-GRU model for the March data set decreased from 42.4948 and 30.9671 to 17.9028 and 13.0719, respectively, compared with the single GRU model. The RMSE and MAE of the VMD-GRU model for the July data decreased from 27.8759 MW and 21.3486 MW to 23.6342 MW and 17.624 MW respectively. Additionally, both data sets had an increase in R by 0.332 and 0.129, respectively. Furthermore, Model 7 outperformed Model 8, with RMSE and MAE for the March data set decreasing from 3.863MW and 3.1793MW to 2.8458MW and 2.159MW, and R increasing from 0.9967 to 0.9983. For the July data set, RMSE and MAE decreased from 6.7719MW and 4.9615MW to 1.8605MW and 1.4568MW, respectively, and R increased from 0.9735 to 0.99811. This indicates that the second decomposition improved the error of the first decomposition and enhanced the model’s predictive performance.

  3. The results of the distribution comparison models showed that incorporating the ISSA algorithm to optimize the VMD-ISSA-GRU model for both data sets resulted in a 71%-78% improvement in RMSE and a 70%-75% improvement in MAE. Both comparisons verified that ISSA could optimize model parameters and improve predictive accuracy. Additionally, compared to Model 6, Model 7, which predicted different features using GRU and ELMAN networks, had an increase in RMSE by 0.095MW and 0.041MW, and an increase in MAE by 0.09MW and 0.17MW, for the March and July data sets, respectively. This demonstrates that predicting different features using GRU and ELMAN networks can effectively enhance prediction accuracy.

  4. Our proposed ICEEMDAN-VMD-FE-ISSA-GRU/ELMAN model exhibited better prediction results with significantly lower errors compared to the other 7 models. The RMSE values for our model were 2.8458MW and 1.8605MW for March and July, respectively. The MAE values were 2.159MW and 1.4568MW, and R reached 0.99825 and 0.99814, respectively. Compared to the hybrid model control groups (Models 3 and 4) mentioned in the literature, our proposed model showed an improvement in RMSE of [15.2734, 21.4673] and an improvement in MAE of [11.5658, 13.6904] for the July data set, indicating the best fit.

Nondeterministic prediction and effect evaluation

Although our proposed model achieved more accurate wind power deterministic projection results compared to other models, single wind power values obtained from point predictions may not provide sufficient reference for wind farm decision-making. Therefore, based on the point prediction errors mentioned above, we performed interval prediction on the test set. After obtaining the distribution of the error sequence, we used KDE to obtain the 95% and 90% confidence intervals. To verify the method of KDE, we selected the normal distribution estimation method and performed KDE calculations using the Epanechnikow and Triangle kernel functions, and evaluated the prediction results using PICP and PINAW.

Figures 9 and 10 show the interval prediction results based on the Epanechnikow kernel function for March and July, respectively.

Figure 9
March interval prediction results.
Figure 10
July interval prediction results.

Figures 9 and 10 depict the 90% and 95% confidence interval prediction results for our proposed model for the two months. The 95% confidence interval contains 92 and 94 points in the actual wind power measurements for March and July, respectively. Furthermore, for the fluctuation range of wind power, our prediction model includes most of the peak values of wind power, indicating a better ability to predict wind power fluctuations.

Table V
Interval prediction evaluation.

Table 5 presents the interval evaluation result indicators. According to Table 5, the method based on the Epanechnikow kernel function has higher coverage probability (PICP) compared to the Triangle kernel function and NDE estimation method at the same confidence level. For the July data set at 95% confidence level, the PICP of our proposed model is 97.92%, compared to 91.97% and 87.50% for the Triangle kernel function model and NDE estimation method, respectively. This demonstrates that our proposed model has a higher coverage probability compared to the other two models, making it more valuable for real-world decision-making, and proves the effectiveness and superiority of our proposed interval prediction model.

Conclusions

This study proposed a hybrid wind power interval prediction model based on TVF, GRU, ISSA, and ELMAN. Through verification on two datasets, the following conclusions were drawn:

  1. The RMSE of deterministic prediction results of the proposed model in this paper is only 2.8458 MW and 1.8605 MW, which has the smallest value and the smallest error. The largest R-value and the highest goodness-of-fit. The non-deterministic predictions reach 95.83% and 97.92% coverage at 95% confidence level, our model demonstrated superior predictive performance and provided more accurate wind power predictions.

  2. The application of ICEEMDAN and VMD for wind power data processing through secondary decomposition effectively reduced the complexity of high-frequency components from the first decomposition, mitigating adverse effects caused by strong stationarity. The use of FE fuzzy entropy technique for classifying and aggregating components from the secondary decomposition simplified the model while enhancing its predictive performance.

  3. The application of the ISSA evolutionary algorithm improved the parameters of the GRU network, thereby enhancing the wind power point prediction. By utilizing GRU and ELMAN networks for prediction based on the characteristics of components after the secondary decomposition, the time features of wind power sequences were fully utilized, leading to more effective prediction of wind power sequences.

  4. Using the KDE-Epanechnikow method to estimate the error sequence obtained from point predictions enabled us to fully explore its uncertainty information, reducing the limitations of point prediction results and obtaining reliable interval prediction results.

ACKNOWLEDGMENTS

This research was funded by the Key Project of Education Department of Hebei Province, grant number [ZD2020182, ZD2021021]. This research was funded by the Doctoral Fund of Hebei University of Engineering, grant number [20120134]. We sincerely thank the Key Laboratory of Smart Water Conservancy in Hebei Province and the Doctoral Special Fund Project of Hebei University of Engineering for providing simulation data. The authors declare no conflict of interest.

  • BAO R, HE Z & ZHANG Z. 2022. Application of lightning spatio-temporal localization method based on deep lstm and interpolation. Measurement 189: 110549.
  • BI K & QIU T. 2019. An intelligent svm modeling process for crude oil properties prediction based on a hybrid ga-pso method. Chinese J Chem Eng 27(8): 1888–1894.
  • BOMMIDI BS, TEEPARTHI K & KOSANA V. 2023. Hybrid wind speed forecasting using iceemdan and transformer model with novel loss function. Energy 265: 126383.
  • CASTORRINI A, GENTILE S, GERALDI E & BONFFGLIOLI A. 2022. Increasing spatial resolution of wind resource prediction using nwp and rans simulation. J Wind Eng Ind Aerod 210: 104499.
  • CHEN H. 2022. A comprehensive statistical analysis for residuals of wind speed and direction from numerical weather prediction for wind energy. Energy Rep 8: 618–626.
  • FENG R & WENCHENG L. 2023. Lssa-bp-based cost forecasting for onshore wind power. Energy Rep 9: 362–370.
  • GAO C, CHEN J & SHI M. 2022. Multi-strategy sparrow search algorithm combining golden sine and curve adaptation. Computer Science and Application 39(02): 491-499.
  • GAO X, GUO W, MEI C, SHA J, GUO Y & SUN H. 2023. Short-term wind power forecasting based on ssa-vmd-lstm. Energy Rep 9: 335–344.
  • HE Y & TSANG KF. 2021. Universities power energy management: A novel hybrid model based on iceemdan and bayesian optimized lstm. Energy Rep 7: 6473–6488.
  • HUANG J. 2021. Research on sparrow search algorithm combining T distribution and tent chaotic map. Master’s thesis. Lanzhou University.
  • LI J, ZHANG S & YANG Z. 2022. A wind power forecasting method based on optimized decomposition prediction and error correction. Electr Pow Syst Res 208: 107886.
  • LI LL, CHANG YB, TSENG ML, LIU JQ & LIM MK. 2020. Wind power prediction using a novel model on wavelet decomposition-support vector machines-improved atomic search algorithm. J Clean Prod 270: 121817.
  • LI R & JIN Y. 2018. A wind speed interval prediction system based on multi-objective optimization for machine learning method. Appl Energ 228: 2207–2220.
  • LIU H. 2020. Wind power prediction based on vmd decomposition and mrmr feature information selection. Master’s thesis. Xi’an University of Technology (Unpublished).
  • LIU H, MI X & LI Y. 2018. Smart multi-step deep learning model for wind speed forecasting based on variational mode decomposition, singular spectrum analysis, lstm network and elm. Energ Convers Manage 159: 54–64.
  • MOTLAGH MM, BAHAR A & BAHAR O. 2021. Damage detection in a 3d wind turbine tower by using extensive multilevel 2d wavelet decomposition and heat map, including soil-structure interaction. Structures 31: 842–861.
  • NGUYEN THT & PHAN QB. 2023. Hourly day ahead wind speed forecasting based on a hybrid model of eemd, cnn-bi-lstm embedded with ga optimization. Energy Rep 8: 53–60.
  • NIU D, SUN L, YU M & WANG K. 2022. Point and interval forecasting of ultra-short-term wind power based on a data-driven method and hybrid deep learning model. Energy 254: 124384.
  • SHI Y, YU DL, TIAN Y & SHI Y. 2015. Air–fuel ratio prediction and nmpc for si engines with modified volterra model and rbf network. Engineering Applications of Artiffcial Intelligence 45: 313–324.
  • SUO L, PENG T, SONG S, ZHANG C, WANG Y, FU Y & NAZIR MS. 2023. Wind speed prediction by a swarm intelligence based deep learning model via signal decomposition and parameter optimization using improved chimp optimization algorithm. Energy 276: 127526.
  • TANYILDIZI E & DEMIR G. 2017. Golden sine algorithm: a novel math-inspired algorithm. Adv Electr Comput En 17(2).
  • WANG F, WANG S & ZHANG L. 2022. Ultra short term power prediction of photovoltaic power generation based on vmd-lstm and error compensation. J Sol Energ-T Asme 43(8): 96.
  • WU H, GUO C & SU C. 2023. Short-term wind power combination prediction method based on EEMD-GRU-MC. Southern Power Grid Technology 17(02): 66–73.
  • XIANG L, LIU J, SU H, HU A & NING Z. 2020. Study on multi-step prediction of wind speed based on CEEMDAN quadratic decomposition and LSTM. J Sol Energ-T Asme 43(08): 334-339.
  • XIAO B & ZHANG B. 2023. Short-term wind power interval prediction based on combined modal decomposition and deep learning. Power System Automation 47: 110–117.
  • XIAO Y, ZOU C, CHI H & FANG R. 2023. Boosted gru model for short-term forecasting of wind power with feature-weighted principal component analysis. Energy 267: 126503.
  • XIONG J, PENG T, TAO Z, ZHANG C, SONG S & NAZIR MS. 2023. A dual-scale deep learning model based on elm-bilstm and improved reptile search algorithm for wind power prediction. Energy 266: 126419.
  • YAN Y, WANG X, REN F, SHAO Z & TIAN C. 2022. Wind speed prediction using a hybrid model of eemd and lstm considering seasonal features. Energy Rep 8: 8965-8980.
  • YANG Q, HUANG G, LI T, XU Y & PAN J. 2023. A novel short-term wind speed prediction method based on hybrid statistical-artificial intelligence model with empirical wavelet transform and hyperparameter optimization. J Wind Eng Ind Aerod 240: 105499.
  • ZHANG J, LU J, PAN J, TAN Y, CHENG X & LI Y. 2022. Implications of the development and evolution of global wind power industry for china—an empirical analysis is based on public policy. Energy Rep 8: 205–219.
  • ZHANG W & LIU S. 2022. Improved sparrow search algorithm based on adaptive t-distribution and golden sine and its application. Microelectronics & Computer 39(3): 17-24.
  • ZHANG Y, PAN G, ZHAO Y, LI Q & WANG F. 2020. Short-term wind speed interval prediction based on artificial intelligence methods and error probability distribution. Energ Convers Manage 224: 113346.
  • ZHANG Y, LI C, JIANG Y, ZHAO R, YAN K & WANG W. 2023. A hybrid model combining mode decomposition and deep learning algorithms for detecting TP in urban sewer networks. Appl Energ 333: 120600.
  • ZHAO Z & WANG X. 2020. Multi-step forecasting of ultra-short-term wind power based on CEEMD and improved time series model. J Sol Energ-T Asme 41(07): 352–358.
  • ZHAO Z, YUN S, JIA L, GUO J, MENG Y, HE N, LI X, SHI J & YANG L. 2023. Hybrid vmd-cnn-gru-based model for short-term forecasting of wind power considering spatiotemporal features. Eng Appl Artif Intel 121: 105982.

Publication Dates

  • Publication in this collection
    21 Oct 2024
  • Date of issue
    2024

History

  • Received
    9 Aug 2023
  • Accepted
    5 Jan 2024
Academia Brasileira de Ciências Rua Anfilófio de Carvalho, 29, 3º andar, 20030-060 Rio de Janeiro RJ Brasil, Tel: +55 21 3907-8100 - Rio de Janeiro - RJ - Brazil
E-mail: aabc@abc.org.br