Research on the technical framework and critical path of new energy portfolio prediction based on multi-algorithm fusion
Published Online: Mar 19, 2025
Received: Nov 19, 2024
Accepted: Feb 18, 2025
DOI: https://doi.org/10.2478/amns-2025-0413
Keywords
© 2025 Zhongyuan Yan et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
As environmental protection and energy efficiency issues are increasingly emphasized, the application of renewable energy in power systems has gradually become a research hotspot. However, the intermittency and instability of renewable energy bring new challenges to the power system. In order to solve this problem, renewable energy prediction has an important application value in the power system [1–4].
New energy forecasting techniques are mainly divided into two categories: physical methods and statistical methods. Physical methods are mainly based on meteorological, geographic, environmental and other factors, combined with tools such as numerical weather prediction, to forecast new energy output [5–7]. Statistical methods, on the other hand, are based on historical data and use various statistical models to predict the new energy output. These statistical models include regression analysis, support vector regression, neural network, etc [8–10]. And the current main prediction technique for new energy is support vector regression (SVR), which is used to predict wind and solar power output in power systems [11–14]. By extracting features from historical data and using them as inputs, SVR models can learn patterns in historical data to predict future renewable energy output [15–16]. These techniques can be used individually or in combination to improve the accuracy and reliability of the predictions. And their prediction results are for reference only, because they are affected by many factors, including policy, technology, economy, environment and other factors [17–20]. Therefore, new energy prediction is a complex problem that integrates multiple factors and requires the comprehensive use of various methods and data for analysis [21–22].
This paper analyzes the power calculation of two kinds of power generation, wind power and photovoltaic power generation, sets up the wind and light prediction step, and explains the source of power prediction error of wind and light power generation. Logistic regression model, ARMA model, and gray GM (1, 1) model are proposed respectively, and the induced ordered weighted average operator (IOWA) is introduced to fit each single prediction model, and a new energy power prediction model with multi-algorithm fusion is built. Dividing the weather types, a total of three prediction algorithms, including LSSVM, similar day and combined prediction model based on ordered weighted average operator, are utilized to make short-term prediction of PV power under different weather types. Taking the wind power data as the research object, the ARMA model, LSSVM model and the combined prediction model in this paper are used for the ultra-short-term prediction of wind power.
New energy power prediction technology has developed rapidly in recent years, with hundreds of algorithmic models. There has been a great improvement in data cleaning, meteorological information utilization, and other aspects. However, the new energy-power prediction algorithms from different manufacturers and strategies have their own strengths, and the matching effect with specific prediction scenarios is not consistent. The tolerance of new energy to extreme weather in actual operation is relatively fragile. Existing prediction technology is difficult to achieve accurate predictions in extreme weather, which may further lead to the aggravation of power supply tension.
Therefore, facing the actual production needs of dispatching operation, we explore the use of artificial intelligence technology path, study the new energy power adaptive combination prediction enhancement method considering multi-algorithm access according to local conditions, improve the new energy power prediction accuracy under multi-dimensional scenarios, and provide support for the dispatching operation of new energy-based electric power system under the complex market environment.
The general idea of dividing the weather types is to classify the complex weather types of the study object based on the observed fluctuations in the PV power and the changes in the important factors affecting the power.
In this paper, the weather is classified by calculating and defining the sample entropy of the key influencing factor, solar radiation, for the daily PV power curve under study. Generally, for a dataset consisting of
The data are sequentially composed into a
Define the distance
For a given
For 1 ≤
Increase the number of dimensions to
The sample entropy is defined as:
The sample entropy defines the complexity of this radiometric sequence and has a value between 0 and 1.
Set
The sample entropy reference value is determined by the following equation:
Implementing energy saving and emission reduction measures to promote wind power generation and photovoltaic power generation (hereinafter referred to as wind power) are growing rapidly. Large-scale wind and solar power generation grid operation, its randomness, volatility and uncertainty to the grid safety operation has a profound impact. The contradiction between the safe operation of the power grid and the large amount of new energy consumption is increasingly apparent. In such a situation, the refined management of PV power generation, highly accurate prediction, and the integration of wind power generation into the grid scheduling plan development and real-time operation and control is one of the important measures to solve this problem.
Wind power generation uses the wind to push the windmill blades to rotate, and then through the speed booster to enhance the rotational speed, to push the generator to generate electricity, to realize the process of wind energy to mechanical energy and then to electrical energy. The basic formula for calculating wind power is:
Where
Photovoltaic power generation converts solar energy into electricity through solar photovoltaic panels, and its influencing factors are mainly solar radiation, clear sky index, sunshine hours, clouds, temperature, wind speed and dust. Taking the PV array system as an example, the engineering formula for its output power is:
Where
The basic process of wind and solar forecasting is:
Accurate and detailed research to realize the acquisition and selection of historical wind power generation data and information. Processing of historical wind power generation data. Pre-processing of wind power generation data, including normalization, smoothing and interpolation of missing data, etc. The main processing methods for abnormal data are horizontal processing and vertical processing. Scientific and reasonable construction of wind power generation prediction model, a good prediction model can perfectly summarize the laws and trajectories of the prediction object, the correct selection of the model is a crucial part of the prediction process. According to the prediction error of the comprehensive analysis of the influencing factors, and accordingly the prediction model and algorithm to make reasonable adjustments to the prediction value of the appropriate correction and finalization.
In the actual prediction process, the influence of weather, prediction model, or non-objective factors leads to the generation of prediction errors, which has a certain impact on the prediction accuracy. The existence of wind power prediction errors will have an impact on the reasonable scheduling plan of the power grid, affecting the normal operation of the power grid. In this paper, the main use of historical data information as a reference data information to carry out short-term wind power prediction. Because historical data information is not real-time data information, and there is a certain degree of error, so only to control the effective error in a determined range, to ensure that the prediction value is closer to the actual value, so that the prediction accuracy is higher. The main reasons for the formation of effective errors are usually the following four levels:
The influence of data information. The different sources of data information and the influence of the data collection system cause the collected data to be incomplete, which in turn makes the data information detected by the pre-operational work have a definite difference. When affected by other uncertainties, it can also lead to problems such as lack of data information or errors. Influenced by the role of weather. The power output from wind farms is affected by factors such as wind speed, wind direction and ambient temperature, and the uncertainty of the weather can cause the data information predicted at a certain time period to deviate relative to that output in normal weather, thus seriously affecting the prediction accuracy of power. Selection of prediction model. Since the prediction accuracy of different prediction models is different, in the process of selecting the model, if all the influencing factors are not taken into account, it will also lead to an increase in the error. Therefore, when selecting the prediction model, we need to comprehensively consider the impact of various factors and select the most appropriate model with the highest accuracy after comprehensive consideration and comparative analysis. The influence of human factors. Even if the staff involved in the prediction is experienced, there are limitations in their own technical level. Considering personal subjective factors, there may be data entry errors during data entry, which may cause unpredictable errors in the prediction results.
Combined prediction modeling is the combination of two or more single prediction models in a specific way to form a comprehensive set of combined prediction methods [23–24]. There are many combined prediction methods, but each has its own advantages and disadvantages. Under different data conditions, their prediction effects may be very different, but there is also a certain connection between them, which can complement each other. If the error of a single prediction model is extremely large in the combination of prediction methods, you can opt to eliminate it and use the most suitable combination of several single models.
Considering the characteristics of various single prediction models, the single prediction models are closely linked together by the method of weights, and then the definition of combined prediction model is proposed. In the combined forecasting model, a single forecasting model has a large error, but it will not have a high degree of influence on the whole. The decision maker uses only one forecasting model for forecasting. If the selected model does not match the data at all, the error in the predicted value will be large and may have a great impact on the decision. Whereas, if a combination of forecasting methods is applied for forecasting, there will not be a situation where the error value is too large. Therefore, the combination of prediction methods can further improve the accuracy and reliability of data prediction on the basis of a single prediction method.
Combined prediction aims to organically combine several different prediction methods, and the information and data of these single models are closely combined through the weighting method. Then the weights are calculated using the Shapley value method. Finally, the prediction model is obtained by combining several methods through weights. The first problem to be solved in the combined prediction model is to calculate the weights of each single prediction model, which allows various single prediction methods to be combined more effectively, thus improving the accuracy of the prediction.
The combined prediction weights are determined by the Shapley value method, which is a mathematical method mainly used to solve multi-individual cooperative relationships.The Shapley value method is used to allocate the maximum benefit method, which can reflect the importance of each individual in the cooperation.
By doing so, the method is applied to combinatorial forecasting studies by assuming each forecasting model as an individual in a cooperative relationship. The error generated by each individual model is considered as an economic benefit from the cooperation of n individuals. Finally, the importance of a single model is determined by assigning weights according to the magnitude of its contribution.
The definition is done by the following method:
The prerequisite is that there are various methods of combining forecasts, which are noted here:
For all subsets for any subset The total error value among the combined prediction models is
Assumptions
The weights of the single prediction model are:
In this paper, three forecasting models are combined to propose combined forecasting modeling. The combined prediction modeling can maintain the characteristics of the original system, better reflect the development trend of the system, with higher prediction accuracy, and can be widely adapted to the near-term, short-term, and medium- and long-term predictions.
Logistic regression is a multivariate analysis method to study the relationship between the dependent variable as dichotomous and multicategorical observations and the influencing factors (independent variables), which belongs to probabilistic nonlinear regression. There are dichotomous and multicategorical regression models. For dichotomous logistic regression, the dependent variable Logistic regression model Logistic regression model is to build Since the range of values of Logistic regression model interpretation:
ARMA model is obtained on the basis of AR model and MA model.ARMA model is built as follows.
The ARMA model is identified as follows:
Autocorrelation coefficient The autocorrelation coefficient of the time series The partial autocorrelation coefficient is the conditional correlation between This is a consistent estimate of the partial correlation coefficient. To get a precise estimate of Thus, the partial autocorrelation coefficient of lag Identification of MA models Get:
Identification of the AR model Information about the Multiply both sides together by Identification and establishment of the model In the operation of identifying the model, the autocorrelation coefficient and partial autocorrelation coefficient can not be used to identify the specific form of the model, but only as a basis. Recognition also needs to autocorrelation and partial autocorrelation coefficients constantly data test to select the standard model. For a sequence with number T, its autocorrelation coefficient distribution is:
The distribution of partial autocorrelation coefficients is:
For the smooth sequence
The gray model mainly reveals the process of continuous development and change between things within the system [25].
Grade ratio test judgment By calculating the level ratio for the sequence The sequence of grade ratios is obtained as:
Finally check whether the level ratio is within the tolerable range, if all fall within the acceptable range, modeling for prediction can be done with time series as GM(1, 1). Historical data processing Data change processing is for the level ratio test does not pass the sequence of planning processing to meet the level ratio test, commonly used data transformation methods are, logarithmic transformation, translation transformation and square root transformation method. Establishment of gray prediction model based on differential differential equations Based on differential differential equations to establish the model gray GM (1, 1) prediction model. The following sequence is obtained by summing up the sums of the order Then the difference differential equation corresponding to the gray GM(1, 1) model is:
Its solution is:
A residual test was performed with the test formula:
Generally, the test formula requires
This section introduces the induced ordered weighted average (IOWA) operator. The fitting assignments of each single-phase prediction method are ranked from high to low on the sample interval, and the sum of squared errors is used as a criterion to give the method of determining the weight coefficients of the combined prediction model based on the IOWA operator and analyze the error comparison.
Model construction Let observation { In the prediction method of the Putting The predicted value of the combination of the sequence Therefore, the combinatorial prediction model with the sum-of-squares-of-errors criterion can be equated to the following optimization model:
Let Model Prediction Based on the concept of induced ordered weighted average operator, this section proposes a new combination prediction model, which can be solved by a quadratic programming model to obtain the optimization coefficient of the combination prediction IOWA on the sample space, set to
The ordering principle for the prediction accuracy sequence
The prediction of PV power is an important guide for the operation of PV power plants and the scheduling of power grids. Weather type and PV power show a strong correlation, and changes in weather type cause the PV power curve to change accordingly. PV power has randomness and uncertainty, which makes the current accuracy of PV power prediction error still limited. Therefore, it is particularly important to analyze PV power prediction errors.
PV power has obvious seasonal and daily characteristics, and the influence of the external environment is very obvious in the daily, so the influence of different weather types on the PV power prediction error should not be underestimated.
As a case in point, consider a PV power plant in Ashland, USA in 2019. The weather is roughly divided into three types: sunny, rainy, and cloudy. Then a total of three prediction algorithms are used, including least squares support vector machine (LSSVM), similarity day, and a combined prediction model based on ordered weighted average operator. The short-term prediction of PV power under the three different weather types is shown in Fig. 1.

Photovoltaic power prediction results
As can be seen from the figure, the predictions for sunny days are more accurate and have smaller errors compared to the rainy and cloudy predictions that fluctuate drastically. Whereas in rainy and cloudy weather conditions, the prediction results are relatively poor.
In the rainy day prediction, the similar day with LSSVM prediction model performs better, and the combined prediction model based on ordered weighted average operator proposed in this paper has poor prediction accuracy, and the range of fluctuation always stays above and below 2000W.
The PV power prediction error results are shown in Table 1. From the table, it can be seen that regardless of the prediction method used, the prediction error value on sunny days is the lowest. The prediction error values of the three prediction models on a sunny day are all less than 0.1. This is mainly due to the fact that there are fewer clouds in the case of a sunny day, which reduces the loss of solar rays received by the PV panels. The fluctuations are also relatively smoother and the PV power curve is relatively smoother.
Photovoltaic power prediction error results
| Prediction method | Sunny day | Rainy day | Orb | |||
|---|---|---|---|---|---|---|
| MAE | RMSE/% | MAE | RMSE/% | MAE | RMSE/% | |
| LSSVM | 0.0174 | 3.5965 | 0.0556 | 9.1053 | 0.0689 | 15.0334 |
| Similar day | 0.0136 | 3.1277 | 0.0412 | 8.2114 | 0.0454 | 11.6782 |
| The combination model of this article | 0.0075 | 1.8069 | 0.0295 | 7.6067 | 0.0337 | 6.0381 |
The cloudy weather conditions have the largest prediction error values, which may be mainly due to the fact that cloudy weather conditions are characterized by violent cloud motion disturbances with strong uncertainties, resulting in high output PV power volatility. In the vast majority of rainy weather conditions, the actual PV output power will be lower because of the decrease in solar irradiance on cloudy days, with small disturbances brought about by cloud motion before and after the rainfall. However, the small fluctuations are not as dramatic as in cloudy weather. This shows that the weather conditions have a great influence on the prediction error, and the processing of weather information should not be neglected in the future PV power prediction, and it is necessary to characterize the distribution of the PV power prediction error.
In order to verify the superiority of the new energy power prediction model with the combination of multiple algorithms proposed in this paper, the ultra-short-term prediction of wind power is simulated and compared by using the time series ARMA model, the Least Squares Support Vector Machine (LSSVM), and the combination prediction model in this paper, respectively.
Most of the existing wind power stations are not equipped with meteorological prediction system and lack of corresponding meteorological data, so this section only takes the wind power data as the research object, explores its intrinsic law, and predicts the power generation of wind power stations in the future moment.
Taking wind power data from a wind farm with an installed capacity of 60MW in Henan Province as the research object, 1200 wind power measured data collected during the time period from February 20 to March 8, 2019 are taken as the test data, and the sampling interval of the original data is 15min, and the first 1000 data are taken as the training data of the model, and the last 150 data are taken as the test data of the model for the super short-term prediction. The raw wind power data are shown in Figure 2. The above wind power sequence is obtained as

Original data
The ultra-short-term prediction results of wind power are shown in Figure 3. The experimental ARMA model, LSSVM and the combined prediction model in this paper all have good results for wind power prediction, and can track the changes of the real value in real time, but all have a certain lag. The difference between the real value of wind power and the predicted value of three models is more constant, and the sampling point data reaches the maximum value of wind power between 60 and 80.

Wind power super short-term prediction results
The absolute error of ultra-short-term wind power prediction is shown in Fig. 4.The absolute error value of wind power prediction of LSSVM model is the largest, and the absolute error curve fluctuates significantly.The maximum value of the prediction error of LSSVM model is 5.49.The absolute error value of the combination of prediction model proposed in this paper (the red line in the figure) maintains the overall fluctuation range of the value of the absolute error at [0,3].

The absolute error of the short term prediction of wind power
It can be seen that there is a certain difference in the prediction accuracy of each method, although at different moments, each method has a different degree of error advantage and disadvantage. But in general, the combined prediction model proposed in this paper can maintain a high level of prediction accuracy.
For further objective illustration, the normalized root mean square error
The prediction error of the three prediction models
| Prediction method | |||
|---|---|---|---|
| ARMA | 5.49 | 6.79 | 0.9065 |
| LSSVM | 4.25 | 5.33 | 0.9124 |
| The combination prediction model of this article | 2.73 | 4.26 | 0.9567 |
As with the PV prediction, the combined prediction model in this paper has the smallest prediction error and the largest correlation with the true value, with a correlation coefficient of 0.9567.
This paper is oriented to the actual production needs of power dispatch operation, and proposes a new energy power prediction model with multi-algorithm fusion to improve the accuracy of new energy power prediction under complex weather, reduce the power prediction error of wind power generation, and optimize the power supply.
The Shapley value method is used to determine the weights of the combined model, and the induced ordered weighted averaging operator is introduced to fuse the Logistic model, the time-series ARMA model, and the gray prediction GM (1, 1) model to form the combined prediction model. The prediction errors of the combined prediction model based on the ordered weighted averaging operator are analyzed under three weather types: sunny, rainy, and cloudy. The prediction results of sunny days are more accurate and less inaccurate than those of rainy days and cloudy days, which are characterized by sharp fluctuations and a prediction error value less than 0.1. The wind power sampling data are processed, and the ARMA model, the LSSVM model, and the combination prediction model of this paper are applied to the ultra-short-term prediction of wind power. The prediction accuracies of the three prediction methods have some differences, but the overall fluctuation range of the absolute error of the combination prediction model based on the ordered weighted average operator proposed in this paper is kept at [0,3]. The ultra-short-term prediction performance of the combination prediction model based on ordered weighted average operator proposed in this paper is better than that of ARMA model and LSSVM model, which indicates that the proposed combination prediction model based on ordered weighted average operator is more valuable for engineering practice.
