Study on Electricity Settlement Mechanism Considering Competitiveness of Electricity Markets
Publié en ligne: 29 sept. 2025
Reçu: 28 déc. 2024
Accepté: 23 avr. 2025
DOI: https://doi.org/10.2478/amns-2025-1098
Mots clés
© 2025 Fangping Gao et al., published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.
Spot market, in economics, generally refers to the market for immediate delivery of commodities, however, due to the special characteristics of real-time balancing of electricity commodities and the inability to store them on a large scale, the trading period of the electricity spot market is often extended to the moment of real-time operation up to the day before the real-time operation, forming a three-tiered market of the day before day-intraday-real-time [1-2]. In terms of the specific market model, the power spot market includes decentralized and centralized market, in which the decentralized market refers to the power generator and power buyer to sign a bilateral contract, and according to the contract of self-scheduling, grid scheduling agencies to ensure the implementation of the contract at the same time, to maintain the balance of the system power, while the centralized market is the use of the whole amount of power centralized bidding in the form of the system security constraints to get the combination of units and the power curve The centralized market is based on the system security constraints to obtain the unit mix and output curve, which is closely connected with the grid operation [3-6].
Different countries and regions design their own spot market structure according to the power structure, power system and other factors. The trading and settlement mechanism of the power market is a prerequisite for the normal operation of the entire power market. The trading and settlement mechanism of the power market directly affects the operation results of the power market [7-9]. Only through a scientific and appropriate transaction and settlement mechanism can the smooth development of the power market be maintained, and only in this way can the fundamental interests of market players be better protected [10-11].
Since China launched a new round of power system reform and development work, with the increasing proportion of power market transactions, as well as the corresponding power trading work continues to deepen, the market access standards are further relaxed, a large number of power sales companies continue to influx, the proportion of renewable energy sources and the corresponding proportion of transactions continue to grow, the power market presents a widening of the scope of the opening up of the diversification of the main body of the transaction, the type of transaction, transaction cycle Diversification of the significant development characteristics, and these changes in turn further increase the power market transaction settlement requirements, it can be seen that the establishment and improvement of the power market transaction settlement mechanism should not be delayed [12-15].
The power market has a variety of trading methods, in order to realize the form of trading as a classification basis can be divided into spot trading, forward contract trading and futures, options trading three types of trading, according to the time scale of the transaction to be divided, can be divided into ultra-short-term power transactions, short-term transactions, medium- and long-term transactions [16-17]. Electricity spot trading can be further divided into day-ahead market electricity trading, real-time market electricity trading and so on. Its main features are shorter time or real-time quotes, real-time transactions, etc., in the price fluctuations have a frequent and large range of characteristics. The power market forward contract type trading, refers to the way through the signing of forward contracts to carry out [18-20]. In the corresponding contract will be clearly put forward the total power trading volume, as well as the corresponding principles and methods of apportionment, so as to facilitate the subsequent operation of power trading.
Electricity market trading is of great significance for the normal functioning of the social economy, in which the trading mechanism, regulatory model has been the research hotspot in this field. Literature [21] conceived an optimized and improved bidding and settlement strategy in the market before the day of generating enterprises, and introduced the league champion algorithm for numerical examination, and found that the power bidding and settlement strategy envisaged in the study enhanced the expected returns. Literature [22] envisioned a dynamic regulatory market mechanism for electricity and corroborated it with three regional cases, confirming that this dynamic market regulatory mechanism for electricity effectively improves the current situation of financial settlement in the wholesale electricity market. Literature [23] verified through simulation experiments that electricity retailers can reduce EDS costs by choosing partners, while the coalition revenue allocation method proposed in the study is more scientific and targeted compared to the cooperative gaming method.
Literature [24] explores the bidding procedures and fairness of market parties under the three settlement mechanisms of locational marginal pricing, regional pricing and average system pricing in the electricity spot market, and proposes a strategy for selecting the electricity pricing mechanism that is suitable for China’s national conditions. Literature [25], based on the current situation of power market transaction settlement, explores and explores the effective operation mode of the transaction settlement mechanism, aiming to promote the establishment of a more advanced power market transaction settlement mechanism. Literature [26], in order to solve the problems of electricity tariff diversion and imbalance cost sharing in the bilateral electricity market under China’s dual-track system, utilizes the dual settlement system of day-ahead benchmark, real-time volume difference and contractual spread, which effectively enhances the market participation enthusiasm of the main parties of electricity sales and consumption. Literature [27] analyzes the Brazilian wholesale electricity trading model in depth, and argues that its liberalized market environment promotes large-scale investment in low-carbon power generation, but it is weak in regulatory design and prone to imbalances that generate financial and fiscal pressures. Literature [28] conducted a study on the single clearing mechanism in the Brazilian electricity market, revealing that the dual clearing system helps to mitigate anti-competitive behavior and promote market efficiency. Literature [29] studied two relatively novel peer-to-peer (P2P) power exchange settlement mechanisms, and concluded that this novel power settlement transaction mechanism reduces the transaction costs of both parties to the transaction compared to the traditional power transaction settlement mechanism. Literature [30] conceptualized an Ethernet-based power energy trading and settlement framework, which is of positive significance for the improvement of transaction transparency and security. The above study investigates the electricity market bidding mechanism from the perspectives of expected benefits, transaction efficiency, transparency and fairness.
In this paper, based on the infinite repeated game theory, the bidding process model of power producers and the International Organization for Standardization (ISO) electricity market clearing model are constructed by using the SFE model, and the MADDPG deep reinforcement learning algorithm is used to realize the solution of the model. In order to verify the feasibility of the model and the algorithm, the improved IEEE 33-node distribution system is selected for case study analysis, which analyzes the superiority of the game theory-based bargaining model over the constant/time-sharing tariff method and compares the performance of the MADDPG algorithm with the traditional reinforcement learning algorithm in terms of the optimization of offer strategies. Finally, based on the model construction and solution, a universal settlement mechanism for the electricity market is designed in combination with the principle of electricity settlement mechanism.
To explore the power settlement mechanism and optimize it in a competitive power market under consideration, the first step is to make the power market reach Nash equilibrium. To this end, this chapter constructs an intelligent body equilibrium model of the electricity market and introduces a reinforcement learning method to solve the model.
Generator Bidding Generation vendor bidding is an act in which a power producer bids to the demand side of the electricity to provide electricity services with the aim of offering the best price in order to obtain an order from the demand side of the electricity. SFE model-based bidding process among power producers The SFE model is a mathematical modeling method used to support the bidding process of power generation manufacturers [31]. The basic idea of the SFE model is that the power generation manufacturers first estimate their optimal bidding price with the maximum profit they can make based on their maximum biddable price, as well as the prices of their competitors in the bidding process. The bidding process of the power producer is studied using the SFE model. The cost function of the power producer is modeled as a quadratic function of its output power:
where
where
At time interval
where
ISO market clearing refers to a set of standards developed globally by the International Organization for Standardization (ISO) for the management and clearing of market transactions.
Electricity consumption under load at interval time t is modeled with a linear demand curve:
Where,
Satisfying the conditions of nodal power balance, branch circuit current constraints, and generator output constraints, the ISO clears the market with the objective of maximizing the total benefit to society. The market clearing in the time interval can be expressed in terms of the DC tidal model as:
During this time interval, the revenue of each generation manufacturer is:
Game Theory Game theory is the branch of mathematics that studies the choices made by decision makers under the influence of each other. The core idea of game theory is to model the behavior of participants to understand the possible outcomes and optimal strategies [32]. The analytical methods of game theory include game tree, Nash equilibrium, optimal strategy, game matrix and so on. Static game theory Static game theory is a branch of game theory, which mainly studies the process of participants making decisions according to their own interests within a certain period of time. Strategic game is a typical static game, which is divided into pure strategic game and mixed strategic game according to whether the participants’ actions are random or not. A purely strategic game consists of a set of participants, a set of actions, and a utility. For a game with Mixed-strategy games add mixed-strategy sets to pure-strategy games. The content of the decision-making of the participants
The constraints on actions in a strategic game are implicit in the set of actions, but are expressed in this way on the assumption that the set of actions
Where:
The operation of the electricity market can be viewed in static game theory as a static game Γ
Infinite Repetition Game Theory Infinite Repetition Game Theory is a theoretical discipline that studies the strategies and outcomes of repeated game situations in the game process. In the infinite repeated game, the game participants will face multiple opportunities to play the game, and the result of each game will affect the result of the next game. Therefore, participants in infinitely repeated games need to consider long-term interests and strategies rather than focusing only on short-term interests. Repeated sales in the electricity market can be modeled as an infinitesimal sequence statistic Γ1, Γ2… discount factor
Factor
If the static game Γ Nash equilibrium Nash equilibrium is an important concept in game theory, which refers to a game in which all players choose the optimal strategy, and no player can obtain a higher payoff by unilaterally changing the strategy. In a Nash equilibrium, if for all participants
Then Mixed Strategy Portfolio
The application of Nash equilibria in infinitely repeated games can help players determine the best strategy, respond to changes in their opponents’ strategies, and deal with issues such as cooperation and betrayal to maximize long-term gains.
The whole model framework can be solved iteratively based on Multi-intelligent Reinforcement Learning (MARL) or Multi-intelligent Deep Reinforcement Learning (MADRL) methods, where the ISO market clearing model belongs to the Mixed-Integer Quadratic Programming Problem (MIQP), which can be computed optimally with the help of commercial solvers Cnley or Gurobi.
Multi-intelligent (deep) reinforcement learning uses each power generator as an intelligent body with autonomous learning capabilities, and the day-ahead market with which it interacts as the environment. Its elements include:
State Action Strategy Payoff The state transfer of the intelligent body is determined by the market clearing process.
In order to avoid the computation falling into local optimal solutions and to improve the convergence speed, the WoLF mechanism (win or learn fast) is combined with the strategy hill-climbing algorithm to form the WoLF-PHC reinforcement learning algorithm, which solves for the optimal offer strategy of the power generator and the market equilibrium outcome [33].
The main idea of the WoLF mechanism is that by adjusting the learning rate to learn slowly and cautiously when the strategy performs well, and learn fast when it performs poorly, so that the intelligences can quickly adapt to the strategies of other intelligences when they perform worse than expected, while reserving enough time for other intelligences to adjust their strategies when they perform better than expected.
The solution process of multi-intelligence reinforcement learning is schematically shown in Fig. 1. Before the start of each trial, each intelligent body selects an offer in the offer space according to its own state and strategy in a sampling way such as roulette, and the returns of each intelligent body and its new state are obtained through market clearing calculation, and the intelligent bodies update their strategies accordingly, and finally reach the market equilibrium.

Solution flow diagram of multi-agent reinforcement learning
The specific process is as follows:
Discretize the state space and action space. Generate strategies and select offer curves. The strategy corresponds to the probability level of selecting each Market clearing. The market clearing model is used to calculate the winning power and its revenue of each intelligent body, and this clearing result is fed back to the intelligent body for updating the strategy.
where Average strategy update. Updates the expected value of the intelligent body’s historical strategies:
where Strategy update:
Among them:
where
Based on Eqs. (15)~(17), the WoLF mechanism is added. In this paper, the difference between the average strategy and the current lower
where
In this paper, the MADDPG method is used to iteratively solve the two-layer model [34]. MADDPG uses the fully connected neural network as the simulation of the policy function
The MADDPG framework is shown in Fig. 2. Each intelligence is mainly composed of three modules, Actor, Critic and experience playback storage. The whole framework is carried out using centralized training and decentralized execution: a centralized approach is used to train each network, where the

MADDPG framework
The process of iteratively solving the market equilibrium two-layer model based on the MADDPG algorithm is shown in Fig. 3.

Flowchart of MADDPG solving two-layer model
First, each strategy network generates action set
Next, a
Then, the main strategy network (AON) determines the sample actions
Finally, the target network periodically copies the parameters from the main network and performs soft updates, as shown in equation (26). The relevant equations are as follows:
Gaussian noise
where Master The training objective of the master
Among them:
Its gradient can be calculated according to the automatic differentiation technique,
where Master policy network training The deterministic strategy gradient formula is:
According to the Monte Carlo method, substituting the sampled dataset into the above equation can be used as an unbiased estimate of this expectation by rewriting the equation as the sampling strategy gradient:
where Update the target network parameters:
where,
Example model setup: in this paper, a modified IEEE33 node power distribution system is used as an example. Its system wiring is shown in Figure 4.

Distribution system of the IEEE33 nodes
The Distributed Energy Operator (DGO) operates WG-BESS only at nodes 24 & 30, and the Load Aggregator (LA) operates PV-BESS only at nodes 8 & 24. The Distribution Network Operator (DNO) puts in place a gas turbine M at node 14, which has a maximum usable power of 160 kW. LA has a controllable IL at node 14, which has a maximum controllable power of 90 kW, and a maximum continuous control duration is 6 hours. Both interruptible loads and gas turbines participating in the market transaction are used to assume the responsibility of guaranteeing power supply. The three-party game relationship is shown in Table 1.
Tripartite game relationship
Players | |||
---|---|---|---|
Market transaction load number | DNO | DGO | LA |
8 | √ | × | √ |
24 | √ | √ | √ |
30 | √ | √ | × |
For
The energy storage system and other related parameters are shown in Tables 2 and 3.
Air storage and optical storage system parameters
ID | Capacity/ (kW-kW·h) | Maximum charge and discharge power /kW | Minimum storage power / kW·h | Charge and discharge efficiency | Final period charge requirement / kW·h | |
---|---|---|---|---|---|---|
8 | PV-BESS | 550-800 | 50 | 120 | 0.9 | 400 |
24 | WG-BESS | 700-800 | 70 | 60 | 0.9 | 300 |
PV-BESS | 700-800 | 70 | 60 | 0.9 | 400 | |
30 | WG-BESS | 550-800 | 60 | 60 | 0.9 | 300 |
Other simulation example parameters
Parameters | Value | Parameters | Value |
---|---|---|---|
0.30 | 12000 | ||
0.07 | 12000 | ||
0.07 | 0.95-1.15 | ||
0.16 | 0.90-1.10 | ||
12 | 0.97-1.12 |
In order to illustrate the validity of the model in this paper, the current mainstream trading mechanism in the electricity sales market is introduced: fixed tariffs or time-sharing tariffs for comparison.
Scenario 1: New energy in the distribution network is sold at a fixed tariff.
Scenario 2: Time-of-day tariffs are adopted, and the new energy sources have the right to formulate their own electricity sales strategies.
In Scenarios 1 and 2, the interruptible loads do not participate in the market, the distribution network maximizes the acceptance of the new energy, and there is no market gaming behavior among the three parties.
Scenario 3: The three-party game is conducted by the method of this paper.
Scenario two is solved by Q-learning algorithm and scenario three is solved by Nash-Q method.
In terms of economic benefits, the benefits and costs of the parties in the three scenarios are shown in Table 4. In Scenario 1, since the fixed tariff has no guiding effect on the new energy output, the new energy output is completely determined by its own characteristics, and its own profit is the lowest level among the three scenarios, while the cost of the gas turbine called by DNO to undertake the guaranteed power supply mechanism is the highest among the three scenarios. Scenario 2 adopts time-sharing tariff, the distribution network to maximize the acceptance of new energy, in the midday and late hours of the price of higher new energy supply power is larger, so the LA and DGO profit over the scenario 1 were increased by 268.27 yuan (12.93%) and 191.89 yuan (6.56%), respectively. 143.35 yuan (6.9%). The time-of-use tariff directs new energy sources to participate in load peaking, so the DNO’s cost of utilizing the gas turbine is reduced by 23.67 yuan (4.81%) compared to Scenario One.
Benefits and partial costs in different scenarios
DNO | LA | DGO | |||||
---|---|---|---|---|---|---|---|
Profit/yuan | Call gas turbine cost / yuan | Purchase IL cost /yuan | Safeguard the cost of the power supply / yuan | Profit/ yuan | IL sells electrical benefits/ yuan | Profit/ yuan | |
Scene 1 | 2157.16 | 492.51 | 0 | 0 | 2074.31 | 0 | 2924.35 |
Scene 2 | 2341.52 | 468.84 | 0 | 0 | 2342.58 | 0 | 3116.24 |
Scene 3 | 2738.67 | 174.38 | 254.17 | 0 | 2617.63 | 254.17 | 3435.19 |
The results of the tripartite game of load supply power at node 24 under the three scenarios are shown in Fig. 5. In the figure, P-DNO, P-WG, and P-PV are the power supply of DNO, DGO, and LA, respectively, and EV-WG and EV-PV are the values of power stored in wind storage and optical storage, respectively. Figures (a)~(c) represent the power supply game results of Scenario 1, Scenario 2 and Scenario 3, respectively.

Tripartite power supply game results of node 24 under different scenarios
From Fig. 5(b), it can be seen that in the midday period the optical storage system and wind storage system participate in the power supply with the maximum power under the premise of satisfying the tripartite revenue, which leads to the DNO power supply power appearing to be a minimum of 11kW, while the new energy source in the evening period reduces its own power supply power in order to satisfy the constraints of the energy storage equipment, and thus the DNO has a maximum power supply power of 347kW. The difference between the peaks and valleys of the supply is 336kW. It can be seen that the time-sharing tariff mechanism can improve the profit of new energy feed-in, but it will sacrifice the profit of DNO and increase the degree of fluctuation of DNO power supply.
Comparing Fig. 5(c) with Fig. 5(a) and Fig. 5(b), it can be seen that the DNO’s output increases significantly during the peak load hours, and the DNO competes with the new energy for the load supply power by adjusting the offer price, which improves the position of the DNO in the market game, and also inhibits the tendency of the new energy and other subjects to arbitrarily supply power for the purpose of pursuing profits to a certain extent. At the same time, because DNO has the bargaining power, DNO output in other hours is more moderate, the difference between peak and valley of power supply is only 128kW, and the pressure of DNO power supply becomes smaller.
Scenario 3 adopts game bargaining, where all three parties have bargaining rights, and the load supply power and benefits obtained by the three parties in each round of the game for a typical time period of

The three forces in each round at t=14

Tripartite benefits in each round at t=14
At the beginning of the game, the initial offer of the three parties differs greatly, and the load supply power obtained from the three-party competition fluctuates greatly in each round of clearing, so the profit obtained by each party also fluctuates. In order to compete for a larger load supply, the three parties adjust the unit price of electricity downward, so the benefit of each party is gradually declining. Late in the game, due to the limitations of the market rules, the parties must adjust the price of electricity within a certain range, the price of electricity tends to stabilize the value of the three parties to profit also tends to stabilize. The game reaches equilibrium. But at this time, the price of electricity is still higher than the fixed price, so compared with scene one, the new mechanism can enhance the interests of all parties, but also incentives for new energy and other subjects to actively participate in the market, is conducive to the promotion of market reform.
The purpose of this section is to compare the differences brought by the traditional reinforcement learning algorithm (DQN) and the MADDPG deep reinforcement learning algorithm in the solution process in order to verify the effectiveness of the proposed MADDPG method. The default hyperparameters of the traditional reinforcement learning algorithm and the MADDPG deep reinforcement learning algorithm are set as follows: training rounds N=12000, time interval
In the experiment, one day is designated as a training round, and the optimal offers of each time interval are added together to obtain a mean value to observe the power generator’s offer strategy in an unknown environment. The results of the convergence curves for the offer strategy variable

Convergence curve of quotation strategy variable
Based on the generator
The comparison results of the optimal offer strategy variable

Comparison of the best quotation strategy variable under the two algorithms
The results of the comparison of the strategy variance as well as the total returns of the two algorithms are shown in Table 5. It can be seen that the strategies learned under the MADDPG algorithm have smaller variance, more stable results, and obtain higher total return on rounds, reaching a mean value of $2206.52. This validates the effectiveness of the MADDPG deep reinforcement learning algorithm proposed in this paper.
Other simulation example parameters
Algorithm | Average round total return |
Standard deviation of total return |
||
---|---|---|---|---|
DQN | 0.2617 | 0.04021 | 812.38 | 542.13 |
MADDPG | 0.2504 | 0.03257 | 2206.52 | 478.24 |
Synthesizing the construction and solution of the intelligent body equilibrium model of the power market in the previous paper, this paper realizes the design of the universal settlement mechanism of the power market on the basis of the principle of power market settlement.
The specificity of the power market and the indispensability of the power spot market deviation power settlement The same as general commodity trading, a variety of power trading products in the trading method, trading volume, price formation mechanism, delivery mode, settlement mode, etc. have their own rules. In addition to power financial derivatives trading, the power market and general commodity trading market is different, must establish centralized trading power spot market, and in the power spot market to establish deviation power settlement mechanism, the reason is four aspects:
electric energy products with hair transmission and distribution of electricity at the same time to complete the specificity of the power system operation and power market transactions must also maintain real-time balance of power generation and consumption of electricity power and meet the safety of the grid operating conditions, in order to ensure the safe, reliable and high-quality operation of the grid and the implementation of power trading contracts, which requires the establishment of a replacement for the original planning and scheduling of the market machine, i.e., the spot market for electric power and electric power auxiliary services market. If each electric energy transactions can be achieved in accordance with the agreed power delivery, in the normal operating state of the grid, through the power market mechanism (including transmission blocking management and then scheduling of power generation and consumption) will be able to fully guarantee the real-time balance of the power system power. However, due to the uncertainty of renewable energy generation output, temporary maintenance of generation and transmission equipment, etc., the supply side may not be able to provide electricity as agreed in the transaction. The demand side may also deviate from the agreed curve due to load forecasting errors, which may cause actual power imbalance and jeopardize the safety of grid operation. Therefore, it is necessary to establish a deviation power settlement mechanism with different constraint strengths according to the performance awareness, performance ability and potential of the market players, so as to promote the market players to assume the responsibility of power balancing according to their power trading agreements, i.e., generating and consuming electricity in accordance with the trading agreements, or paying the cost of breach of contract (i.e., assuming the economic responsibility), which is used to make up for the expenses incurred by the power dispatch organization for maintaining the real-time power balancing of the system. In terms of a time period, each electric energy transaction of the power generation enterprise should be delivered in that time period of electric energy, at the same time sent by the power grid to the power users to complete the delivery of electricity, each market player in a time period of the delivery of electric energy can not be in accordance with each of its transactions individually measured, each time period of the actual power generation or consumption of electricity is only a measurement data available for the settlement of the use of, and thus can only be on the total deviation from the total electric energy transactions Therefore, only a unified calculation and settlement rule can be formulated for the total electric energy transaction deviation. In this way, all the electric energy transaction contracts can be regarded as having carried out the electric power delivery according to the agreement, and can be decoupled from each other and settled separately, and there is no question of the order of settlement. Considering the time sequence, the electricity spot market is the last market mechanism to guarantee real-time power balance, and the power imbalance caused by defaults in the delivery of all electricity transactions and the short-term costs paid by the power dispatch organization for guaranteeing real-time power balance can be reflected in the price of the electricity spot market. Therefore, all power markets are required to establish a deviation power settlement mechanism in the power spot market. Principle of Settlement of Deviation Power in Electricity Markets For centralized power markets, power financial derivatives transactions are used to hedge spot market price risks, and cash settlement is implemented for these trading contracts without physical delivery. The deviation power calculation and settlement rules are only related to the trading products and their trading rules in the electric energy spot market. For decentralized electricity markets, medium- and long-term trading contracts for electricity energy are subject to physical delivery, and electricity spot is traded in the day-ahead market, the intraday market and the real-time balancing market. Considering the time sequence, the real-time balancing market is the last market mechanism to guarantee the real-time power balance of the system. The power demand in the real-time balancing market is the difference between the actual system load and the sum of the traded demand in the medium- and long-term power market, the day-ahead market, and the intraday market, i.e., the demand-side load forecasting errors prior to the real-time balancing market, the power imbalances caused by delivery defaults of all power transactions, and the upward or downward adjustments of unit output by the power dispatchers to guarantee the real-time power balance and their short-term costs are all able to be reflected in the reflected in prices in the real-time balancing market.
In view of the special characteristics of the electricity market described above, in order to study and design the settlement mechanism of the electricity market, it is first necessary to clarify the relationship between the trading of various power products and power dispatch, contractual delivery, and its relationship with the settlement of the spot electricity market.
Electricity financial derivatives trading cash delivery system, its contract power does not need to be physical delivery, and therefore do not need power scheduling agencies to arrange power generation plan, there is no contract power and the actual power generation and use of the difference between the settlement problem. Therefore, the settlement of the electricity spot market has nothing to do with electricity financial derivatives transactions. However, as the spot market price of electricity is borrowed as the settlement reference price for power financial derivatives transactions, its transaction price is greatly influenced by the expectation of the future spot market price. Conversely, the trading scale, trading price and liquidity of the power financial derivatives market have a greater impact on power investment, and will also have a certain impact on the future power spot market price. Therefore, the fairness and reasonableness of the trading mechanism and settlement mechanism of the electric power spot market are crucial to the benign operation of the whole electric power market and the play of the market mechanism to optimize the allocation of resources.
Electricity medium- and long-term transactions belong to physical transactions, which require physical delivery. Even if a power generation enterprise transfers its physical contract to a third party, the third party has to provide electric energy and make electric power delivery according to the contract. The physical delivery of the medium- and long-term power contract has to be transmitted through the power grid, and is therefore subject to transmission capacity constraints. Both parties to the transaction need to formulate a trading strategy and carry out the power transaction in accordance with the available transmission capacity of the relevant transmission line (or section) released by the market, so as to avoid economic losses caused by the blockage of transmission and failure to realize the delivery of power when the power delivery is carried out. If necessary, the contract terms and conditions should also agree on the economic responsibility of both parties in the event of economic losses caused by transmission blockage.
The relationship between trading, scheduling, delivery and settlement in the spot electricity market varies according to the market model. In the centralized power market, except for a few units tasked with guaranteeing the safe and stable operation of the power grid, all centralized power sources are required to trade their feed-in power through the power spot market. The generation curves of the units cleared in the day-ahead market and the real-time market are then used as the generation dispatch planning curves to realize the power delivery and settled in accordance with the market clearing price. The market clearing price can be the market clearing price; if the length of the market clearing period is 5 minutes and the length of the market clearing period is 15 minutes, the market clearing price is the weighted average or arithmetic mean of the market clearing prices of the three 5-minute market clearing prices within 15 minutes. If the market clearing adopts the node marginal tariff and the market settlement adopts the regional tariff, the market settlement price of each region is the weighted average of the market clearing prices of each node in the corresponding region. In addition to captive power plant users, all power users (or their power purchasing agents) are required to settle in the spot market based on time-phased electricity consumption and market clearing prices.
In the decentralized electricity market, the portion of demand that is met through medium- and long-term trading of electricity does not need to be repurchased in the electricity, power spot market. Demand for electricity that is not met outside of medium- and long-term contracts for electricity is declared in the electricity spot market. Power generation enterprises declare to the power dispatching organization the power medium- and long-term contract transactions of the time trading power (trading curve), the corresponding part of the generating capacity is no longer to the spot market offer, the remaining generating capacity to the spot market offer, to participate in the spot market competition. The power dispatching organization arranges the power generation plan for each generating unit based on the medium- and long-term contract trading curve and the power generation curve of the unit cleared in the spot market, carries out power dispatching and realizes power delivery.
In summary, the universal settlement mechanism in the power market is: the establishment of a deviation power settlement mechanism based on the real-time balanced market price, various types of power transactions to realize the decoupling of the settlement power, the independent settlement of different power trading varieties without the need to stipulate the settlement priority. In view of the large differences in the settlement mechanism of each spot pilot power market and the high cost of construction and upgrading, it is proposed to take the establishment of deviation power settlement mechanism as an entry point, standardize the settlement rules of the power market transactions, realize the separate settlement of each contract, the transparency of the market settlement imbalance funds, and the generalization of the settlement software system. In addition, in order to promote the performance of market players and avoid the security risk of power grid caused by excessive deviation of power quantity in regions adopting decentralized power market mode, differentiated deviation power settlement price mechanism can be adopted at the initial stage of the market according to the impact of deviation of power quantity on the balance of the system, i.e., for the deviation of power quantity that helps to promote the real-time balance of the system, the real-time balance of the market price is adopted for settlement; for the deviation of power quantity that is not conducive to the real-time balance of the system, the punitive pricing mechanism is adopted for the deviation of power quantity. For deviation power that is not conducive to real-time system balance, a punitive pricing mechanism is used for settlement.
In this paper, the construction and solution of the equilibrium model of power market intelligences are realized by combining infinite repetitive game theory and MADDPG deep reinforcement learning algorithm, and the pervasive settlement mechanism of power market is designed by combining the principle of power settlement.
The modified IEEE 33-node distribution network system is selected to analyze the arithmetic case. Among them, the profit of load aggregator (LA), distributed energy operator (DGO), and distribution network operator (DNO) obtained from scenario 2, which adopts time-of-use tariff, is increased by 268.27 yuan (12.93%), 191.89 yuan (6.56%), and 143.35 yuan (6.9%) compared with scenario 1, respectively. And the cost of DNO moving gas turbine is reduced by 23.67 yuan (4.81%) compared with Scenario One. Comparing Scenario 2 with time-sharing tariff and Scenario 1 with fixed tariff, Scenario 3 with game bargaining model is more reasonable in the distribution of benefits, which not only guarantees the safety and quality of power supply, but also incentivizes new energy and other major players to participate in the market, and motivates new energy to actively participate in peaking and reduces the risk of new energy consumption in the distribution network.
In addition, the fluctuation range of the generator’s best offer strategy
Finally, on the basis of systematically studying the relationship between various types of power transactions and power dispatch, power delivery and settlement, this paper proposes a universal settlement mechanism applicable to various types of power market models.