Neural Fitted Q Iteration based Optimal Bidding Strategy in Real Time Reactive Power Market_1.

Open AccessPosted Content

Neural Fitted Q Iteration based Optimal Bidding Strategy in Real Time Reactive Power Market_1.

Jahnvi Patel, +3 more

- 07 Jan 2021 -

arXiv: Artificial Intelligence

Chats0

TLDR

In this paper, a pioneer work on learning optimal bidding strategies from observation and experience in a three-stage reactive power market is reported, where a bidding strategy is to be learned from market observations and experience of imperfect oligopolistic competition-based markets.

Abstract:

In real time electricity markets, the objective of generation companies while bidding is to maximize their profit. The strategies for learning optimal bidding have been formulated through game theoretical approaches and stochastic optimization problems. Similar studies in reactive power markets have not been reported so far because the network voltage operating conditions have an increased impact on reactive power markets than on active power markets. Contrary to active power markets, the bids of rivals are not directly related to fuel costs in reactive power markets. Hence, the assumption of a suitable probability distribution function is unrealistic, making the strategies adopted in active power markets unsuitable for learning optimal bids in reactive power market mechanisms. Therefore, a bidding strategy is to be learnt from market observations and experience in imperfect oligopolistic competition-based markets. In this paper, a pioneer work on learning optimal bidding strategies from observation and experience in a three-stage reactive power market is reported.

References

PDF

Open Access

More filters

Proceedings ArticleDOI

A direct adaptive method for faster backpropagation learning: the RPROP algorithm

Martin Riedmiller, +1 more

TL;DR: A learning algorithm for multilayer feedforward networks, RPROP (resilient propagation), is proposed that performs a local adaptation of the weight-updates according to the behavior of the error function to overcome the inherent disadvantages of pure gradient-descent.

...read moreread less

Journal Article

Tree-Based Batch Mode Reinforcement Learning

Damien Ernst, +2 more

- 01 Dec 2005 -

Journal of Machine Learning Research

TL;DR: Within this framework, several classical tree-based supervised learning methods and two newly proposed ensemble algorithms, namely extremely and totally randomized trees, are described and found that the ensemble methods based on regression trees perform well in extracting relevant information about the optimal control policy from sets of four-tuples.

...read moreread less

Book ChapterDOI

Neural fitted q iteration – first experiences with a data efficient neural reinforcement learning method

Martin Riedmiller

TL;DR: NFQ, an algorithm for efficient and effective training of a Q-value function represented by a multi-layer perceptron, is introduced and it is shown empirically, that reasonably few interactions with the plant are needed to generate control policies of high quality.

...read moreread less

Posted Content

Prioritized Experience Replay

Tom Schaul, +3 more

- 18 Nov 2015 -

arXiv: Learning

TL;DR: A framework for prioritizing experience, so as to replay important transitions more frequently, and therefore learn more efficiently, in Deep Q-Networks, a reinforcement learning algorithm that achieved human-level performance across many Atari games.

...read moreread less

Journal ArticleDOI

Deep Reinforcement Learning for Strategic Bidding in Electricity Markets

Yujian Ye, +4 more

- 01 Mar 2020 -

IEEE Transactions on Smart Grid

TL;DR: A novel deep reinforcement learning (DRL) based methodology, combining a deep deterministic policy gradient (DDPG) method with a prioritized experience replay (PER) strategy is proposed, enabling market participants to receive accurate feedback regarding the impact of their bidding decisions on the market clearing outcome.

...read moreread less