|
JAIST Repository >
School of Transdisciplinary Science >
Conference Papers >
Conference Papers >
Please use this identifier to cite or link to this item:
https://hdl.handle.net/10119/20016
|
| Title: | Optimal execution strategy using Deep Q-Network with heuristics policy |
| Authors: | Ogawa, Tatsuyoshi Nakagawa, Kei Ikeda, Kokolo |
| Keywords: | optimal execution problem DQN DDQN TWAP |
| Issue Date: | 2024-07-06 |
| Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
| Magazine name: | 2024 16th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI) |
| Start page: | 456 |
| End page: | 461 |
| DOI: | 10.1109/IIAI-AAI63651.2024.00089 |
| Abstract: | The optimal execution problem involves planning a stock execution strategy that minimizes trading costs for a specific quantity of stock over a certain timeframe. To tackle this problem, advanced techniques like Deep Reinforcement Learning (DRL), especially the Deep Q-Network (DQN) which employs deep learning to approximate the Q value function, have been introduced to identify the most efficient execution strategies. However, DRL methods face challenges such as learning instability and the extensive data requirements. Therefore, we propose to use prioritized experience replay and to incorporate a strategy derived from the insights of the financial field into the DQN during learning process. Particularly, we introduce a time-weighted average price (TWAP) strategy that has been proven to be optimal under specific conditions as a heuristic policy. This approach is expected to be able to enhance the stability and performance of policy learning. We have conducted numerical experiments in various noise-prone environments to assess the effectiveness of our approach. The findings indicate that our proposed method consistently outperforms conventional benchmarks by reducing costs in all tested environments. |
| Rights: | This is the author's version of the work. Copyright (C) 2024 IEEE. 2024 16th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI), Takamatsu, Japan, pp. 456-461. DOI: https://doi.org/10.1109/IIAI-AAI63651.2024.00089. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
| URI: | https://hdl.handle.net/10119/20016 |
| Material Type: | author |
| Appears in Collections: | d11-1. 会議発表論文 (Conference Papers)
|
Files in This Item:
| File |
Description |
Size | Format |
| T-IKEDA-K-0930-5.pdf | | 1097Kb | Adobe PDF | View/Open |
|
All items in DSpace are protected by copyright, with all rights reserved.
|