JAIST Repository >
d. 融合科学系 >
d11. 会議発表論文 >
d11-1. 会議発表論文 >

このアイテムの引用には次の識別子を使用してください: http://hdl.handle.net/10119/20016

タイトル: Optimal execution strategy using Deep Q-Network with heuristics policy
著者: Ogawa, Tatsuyoshi
Nakagawa, Kei
Ikeda, Kokolo
キーワード: optimal execution problem
DQN
DDQN
TWAP
発行日: 2024-07-06
出版者: Institute of Electrical and Electronics Engineers (IEEE)
誌名: 2024 16th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI)
開始ページ: 456
終了ページ: 461
DOI: 10.1109/IIAI-AAI63651.2024.00089
抄録: The optimal execution problem involves planning a stock execution strategy that minimizes trading costs for a specific quantity of stock over a certain timeframe. To tackle this problem, advanced techniques like Deep Reinforcement Learning (DRL), especially the Deep Q-Network (DQN) which employs deep learning to approximate the Q value function, have been introduced to identify the most efficient execution strategies. However, DRL methods face challenges such as learning instability and the extensive data requirements. Therefore, we propose to use prioritized experience replay and to incorporate a strategy derived from the insights of the financial field into the DQN during learning process. Particularly, we introduce a time-weighted average price (TWAP) strategy that has been proven to be optimal under specific conditions as a heuristic policy. This approach is expected to be able to enhance the stability and performance of policy learning. We have conducted numerical experiments in various noise-prone environments to assess the effectiveness of our approach. The findings indicate that our proposed method consistently outperforms conventional benchmarks by reducing costs in all tested environments.
Rights: This is the author's version of the work. Copyright (C) 2024 IEEE. 2024 16th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI), Takamatsu, Japan, pp. 456-461. DOI: https://doi.org/10.1109/IIAI-AAI63651.2024.00089. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
URI: http://hdl.handle.net/10119/20016
資料タイプ: author
出現コレクション:d11-1. 会議発表論文 (Conference Papers)

このアイテムのファイル:

ファイル 記述 サイズ形式
T-IKEDA-K-0930-5.pdf1097KbAdobe PDF見る/開く

当システムに保管されているアイテムはすべて著作権により保護されています。

 


お問合せ先 : 北陸先端科学技術大学院大学 研究推進課図書館情報係 (ir-sys[at]ml.jaist.ac.jp)