Integration of causal inference in the DQN sampling process for classical control problems

Date

2024-11-29

Embargo

Advisor

Coadvisor

Journal Title

Journal ISSN

Volume Title

Publisher

Springer
Language
English

Research Projects

Organizational Units

Journal Issue

Alternative Title

Abstract

In this study, causal inference is integrated into deep reinforcement learning to enhance sampling in classical control environments. The problem we’re working on is "classical control," where an agent makes decisions to keep systems balanced. With the help of artificial intelligence and causal inference, we have developed a method that adjusts a deep Q-network’s experience memory by adjusting the priority of transitions. According to the agent’s actions, these priorities are based on the magnitude of causal differences. We have applied our methodology to a reference environment in reinforcement learning. In comparison with a deep Q-network based on conventional random sampling, the results indicate significant improvements in performance and learning efficiency. Our study shows that causal inference can be integrated into the sampling process so that experience transitions can be selected more intelligently, resulting in more effective learning for classical control problems. The study contributes to the convergence between artificial intelligence and causal inference, offering new perspectives for the application of reinforcement learning techniques in real-world applications where precise control is essential.

Keywords

Causal inference, Prioritized sampling, Deep Q-network, Reinforcement learning

Document Type

Journal article

Citation

Velez Bedoya, J. I., Gonzalez Bedia, M., Castillo Ossa, L. F., Arango Lopez , J., & Moreira, F. (2024). Integration of causal inference in the DQN sampling process for classical control problems. Neural Computing and Applications, (published online: 29 November 2024), 1-13. https://doi.org/10.1007/s00521-024-10540-4. Repositório Institucional UPT. https://hdl.handle.net/11328/6027

Identifiers

TID

Designation

Access Type

Restricted Access

Sponsorship

Description