Integration of causal inference in the DQN sampling process for classical control problems

In this study, causal inference is integrated into deep reinforcement learning to enhance sampling in classical control environments. The problem we’re working on is "classical control," where an agent makes decisions to keep systems balanced. With the help of artificial intelligence and causal inference, we have developed a method that adjusts a deep Q-network’s experience memory by adjusting the priority of transitions. According to the agent’s actions, these priorities are based on the magnitude of causal differences. We have applied our methodology to a reference environment in reinforcement learning. In comparison with a deep Q-network based on conventional random sampling, the results indicate significant improvements in performance and learning efficiency. Our study shows that causal inference can be integrated into the sampling process so that experience transitions can be selected more intelligently, resulting in more effective learning for classical control problems. The study contributes to the convergence between artificial intelligence and causal inference, offering new perspectives for the application of reinforcement learning techniques in real-world applications where precise control is essential.

Keywords

Causal inference, Prioritized sampling, Deep Q-network, Reinforcement learning

Document Type

Journal article

Version

VoR - Version of Record

Publisher Version

https://doi.org/10.1007/s00521-024-10540-4

Dataset

https://link.springer.com/article/10.1007/s00521-024-10540-4#citeas

Citation

Velez Bedoya, J. I., Gonzalez Bedia, M., Castillo Ossa, L. F., Arango Lopez , J., & Moreira, F. (2024). Integration of causal inference in the DQN sampling process for classical control problems. Neural Computing and Applications, (published online: 29 November 2024), 1-13. https://doi.org/10.1007/s00521-024-10540-4. Repositório Institucional UPT. https://hdl.handle.net/11328/6027

Identifiers

https://hdl.handle.net/11328/6027
1433-3058
0941-0643

Access Type

Restricted Access

License

Attribution (CC-BY)

Full item page