Integration of causal inference in the DQN sampling process for classical control problems
dc.contributor.author | Velez Bedoya, Jairo Ivan | |
dc.contributor.author | Gonzalez Bedia, Manuel | |
dc.contributor.author | Castillo Ossa, Luis Fernando | |
dc.contributor.author | Arango Lopez , Jeferson | |
dc.contributor.author | Moreira, Fernando | |
dc.date.accessioned | 2024-12-04T14:27:43Z | |
dc.date.available | 2024-12-04T14:27:43Z | |
dc.date.issued | 2024-11-29 | |
dc.description.abstract | In this study, causal inference is integrated into deep reinforcement learning to enhance sampling in classical control environments. The problem we’re working on is "classical control," where an agent makes decisions to keep systems balanced. With the help of artificial intelligence and causal inference, we have developed a method that adjusts a deep Q-network’s experience memory by adjusting the priority of transitions. According to the agent’s actions, these priorities are based on the magnitude of causal differences. We have applied our methodology to a reference environment in reinforcement learning. In comparison with a deep Q-network based on conventional random sampling, the results indicate significant improvements in performance and learning efficiency. Our study shows that causal inference can be integrated into the sampling process so that experience transitions can be selected more intelligently, resulting in more effective learning for classical control problems. The study contributes to the convergence between artificial intelligence and causal inference, offering new perspectives for the application of reinforcement learning techniques in real-world applications where precise control is essential. | |
dc.identifier.citation | Velez Bedoya, J. I., Gonzalez Bedia, M., Castillo Ossa, L. F., Arango Lopez , J., & Moreira, F. (2024). Integration of causal inference in the DQN sampling process for classical control problems. Neural Computing and Applications, (published online: 29 November 2024), 1-13. https://doi.org/10.1007/s00521-024-10540-4. Repositório Institucional UPT. https://hdl.handle.net/11328/6027 | |
dc.identifier.issn | 1433-3058 | |
dc.identifier.issn | 0941-0643 | |
dc.identifier.uri | https://hdl.handle.net/11328/6027 | |
dc.language.iso | eng | |
dc.publisher | Springer | |
dc.relation.hasversion | https://doi.org/10.1007/s00521-024-10540-4 | |
dc.rights | restricted access | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Causal inference | |
dc.subject | Prioritized sampling | |
dc.subject | Deep Q-network | |
dc.subject | Reinforcement learning | |
dc.subject.fos | Ciências Naturais - Ciências da Computação e da Informação | |
dc.title | Integration of causal inference in the DQN sampling process for classical control problems | |
dc.type | journal article | |
dcterms.references | https://link.springer.com/article/10.1007/s00521-024-10540-4#citeas | |
dspace.entity.type | Publication | |
oaire.citation.endPage | 13 | |
oaire.citation.issue | Published online: 29 November 2024 | |
oaire.citation.startPage | 1 | |
oaire.citation.title | Neural Computing and Applications | |
oaire.version | http://purl.org/coar/version/c_970fb48d4fbd8a85 | |
person.affiliation.name | Universidade Portucalense | |
person.familyName | Moreira | |
person.givenName | Fernando | |
person.identifier.ciencia-id | 7B1C-3A29-9861 | |
person.identifier.orcid | 0000-0002-0816-1445 | |
person.identifier.rid | P-9673-2016 | |
person.identifier.scopus-author-id | 8649758400 | |
relation.isAuthorOfPublication | bad3408c-ee33-431e-b9a6-cb778048975e | |
relation.isAuthorOfPublication.latestForDiscovery | bad3408c-ee33-431e-b9a6-cb778048975e |
Files
Original bundle
1 - 1 of 1