Reducing the Time Complexity of Goal-Independent Reinforcement Learning

Ollington, R; Vamplew, P

Reducing the Time Complexity of Goal-Independent Reinforcement Learning

conference contribution

posted on 2023-05-26, 07:32 authored by Ollington, R, Vamplew, P

Concurrent Q-Learning (CQL) is a goal independent\ reinforcement learning technique that learns the action\ values to all states simultaneously. These action values\ may then be used in a similar way to eligibility traces to\ allow many action values to be updated at each time\ step. CQL learns faster than conventional Q-learning\ techniques with the added benefit of being able to apply\ all experiences gained performing one task to any new\ task within the problem domain. Unfortunately the\ update time complexity of CQL is O(|S|2x|A|). This\ paper presents a technique for reducing the update\ complexity of CQL to O(|A|) with little impact on\ performance.

History

Pagination

132-137

Publication status

Published

Event title

AISAT2004: International Conference on Artificial Intelligence in Science and Technology

Event Venue

Hobart, Tasmania, Australia

Date of Event (Start Date)

2004-11-21

Date of Event (End Date)

2004-11-25

Repository Status

Open

Usage metrics

Reducing the Time Complexity of Goal-Independent Reinforcement Learning

History

Pagination

Publication status

Event title

Event Venue

Date of Event (Start Date)

Date of Event (End Date)

Repository Status

Usage metrics

Categories

Keywords

Licence

Exports