Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez
Research output: Chapter in Book/Report/Conference proceeding › Conference Contribution (Conference Proceeding)
Citations (Scopus): 69