Research output per year
Research output per year
A.A. Diwan*, Julen Urain, Jens Kober, Jan Peters
Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review
This paper introduces a new imitation learning framework based on energy-based generative models capable of learning complex, physics-dependent, robot motion policies through state-only expert motion trajectories. Our algorithm, called Noise-conditioned Energy-based Annealed Rewards (NEAR), constructs several perturbed versions of the expert's motion data distribution and learns smooth, and well-defined representations of the data distribution's energy function using denoising score matching. We propose to use these learnt energy functions as reward functions to learn imitation policies via reinforcement learning. We also present a strategy to gradually switch between the learnt energy functions, ensuring that the learnt rewards are always well-defined in the manifold of policy-generated samples. We evaluate our algorithm on complex humanoid tasks such as locomotion and martial arts and compare it with state-only adversarial imitation learning algorithms like Adversarial Motion Priors (AMP). Our framework sidesteps the optimisation challenges of adversarial imitation learning techniques and produces results comparable to AMP in several quantitative metrics across multiple imitation settings. Code and videos available at anishhdiwan.github.io/noise-conditionedenergy-based-annealed-rewards/.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 13th International Conference on Learning Representations, ICLR 2025 |
| Publisher | International Conference on Learning Representations, ICLR |
| Pages | 29320-29341 |
| Number of pages | 22 |
| ISBN (Electronic) | 9798331320850 |
| Publication status | Published - 2025 |
| Event | 13th International Conference on Learning Representations, ICLR 2025 - Singapore, Singapore Duration: 24 Apr 2025 → 28 Apr 2025 Conference number: 13 |
| Conference | 13th International Conference on Learning Representations, ICLR 2025 |
|---|---|
| Abbreviated title | ICLR 2025 |
| Country/Territory | Singapore |
| City | Singapore |
| Period | 24/04/25 → 28/04/25 |
Research output: Contribution to conference › Poster › Scientific