RL

Pretrained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control

Fine-tuning vision-language foundation models has emerged as a powerful approach to leveraging internet-scale data for generalization in downstream applications. A particularly promising source of representations already used in supervised learning …

ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

This paper introduces a novel method for enhancing the effectiveness of the Asynchronous Advantage Actor-Critic (A3C) algorithm by incorporating state-aware exploration. We achieve this improvement through three simple yet impactful modifications (1) …