Skip to content
Snippets Groups Projects
Select Git revision
  • main default protected
1 result

mso_3_4-td1

  • Clone with SSH
  • Clone with HTTPS
  • Forked from Dellandrea Emmanuel / MSO_3_4-TD1
    7 commits behind, 5 commits ahead of the upstream repository.
    user avatar
    oscarchaufour authored
    fe50437f
    History

    Reinforcement learning

    Familiarization with a complete RL pipeline: Application to training a robotic arm

    Get familiar with Hugging Face Hub

    Link to the model on the hub :

    REINFORCE

    Plot showing the total reward accross episodes: Alt text