UROP Openings

Have a UROP opening you would like to submit?

Please fill out the form.

Submit your UROP opening

Reinforcement Learning Model Development and Comparison




4: Architecture

Faculty Supervisor:

Takehiko Nagakura

Faculty email:


Apply by:


Paloma Francisca Gonzalez Rojas <palomagr@mit.edu>

Project Description

We are using human trajectory data from drone videos from Machu Picchu to feed to a Reinforcement Learning Model in Unity3D. We have two training approaches to compare, Exploratory RL and Imitation Learning. For each a training scene is set up with rewards and a curated label system. Such scenes have been set up, and the main task is to test them in order to do a quantitative/qualitative comparison.


C# and Python programming skills in medium level, object oriented programming, Reinforcement Learning experience / knowledge Unity3D experience