Ryan Bak

Software Engineer, Phoenix AZ

Deep Reinforcement Learning

As part of Udacity's Deep Reinforcement Learning nanodegree program, I created three projects trained to perform tasks in Unity's learning environments. Below is a description of the three projects, as well as images captured from my trained agents.

The first project is an implementation of deep Q-learning, trained on the Banana Collector environment. In this environment the agent is a robot attempting to collect yellow bananas while avoiding the blue bananas.

The code and more information can be found here.

Banana Collector
Banana Collector

The second project is an implementation of PPO with an actor-critic network. It is trained on the Reacher environment, in which each arm is an agent with the goal of keeping the end of the arm inside the moving sphere.

The code and more information can be found here.

The third project is an implementation of DDPG. This is a multi-agent environment in which each agent is rewarded for hitting the ball over the net.

The code and more information can be found here.

Banana Collector

This Site

This site is itself a live example of my work. Everything here was designed and coded by me, not done using a template. This project was originally designed as an opportunity for me to learn and play with web technologies, and later evolved into a personal website.