Composing Complex Skills by Learning Transition Policies (ICLR 2019)

Edward Hu

Los Angeles, California

1 0

0 Collaborators

Humans acquire complex skills by exploiting previously learned skills and making transitions between them. To empower machines with this ability, we propose a method that can learn transition policies which effectively connect primitive skills to perform sequential tasks without handcrafted rewards. To efficiently train our transition policies, we introduce proximity predictors which induce rewards gauging proximity to suitable initial states for the next skill. The proposed method is evaluated on a set of complex continuous control tasks in bi-pedal locomotion and robotic arm manipulation which traditional policy gradient methods struggle at. We demonstrate that transition policies enable us to effectively compose complex skills with existing primitive skills. The proposed induced rewards computed using the proximity predictor further improve training efficiency by providing more dense information than the sparse rewards from the environments. ...learn more

Project status: Published/In Market

Robotics, Artificial Intelligence

Groups
DeepLearning, Artificial Intelligence West Coast, Student Developers for AI

Code Samples [1]Links [2]

Overview / Usage

Methodology / Approach

To efficiently train our transition policies, we introduce proximity predictors which induce rewards gauging proximity to suitable initial states for the next skill. The proposed method is evaluated on a set of complex continuous control tasks in bi-pedal locomotion and robotic arm manipulation which traditional policy gradient methods struggle at. We demonstrate that transition policies enable us to effectively compose complex skills with existing primitive skills. The proposed induced rewards computed using the proximity predictor further improve training efficiency by providing more dense information than the sparse rewards from the environments.

Technologies Used

Tensorflow
Python

Repository

https://youngwoon.github.io/module/

You have disabled JavaScript

We are sorry, but without JavaScript we are currently unable to display the latest activity feed. Please, enable Javascript in your browser.

Composing Complex Skills by Learning Transition Policies (ICLR 2019)

Edward Hu

Overview / Usage

Methodology / Approach

Technologies Used

Repository

Other links

Login to continue

This action requires you to be logged in.

Thanks for voting. Please leave a comment.

Composing Complex Skills by Learning Transition Policies (ICLR 2019)

Edward Hu

Overview / Usage

Methodology / Approach

Technologies Used

Repository

Other links

Login to continue

This action requires you to be logged in.