Skip to content

sbrml/pilco

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pilco

Build Status Coverage Status

Learn to balance baby.

Balance

Roadmap

Rename this repo. Candidates:

  • Talos
  • PRL
  • IRL (Inference for RL)

Priorities

  • Run on more environments: Cartpole, Mountaincar
  • Tensor shapes on all methods

Clean up current code

  • Define documentation layout
  • Docstrings - Sphinx
  • Doctest?
  • Remove all hacky stuff, like hard coded tensors
  • Clean up pendulum.py: learning-dynamics, objective, optimisation, plotting.
  • Batching in calls of our agents, policies and costs. Start with policies.
  • Migrate to gpflow.
  • Feasible/Initialisation space - this needs more specification, included it so we don't forget.

Example notebooks

  • Pendulum notebook

Write derivations, including complexity

  • Moment matching

Profiling

  • I have no clue how profiling works, let's research into how we should go about it

Write tests (especially Monte Carlo - maybe one test for all moment matching)

  • EQAgent
  • EQCost
  • Transforms
  • EQPolicy and TransformedPolicy
  • Util (cholesky update)

Run on other environments

Control something real

  • Cartpole swing
  • Lego mindstorms
  • Ask robotics faculty

Future algorithms

  • Non-greedy exploration (see the DL-algo)
  • Efficient Learning of Dynamics with an information based criterion (IRL)
  • Posterior Sampling for RL
  • Deep PILCO
  • Embed to control
  • PlaNet

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •