LM

Code for the LM project collaboration

It may be helpful to install this git repo for the maze code: https://github.com/MattChanTK/gym-maze.

Required Packages: Quandl, Gym, Pygame, Pandas, Sklearn, Torchvision, Pytorch, Tqdm, Scipy

Optional Packages: Comet_ml

stockSynthesis.py - simple LSTM close price prediction given the corresponding open price
predictOpen.py - predict the next open price given the previous one
LSTM_Sequential.py - predict an open price sequence given a sequence of previous open prices
stockDataset.py - manipulate kaggle stock data into sequences of open prices
stockAgent.py - reinforcement learning agent to learn a multiplier to produce the next open price
stockRL.py - runner code for the Stock Agent to predict open prices
pytorchSequentialPredict.py - LSTM_Sequential.py but implemented in pytorch instead of keras

maze_2d_q_learning.py - solve a maze using Q learning
maze_env.py - creates the base maze environment for these experiments
maze_view_2D.py - helper for maze_env.py
maze_2d_dqn.py - solve a maze using Deep Q learning
maze_multiscale_qlearning.py - solve a maze using Feudal Q learning
multi_maze.py - creates the environment for the feudal learning experiments
maze2d_002.npy - maze file for the manager (2x2 maze)
maze2d_003.npy - maze file for the worker (4x4 maze)
maze_qmodels.py - contains code for the worker and manager networks for maze_multiscale_qlearning.py
maze_multiscale_dqn.py - solve a maze using Feudal Deep Q Networks
maze_dqn_models.py - contains code for the worker and manager networks for maze_multiscale_dqn.py

stock_qlearning.py - portfolio experiment with a Q learning agent
stock_env.py - stock market environment for stock_qlearning.py
stock_dqn.py - portfolio experiment with a DQN agent
stock_env_dqn.py - stock market environment for stock_dqn.py
multiscale_q_stock.py - portfolio experiment with feudal Q learning
multi_stock_env.py - stock market environment for multiscale_q_stock.py
stock_qmodles.py - contains the worker and manager code for multiscale_q_stock.py
stock_multi_dqn.py - portfolio experiment with feudal DQNs
multi_stock_env_dqn.py - stock market environment for stock_multi_dqn.py
stock_dqn_models.py - contains the worker, manager, and other helper classes for multi_stock_env_dqn.py
multi_transaction_feudal.py - feudal portfolio experiment where the manager chooses from several workers with different levels of temporal abstraction
multi_trans_stock_env.py - stock market environment for multi_transaction_feudal.py
hard_coded_qLearning.py - portfolio experiment with a hard coded agent as a baseline
dynamicM_hardcodeW.py - feudal portfolio experiment where the manager chooses from several hard coded workers
hard_coded_agents.py - contains the hard coded agents for dynamicM_hardcodeW.py

mutAngTSNE.py - use the TSNE embedding as the subroutine ID when predicting steering angles; also capable of predicting multiple steering angles out
predZKomanda.py - use a modified version of the winning udacity steering angle challenge network to predict steering angles and subroutine IDs
tsnePredict.py - predict the TSNE centroids using images (poor results)
udacityData.py - dataset class to process the udacity driving dataset
komanda.py - predict steering angles using the winning udacity steering angle challenge network
tsnePrevDataPred.py - predict the next TSNE centroid given the input data for the previous centroid (poor results)

Provide feedback