dependabot bot commented on behalf of github Jul 14, 2022

Bumps stable-baselines3 from 1.5.0 to 1.6.0.

Release notes

Sourced from stable-baselines3's releases.

SB3 v1.6.0: Recurrent PPO (PPO LSTM), better defaults for learning from pixels with SAC/TD3

SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib

Breaking Changes:

  • Changed the way policy "aliases" are handled ("MlpPolicy", "CnnPolicy", ...), removing the former register_policy helper, policy_base parameter and using policy_aliases static attributes instead (@​Gregwar)
  • SB3 now requires PyTorch >= 1.11
  • Changed the default network architecture when using CnnPolicy or MultiInputPolicy with SAC or DDPG/TD3: share_features_extractor now defaults to False and net_arch now defaults to [256, 256] (previously net_arch=[])
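
For projects that rely on the previous off-policy defaults, the two items above can be turned into a small pre-upgrade check. This is a minimal sketch, not part of the release notes: `parse_version`, `torch_is_new_enough`, and `LEGACY_OFFPOLICY_KWARGS` are hypothetical names introduced here, and the legacy value `share_features_extractor=True` is an assumption inferred from the note saying it is "now" False. `policy_kwargs` is the existing SB3 constructor argument.

```python
import re

# Minimum PyTorch version stated in the release notes.
MIN_TORCH = (1, 11, 0)


def parse_version(text: str) -> tuple:
    """Turn a string such as '1.11.0+cu113' into (1, 11, 0).

    Only the leading numeric part is read; local or pre-release
    suffixes are ignored. A missing patch number counts as 0.
    """
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        raise ValueError(f"Unrecognised version string: {text!r}")
    major, minor, patch = match.groups()
    return (int(major), int(minor), int(patch or 0))


def torch_is_new_enough(torch_version: str) -> bool:
    """Return True if the given PyTorch version satisfies SB3 1.6.0."""
    return parse_version(torch_version) >= MIN_TORCH


# Assumed pre-1.6.0 defaults for SAC/TD3/DDPG with CnnPolicy or
# MultiInputPolicy (the old share_features_extractor value is inferred).
LEGACY_OFFPOLICY_KWARGS = {
    "net_arch": [],
    "share_features_extractor": True,
}

# Possible usage, assuming stable-baselines3 and an image-based env:
#   import torch
#   from stable_baselines3 import SAC
#   assert torch_is_new_enough(torch.__version__)
#   model = SAC("CnnPolicy", env, policy_kwargs=LEGACY_OFFPOLICY_KWARGS)
```

Passing the legacy values explicitly keeps the network shape stable across the bump, which matters mainly when results need to stay comparable with runs made under 1.5.0.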

SB3-Contrib

Bug Fixes:

  • Fixed saving and loading large policies greater than 2GB (@​jkterry1, @​ycheng517)
  • Fixed final goal selection strategy that did not sample the final achieved goal (@​qgallouedec)
  • Fixed a bug with special characters in the tensorboard log name (@​quantitative-technologies)
  • Fixed a bug in DummyVecEnv's and SubprocVecEnv's seeding function: a None seed value was not checked (@​ScheiklP)
  • Fixed a bug where EvalCallback would crash when trying to synchronize VecNormalize stats when observation normalization was disabled
  • Added a check for unbounded actions
  • Fixed issues due to newer version of protobuf (tensorboard) and sphinx
  • Fix exception causes all over the codebase (@​cool-RR)
  • Prohibit simultaneous use of optimize_memory_usage and handle_timeout_termination due to a bug (@​MWeltevrede)
  • Fixed a bug in kl_divergence check that would fail when using numpy arrays with MultiCategorical distribution
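
The restriction on combining optimize_memory_usage with handle_timeout_termination can be checked in a project's own configuration before upgrading. The following is a minimal sketch, assuming both settings are available as plain booleans; `check_replay_buffer_flags` is a hypothetical helper, not an SB3 function, and its error message is illustrative.

```python
def check_replay_buffer_flags(
    optimize_memory_usage: bool,
    handle_timeout_termination: bool,
) -> None:
    """Raise ValueError if both flags are enabled at the same time.

    Mirrors the rule in the release notes: the two options may not be
    used together. Any other combination is accepted.
    """
    if optimize_memory_usage and handle_timeout_termination:
        raise ValueError(
            "optimize_memory_usage and handle_timeout_termination "
            "cannot both be True"
        )


# Accepted: memory optimisation on, timeout handling off.
check_replay_buffer_flags(
    optimize_memory_usage=True,
    handle_timeout_termination=False,
)
```

Running such a check in CI before merging the bump surfaces the conflict as a configuration error instead of a failure at training time.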

Others:

  • Upgraded to Python 3.7+ syntax using pyupgrade
  • Removed redundant double-check for nested observations from BaseAlgorithm._wrap_env (@​TibiGG)

Documentation:

  • Added link to gym doc and gym env checker
  • Fix typo in PPO doc (@​bcollazo)
  • Added link to PPO ICLR blog post
  • Added remark about breaking Markov assumption and timeout handling
  • Added doc about MLFlow integration via custom logger (@​git-thor)
  • Updated Huggingface integration doc
  • Added copy button for code snippets
  • Added doc about EnvPool and Isaac Gym support

Commits

  • c1f1c3d Release v1.6.0 (#958)
  • ef10189 Prohibit simultaneous use of optimize_memory_usage and handle_timeout_termina...
  • d64bcb4 Fix exception cause in base_class.py (#940)
  • 7ce7b6a Update defaults for offpolicy algos with features extractor (#935)
  • d68f0a2 Update doc: SB3 Contrib RecurrentPPO (#927)
  • 4b89fbf Fix issues due to newer version of protobuf and sphinx (#924)
  • 49813d8 Update doc and add check for unbounded action space (#918)
  • 2fcf8f9 Removed redundant double-check of nested Dict (#908)
  • 0fadc94 Fix synchronization bug with EvalCallback (#907)
  • c2518dc Add doc to use mlflow logger (#889)
  • Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

dependabot bot requested review from bheijden and jelledouwe as code owners July 14, 2022 19:39
dependabot bot added the dependencies label (Pull requests that update a dependency file) Jul 14, 2022
dependabot bot force-pushed the dependabot/pip/stable-baselines3-1.6.0 branch 2 times, most recently from 7c9203d to cc339cf July 20, 2022 16:02
dependabot bot force-pushed the dependabot/pip/stable-baselines3-1.6.0 branch 2 times, most recently from 871d143 to b6a0fbf August 5, 2022 09:55
dependabot bot force-pushed the dependabot/pip/stable-baselines3-1.6.0 branch 3 times, most recently from 3eebc07 to 798464d September 14, 2022 15:06
Bumps [stable-baselines3](https://github.com/DLR-RM/stable-baselines3) from 1.5.0 to 1.6.0.
- [Release notes](https://github.com/DLR-RM/stable-baselines3/releases)
- [Commits](DLR-RM/stable-baselines3@v1.5.0...v1.6.0)

---
updated-dependencies:
- dependency-name: stable-baselines3
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
dependabot bot force-pushed the dependabot/pip/stable-baselines3-1.6.0 branch from 798464d to e6a844e September 14, 2022 15:13

dependabot bot commented on behalf of github Sep 29, 2022

Superseded by #61.

dependabot bot closed this Sep 29, 2022
dependabot bot deleted the dependabot/pip/stable-baselines3-1.6.0 branch September 29, 2022 19:38