v0.4.4

@PaParaZz1 PaParaZz1 released this 31 Oct 08:52

API Change

  1. context in the new task pipeline is now implemented as a dataclass, rather than a dict
  2. the recommended visualization tool is now wandb, rather than tensorboard
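
To illustrate what the first change means for user code, here is a minimal sketch contrasting dict-style and dataclass-style access. `SketchContext`, its fields, and `collect_step` are hypothetical names chosen for illustration; they are not DI-engine's actual context class or API.

```python
from dataclasses import dataclass, field
from typing import Any, List


@dataclass
class SketchContext:
    """Hypothetical stand-in for a pipeline context (not DI-engine's class)."""
    env_step: int = 0
    train_iter: int = 0
    trajectories: List[Any] = field(default_factory=list)


def collect_step(ctx: SketchContext, n_steps: int) -> None:
    # Dataclass style: declared fields are read and written as attributes.
    ctx.env_step += n_steps


# Dict style: fields are looked up by string key.
dict_ctx = {"env_step": 0, "train_iter": 0}
dict_ctx["env_step"] += 4

# Dataclass style: same update through attribute access.
ctx = SketchContext()
collect_step(ctx, 4)
print(ctx)
```

With a dataclass, the available fields, their defaults, and their types are declared in one place, so editors and type checkers can catch misspelled field names that a dict would only expose at run time.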

Env

  1. add modified gym-hybrid including moving, sliding and hardmove (#505) (#519)
  2. add evogym support (#495) (#527)
  3. add save_replay_gif option (#506)
  4. adapt minigrid_env and related config to latest MiniGrid v2.0.0 (#500)

Algorithm

  1. add pcgrad optimizer (#489)
  2. add some features in MLP and ResBlock (#511)
  3. delete MCTS-related modules (#518) (we will release an MCTS repo in the future)
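
For readers unfamiliar with PCGrad, the sketch below shows the gradient-projection step from the published algorithm (gradient surgery for multi-task learning) on plain Python lists. It is not the optimizer added in #489; `pcgrad_combine` and `dot` are hypothetical helper names, and a real optimizer would operate on framework tensors per parameter.

```python
import random
from typing import List, Optional, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def pcgrad_combine(
    grads: Sequence[Sequence[float]],
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Combine per-task gradients after removing pairwise conflicts."""
    rng = rng or random.Random(0)
    projected = []
    for i, g in enumerate(grads):
        g_i = list(g)
        # Visit the other tasks in random order, as in the PCGrad paper.
        others = [j for j in range(len(grads)) if j != i]
        rng.shuffle(others)
        for j in others:
            g_j = grads[j]
            inner = dot(g_i, g_j)
            if inner < 0:
                # Conflict: subtract the component of g_i along g_j.
                # inner < 0 implies g_j is non-zero, so the division is safe.
                scale = inner / dot(g_j, g_j)
                g_i = [x - scale * y for x, y in zip(g_i, g_j)]
        projected.append(g_i)
    # Sum the projected task gradients coordinate-wise.
    return [sum(col) for col in zip(*projected)]


conflicting = pcgrad_combine([[1.0, 0.0], [-1.0, 1.0]])
orthogonal = pcgrad_combine([[1.0, 0.0], [0.0, 1.0]])
print(conflicting, orthogonal)
```

Gradients that do not conflict (non-negative inner product) pass through unchanged, so the result matches a plain sum in that case.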

Enhancement

  1. add wandb middleware and demo (#488) (#523) (#528)
  2. add new properties in Context (#499)
  3. add single env policy wrapper for policy deployment (demo)
  4. add custom model demo and doc (documentation)

Fix

  1. fix build logger args and unittests (#522)
  2. fix total_loss calculation in PDQN (#504)
  3. fix save gif function bug
  4. fix level sample unittest bug

Style

  1. update contact email address (#503)
  2. polish env log and ResBlock name
  3. add details button in README

New Repo

  • DI-1024: Deep Reinforcement Learning + 1024 Game

Contributors: @PaParaZz1 @puyuan1996 @karroyan @hiha3456 @davide97l @Weiyuhong-1998 @zjowowen @norman26625