v0.4.4

PaParaZz1 released this 31 Oct 08:52

· 248 commits to main since this release

API Change

context in new task pipeline is implemented by dataclass now, rather than dict
recommend visulization is wandb now, rather than tensorboard

Env

add modified gym-hybrid including moving, sliding and hardmove (#505) (#519)
add evogym support (#495) (#527)
add save_replay_gif option (#506)
adapt minigrid_env and related config to latest MiniGrid v2.0.0 (#500)

Algorithm

add pcgrad optimizer (#489)
add some features in MLP and ResBlock (#511)
delete mcts related modules (#518) (we will release a MCTS repo in future)

Enhancement

add wandb middleware and demo (#488) (#523) (#528)
add new properties in Context (#499)
add single env policy wrapper for policy deployment (demo)
add custom model demo and doc (文档)

Fix

fix build logger args and unittests (#522)
fix total_loss calculation in PDQN (#504)
fix save gif function bug
fix level sample unittest bug

Style

update contact email address (#503)
polish env log and resblock name
add details button in readme

New Repo

DI-1024: Deep Reinforcement Learning + 1024 Game

Contributors: @PaParaZz1 @puyuan1996 @karroyan @hiha3456 @davide97l @Weiyuhong-1998 @zjowowen @norman26625

Contributors

karroyan, PaParaZz1, and 6 other contributors

Assets 2