polish(gry): polish reward model and td error #624

ruoyuGao · 2023-03-28T05:05:38Z

Description

add a config table for each reward model
remove unused parameters
add comments for the TD error
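For readers unfamiliar with the term, the quantity being documented can be sketched as follows. This is a minimal, framework-free illustration of a one-step TD error, not the code in `ding/rl_utils/td.py`; the function name `one_step_td_error` and all of its parameters are hypothetical and chosen only for this example.

```python
from typing import List, Sequence


def one_step_td_error(
    q_sa: Sequence[float],
    target_q_next_max: Sequence[float],
    reward: Sequence[float],
    done: Sequence[bool],
    gamma: float = 0.99,
) -> List[float]:
    """Per-sample one-step TD error (hypothetical helper, not DI-engine's API).

    q_sa:              Q(s, a) for the action actually taken
    target_q_next_max: max over a' of Q_target(s', a')
    reward:            immediate reward r
    done:              True if s' is terminal, so no bootstrapping is applied
    """
    errors = []
    for q, q_next, r, d in zip(q_sa, target_q_next_max, reward, done):
        # Bootstrapped target: r + gamma * max_a' Q_target(s', a'),
        # with the second term dropped on terminal transitions.
        target = r + gamma * (1.0 - float(d)) * q_next
        errors.append(target - q)
    return errors
```

A loss would typically be built from these errors (for example, their mean square); variants such as n-step or Munchausen-style targets change how `target` is formed but keep the same "target minus current estimate" shape.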

Related Issue

TODO

Check List

merge the latest version source branch/repo, and resolve all the conflicts
pass style check
pass all the tests

fix style for reward model

ding/reward_model/gail_irl_model.py

ding/reward_model/drex_reward_model.py

PaParaZz1 · 2023-03-28T11:48:39Z

ding/reward_model/gail_irl_model.py

        target_new_data_count=64,
+        # (int) Linear model hidden size


Why "linear" here? There may be more complicated networks.

Because our reward model network is linear. Should we change this comment?
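One way the comment could be worded so that it does not assume a particular architecture is sketched below. This is a hypothetical config fragment, not the text that was merged: only `target_new_data_count=64` and the `(int)` comment prefix come from the diff above, while the key name `hidden_size` and its value are assumptions made for illustration.

```python
# Hypothetical fragment of a reward model config (illustration only).
reward_model_config = dict(
    target_new_data_count=64,
    # (int) Hidden size of the reward model network.
    # Key name and value are placeholders, not taken from the PR.
    hidden_size=128,
)
```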

ding/reward_model/gail_irl_model.py

ding/rl_utils/td.py

codecov · 2023-03-30T04:33:02Z

Codecov Report

Merging #624 (d7d16ac) into main (405191d) will decrease coverage by 0.08%.
The diff coverage is 91.66%.

❗ Current head d7d16ac differs from pull request most recent head 2d741e8. Consider uploading reports for the commit 2d741e8 to get more accurate results

@@            Coverage Diff             @@
##             main     #624      +/-   ##
==========================================
- Coverage   83.03%   82.96%   -0.08%     
==========================================
  Files         570      570              
  Lines       47037    46955      -82     
==========================================
- Hits        39056    38954     -102     
- Misses       7981     8001      +20

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `82.96% <91.66%> (-0.08%)` | ⬇️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
|---|---|
| ding/framework/task.py | `93.55% <ø> (ø)` |
| ding/policy/a2c.py | `91.20% <ø> (ø)` |
| ding/policy/acer.py | `93.95% <ø> (ø)` |
| ding/policy/atoc.py | `90.90% <ø> (ø)` |
| ding/policy/collaq.py | `86.88% <ø> (ø)` |
| ding/policy/coma.py | `91.96% <ø> (ø)` |
| ding/policy/d4pg.py | `96.38% <ø> (ø)` |
| ding/policy/ddpg.py | `88.00% <ø> (ø)` |
| ding/policy/decision_transformer.py | `18.07% <ø> (ø)` |
| ding/policy/il.py | `27.77% <ø> (ø)` |

... and 52 more

... and 63 files with indirect coverage changes

ruoyuGao added 11 commits March 26, 2023 23:51

polish gcl

803c47a

polish gail irl

a1843b6

polish icm api doc

3ebd9a5

add api comment for mdqn td error

82fb8ff

add config table for pdeil reward model

801a269

add config table for pwil reward model

d5fa413

add config table for red reward model

f9f6b62

add config table for rnd reward model

352af58

add config table for trex reward model

e304a45

add config table for drex reward model

f13b2f1

add config table for drex reward model

6144775

ruoyuGao changed the title ~~Polish(gry) : polish reward model and td error~~ polish(gry) : polish reward model and td error Mar 28, 2023

add comment for td error

7aceb57

fix style for reward model

ruoyuGao force-pushed the ruoyugao branch from 1dc8041 to 7aceb57 March 28, 2023 06:57

Merge branch 'main' into ruoyugao

e8e8f34

PaParaZz1 added the enhancement New feature or request label Mar 28, 2023

PaParaZz1 requested changes Mar 28, 2023

ruoyuGao and others added 4 commits March 29, 2023 23:54

fix typo for reward model and td

b1d6722

Merge branch 'opendilab:main' into ruoyugao

512b4e3

Merge branch 'ruoyugao' of https://github.com/ruoyuGao/DI-engine into…

d7d16ac

… ruoyugao

fix typo for clear buffer

2d741e8

PaParaZz1 approved these changes Apr 3, 2023

PaParaZz1 merged commit a580019 into opendilab:main Apr 3, 2023

PaParaZz1 mentioned this pull request Apr 3, 2023

Roadmap for DI-engine #548

Open

PaParaZz1 changed the title ~~polish(gry) : polish reward model and td error~~ polish(gry): polish reward model and td error Apr 3, 2023

ruoyuGao deleted the ruoyugao branch April 4, 2023 02:00
