Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

serial training league demo #12

Merged
merged 16 commits into from
Aug 2, 2021
Merged

serial training league demo #12

merged 16 commits into from
Aug 2, 2021

Conversation

PaParaZz1
Copy link
Member

@PaParaZz1 PaParaZz1 commented Jul 21, 2021

Related Issue

#5

TODO List

  • self play demo link
  • league training demo link
  • league training result viz

@PaParaZz1 PaParaZz1 added algo Add new algorithm or improve old one serial Serial training related labels Jul 21, 2021
@PaParaZz1 PaParaZz1 changed the title WIP: serial training league demo serial training league demo Aug 1, 2021
Copy link
Collaborator

@garyzhang99 garyzhang99 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reviewed and approved.

@codecov-commenter
Copy link

codecov-commenter commented Aug 2, 2021

Codecov Report

Merging #12 (f309628) into main (dd4de1a) will decrease coverage by 0.02%.
The diff coverage is 86.42%.

Impacted file tree graph

@@            Coverage Diff             @@
##             main      #12      +/-   ##
==========================================
- Coverage   89.92%   89.90%   -0.03%     
==========================================
  Files         332      334       +2     
  Lines       23879    24243     +364     
==========================================
+ Hits        21473    21795     +322     
- Misses       2406     2448      +42     
Flag Coverage Δ
unittests 89.90% <86.42%> (-0.03%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
ding/league/player.py 95.04% <ø> (ø)
ding/worker/collector/base_serial_collector.py 90.90% <ø> (ø)
...orker/coordinator/one_vs_one_parallel_commander.py 98.22% <ø> (ø)
ding/worker/collector/comm/flask_fs_collector.py 89.38% <50.00%> (-0.80%) ⬇️
ding/entry/tests/test_serial_entry.py 79.13% <73.33%> (-0.41%) ⬇️
ding/league/base_league.py 83.50% <80.00%> (+0.52%) ⬆️
...r/collector/episode_one_vs_one_serial_collector.py 85.62% <85.62%> (ø)
...ng/worker/collector/one_vs_one_serial_evaluator.py 88.05% <88.05%> (ø)
ding/league/shared_payoff.py 97.24% <90.90%> (-1.68%) ⬇️
ding/league/tests/test_one_vs_one_league.py 98.66% <100.00%> (-0.02%) ⬇️
... and 11 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update dd4de1a...f309628. Read the comment docs.

@PaParaZz1 PaParaZz1 deleted the dev-league-demo branch August 2, 2021 17:49
puyuan1996 pushed a commit to puyuan1996/DI-engine that referenced this pull request Dec 14, 2021
* feature(nyz): add naive 1v1 two player demo

* feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation

* feature(nyz): modify game env and adjust hyper-param

* feature(nyz): add naive league training multi player demo

* feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code

* feature(nyz): modify selfplay demo and add two type game env

* style(nyz): correct format style

* polish(nyz): correct format style and adapt league demo main

* feature(nyz): add league payoff viz and enable payoff update in league demo

* feature(nyz): modify win rate calculation with draws

* test(nyz): fix one vs one league test compatibility bug

* test(nyz): add selfplay and league demo into unittest and algotest

* style(nyz): correct format

* hotfix(nyz): fix ppo continuous comatibility bug
puyuan1996 pushed a commit to puyuan1996/DI-engine that referenced this pull request Apr 18, 2022
* feature(nyz): add naive 1v1 two player demo

* feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation

* feature(nyz): modify game env and adjust hyper-param

* feature(nyz): add naive league training multi player demo

* feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code

* feature(nyz): modify selfplay demo and add two type game env

* style(nyz): correct format style

* polish(nyz): correct format style and adapt league demo main

* feature(nyz): add league payoff viz and enable payoff update in league demo

* feature(nyz): modify win rate calculation with draws

* test(nyz): fix one vs one league test compatibility bug

* test(nyz): add selfplay and league demo into unittest and algotest

* style(nyz): correct format

* hotfix(nyz): fix ppo continuous comatibility bug
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
algo Add new algorithm or improve old one serial Serial training related
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants