Version 0.2.0 of QDax #126

felixchalumeau · 2022-11-30T09:29:06Z

Hi guys,

Time for us to move to version 0.2.0 🎉

This release introduces:

fix a irrelevant 0.25 factor in td3 loss
fix a overflow issue in the replay buffer
clear the doc section about GPU usage with QDax
update jax, brax and flax versions
add a set of benchmarking functions from the QD community and clean the way we define our tasks
add a caveat section in the doc
add a QDax logo
scale the action space of pointmaze to match the action space of Brax tasks
add delay update in the PG emitter (PGAME)
fix reinitialisation of optimizer state in PG variations (PGAME)
introduce a wrapper to use fixed and known initial states for the Brax tasks
fix Brax version in colab
add the algorithm ME-ES
introduce a meta emitter to use multiple emitters in parallel, called MultiEmitter
add the algorithm CMA-ME with all its possible variants through the introduction of all three CMA emitters (improvement, random direction, optimization)
fix CMA ES optimizer
fix issue in CMA-MEGA
add the algorithm QDPG through the introduction of the Diversity PG emitter
refactor PGAME to use the multi emitter class
add template for pull requests
add DistributedMAPElites, an example of how to distribute a QDax algorithm on multiple devices
fix run-image stage in docker
avoid multiple jits of same method in for loops in our notebooks examples
add the extra_scores in the API of the repertoire addition in MAP-Elites

overload issue in replay buffer, add buffer test + move baselines test to separate folder

…ronments, Hypervolume functions and QD Suite (#73) * adding sphere, rastrigin, arm, and noisy_arm scoring functions * adding possibility to create default scoring_function for brax environments. * changing type of EnvState to brax.envs.State * fix some quick typing inconsistencies throughout the code. * update README to include the default functions and make it a usable * create an examples directory to include scripts and notebooks

Policy delay update to PGA emitter update. Present in TD3 but was missing from the PG Emitter.

Missing re-initialization of the optimizer state (adam optimizer) for PG variations in PG emitter

* add wrapper for to make the reset function return a fixed initial state * set fixed_init_state to False by default

* update brax version for notebook examples

* add MAP-Elites-ES * add tests for mees * add mees example notebook * update colab link in mees example * clean up mees notebook * Speed up novelty for mees * Set paper parameters for emitter and notebook * Add utils to resample individuals repetitively * [fix] fix key handling in sampling * Test for sampling utils * Fix sampling test format * Update files to last develop * Remove old notebook * Uniformise optimizer for Adam and SGD * Add parents-sampling parameters to config * Simplify ME-ES notebook using reset_based_scoring_function_brax_envs * minor updates * style updates * Define a separate class for novelty archive * Refactor code for readability - extract es process from main method * Add option for ME-ES-explore * [fix] Fix NoveltyArchive lax.scan compatibility * add MEES to docs + minor typos Co-authored-by: Felix <f.chalumeau@instadeep.com>

* Improve reset_based scoring usage in brax env default definition functions * fix styling issue with pre-commit Co-authored-by: Felix <f.chalumeau@instadeep.com> Co-authored-by: Bryan Lim <limbryan239@gmail.com>

* Add a base Multi-Emitter implementation of a batch of Emitters. * No choice strategy is used, all sub emitters are called and the proportion are defined in the script when defining the sub emitters. * We will consider adding "strategic" layers later, to manage the choice of emitters or the proportion. This is left for a future PR.

Add emitters introduced in CMA-ME, add tests, notebook, update the documentation, fix CMA-ES and fix CMA-MEGA

refactor pgame, create quality pg emitter, creates diversity pg emitter, rename greedy and controllers, add qdpg, with notebooks and tests, update readme and docs

Co-authored-by: valentin <v.mace@instadeep.com>

distributed map elites with example and docs. No explicit support for TPU.

* feat!: extra-scores for repertoire addition * Fix display * Add extra_scores to mome repertoire as well * add warnings to me and mome repertoires Co-authored-by: Felix <f.chalumeau@instadeep.com>

codecov-commenter · 2022-11-30T09:52:43Z

Codecov Report

Merging #126 (f84f887) into main (2fb5619) will increase coverage by 2.73%.
The diff coverage is 97.11%.

@@            Coverage Diff             @@
##             main     #126      +/-   ##
==========================================
+ Coverage   89.58%   92.31%   +2.73%     
==========================================
  Files          67      103      +36     
  Lines        3907     5816    +1909     
==========================================
+ Hits         3500     5369    +1869     
- Misses        407      447      +40

Impacted Files	Coverage Δ
qdax/core/mome.py	`100.00% <ø> (ø)`
tests/baselines_test/cmamega_test.py	`98.36% <ø> (ø)`
tests/baselines_test/dads_smerl_test.py	`97.14% <ø> (ø)`
tests/baselines_test/dads_test.py	`96.92% <ø> (ø)`
tests/baselines_test/diayn_smerl_test.py	`96.96% <ø> (ø)`
tests/baselines_test/diayn_test.py	`96.72% <ø> (ø)`
tests/baselines_test/ga_test.py	`100.00% <ø> (ø)`
tests/baselines_test/omgmega_test.py	`98.36% <ø> (ø)`
tests/baselines_test/td3_test.py	`98.27% <ø> (ø)`
qdax/tasks/qd_suite/qd_suite_task.py	`79.31% <79.31%> (ø)`
... and 61 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

…n using fixed init state (#128)

Lookatator and others added 27 commits August 9, 2022 16:03

Merge branch 'main' into develop

be464f2

fix: correct irrelevant factor 0.25 in td3 loss (#78)

a04fadc

Fix the replay buffer overflow issue (#75)

82262e5

overload issue in replay buffer, add buffer test + move baselines test to separate folder

Merge branch 'main' into develop

ab4d4ca

docs: add caveats and logo (#99)

b691c7b

hotfix(images): re-add deleted logos to the repo

e58472e

fix(pointmaze): scale after the clip of actions (#101)

7d094ef

fix: add update policy delay in PG emitter

9d35be7

Policy delay update to PGA emitter update. Present in TD3 but was missing from the PG Emitter.

fix: optimizer state reinitialization for PG variations (#104)

0180665

Missing re-initialization of the optimizer state (adam optimizer) for PG variations in PG emitter

fix(style): mypy issue in controller training

7607290

feat(envs): wrapper for fixed initial state of environments (#92)

aee7e51

* add wrapper for to make the reset function return a fixed initial state * set fixed_init_state to False by default

fix(docs): avoid using flax 0.6.2 in setup (#112)

46e82b6

fix(examples): brax version in colab examples (#108)

df67142

* update brax version for notebook examples

fix: reset_based scoring in brax_env default task (#109)

f20a32a

* Improve reset_based scoring usage in brax env default definition functions * fix styling issue with pre-commit Co-authored-by: Felix <f.chalumeau@instadeep.com> Co-authored-by: Bryan Lim <limbryan239@gmail.com>

fix(mees): add batch size property (#114)

d7c6dc7

feat(algorithms): add CMA-ME, fix CMA-ES and CMA-MEGA (#86)

a5c19a2

Add emitters introduced in CMA-ME, add tests, notebook, update the documentation, fix CMA-ES and fix CMA-MEGA

feat(algorithms): add QDPG emitter + refactor PGAME (#110)

6370def

refactor pgame, create quality pg emitter, creates diversity pg emitter, rename greedy and controllers, add qdpg, with notebooks and tests, update readme and docs

feat(github): add GitHub template for PR (#120)

2afc4ea

Co-authored-by: valentin <v.mace@instadeep.com>

fix(test): inverse fitness and desc names in sampling test (#119)

db730e0

feat(algorithms): add MAP-Elites distributed on multiple devices (#117)

0f07770

distributed map elites with example and docs. No explicit support for TPU.

fix(docker): fix run-image docker stage (#121)

062b52f

fix(jit): avoid consecutive jits of same method in for loops (#122)

ab13dbd

feat(repertoire): optional extra-scores for repertoire addition (#118)

f5b5d94

* feat!: extra-scores for repertoire addition * Fix display * Add extra_scores to mome repertoire as well * add warnings to me and mome repertoires Co-authored-by: Felix <f.chalumeau@instadeep.com>

fix(doc): add colab links, missing doc, update version (#125)

f84f887

felixchalumeau requested review from Lookatator and limbryan November 30, 2022 09:41

felixchalumeau marked this pull request as ready for review November 30, 2022 09:43

Lookatator approved these changes Nov 30, 2022

View reviewed changes

limbryan approved these changes Nov 30, 2022

View reviewed changes

fix(envs): order of wrappers to ensure update of state descriptor whe…

cfaf3c8

…n using fixed init state (#128)

felixchalumeau merged commit fc0ad78 into main Nov 30, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Version 0.2.0 of QDax #126

Version 0.2.0 of QDax #126

felixchalumeau commented Nov 30, 2022

codecov-commenter commented Nov 30, 2022

Version 0.2.0 of QDax #126

Version 0.2.0 of QDax #126

Conversation

felixchalumeau commented Nov 30, 2022

codecov-commenter commented Nov 30, 2022

Codecov Report