
Conversation

ayushi-3536

  • Added benchmark outline for the CNN benchmarks from the paper
  • Pulling from the MO interface by Philipp is pending

- changed the lock name for the mo_cnn benchmark
- removed the hard-coded model input to support multiple datasets
…ation of various datasets

- changed epochs from 50 to 25 (following the literature)
- corrected epoch training (0-indexed)
- removed subsample from the fidelity space (not done in the literature; we can discuss adding it if we want to run experiments on this)
- returning a Python object (see the sketch below)
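
For context, a minimal usage sketch of how the updated benchmark would be queried through HPOBench's standard interface. The class name `CNNBenchmark`, the `dataset` keyword, and the fidelity key `budget` are assumptions; only the generic `get_configuration_space` / `objective_function` pattern returning a result dict is taken from the HPOBench base API.

```python
# Hypothetical usage sketch -- class name, dataset kwarg and fidelity key are assumptions.
from hpobench.container.benchmarks.mo.cnn_benchmark import CNNBenchmark  # assumed class name

benchmark = CNNBenchmark(dataset="fashion", rng=1)   # dataset selection instead of a hard-coded model input (assumed kwarg)
config = benchmark.get_configuration_space(seed=1).sample_configuration()

# Fidelity is the number of training epochs (now at most 25); there is no subsample fidelity.
result = benchmark.objective_function(configuration=config, fidelity={"budget": 25}, rng=1)

print(result["function_value"])   # multi-objective values, returned as a plain Python dict
print(result["cost"])             # cost of evaluating this configuration
```
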
@codecov

codecov bot commented May 17, 2022

Codecov Report

Merging #147 (cf7ccfa) into development (9dde397) will decrease coverage by 2.03%.
The diff coverage is 30.18%.

❗ Current head cf7ccfa differs from pull request most recent head 761a7ee. Consider uploading reports for the commit 761a7ee to get more accurate results

Impacted file tree graph

@@               Coverage Diff               @@
##           development     #147      +/-   ##
===============================================
- Coverage        44.26%   42.23%   -2.04%     
===============================================
  Files               41       46       +5     
  Lines             2415     2671     +256     
===============================================
+ Hits              1069     1128      +59     
- Misses            1346     1543     +197     
| Impacted Files | Coverage Δ |
| --- | --- |
| hpobench/util/data_manager.py | 47.04% <15.78%> (-10.36%) ⬇️ |
| hpobench/container/benchmarks/mo/cnn_benchmark.py | 66.66% <66.66%> (ø) |
| hpobench/container/client_abstract_benchmark.py | 85.64% <0.00%> (-1.39%) ⬇️ |
| ...bench/container/benchmarks/surrogates/yahpo_gym.py | 100.00% <0.00%> (ø) |
| hpobench/dependencies/mo/scalar.py | 0.00% <0.00%> (ø) |
| hpobench/dependencies/mo/fairness_metrics.py | 0.00% <0.00%> (ø) |
| ...pobench/container/benchmarks/mo/adult_benchmark.py | 100.00% <0.00%> (ø) |
| hpobench/container/benchmarks/nas/nasbench_201.py | 36.84% <0.00%> (+36.84%) ⬆️ |

@PhMueller PhMueller requested a review from KEggensperger May 24, 2022 12:04
@PhMueller
Contributor

@KEggensperger, could you please have a look at it?

val_accuracy = model.eval_fn(ds_val, device).item()
eval_valid_runtime = time.time() - start
start = time.time()
test_accuracy = model.eval_fn(ds_test, device).item()
Contributor

Same question as for the other benchmark: why spend time on computing test metrics?

Contributor

@PhMueller PhMueller May 30, 2022

Good question. Changed it to "training time".

The eval time should be almost equal for every run, so I think it is more important to report the "training time" rather than the "total time per configuration".

Thanks for the feedback!
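
To illustrate the timing split being discussed, here is a rough sketch; `train_fn` is a placeholder name, the `eval_fn` call mirrors the snippet quoted above, and the function arguments stand in for objects the benchmark sets up internally.

```python
import time

def evaluate(model, ds_train, ds_val, device):
    """Timing-split sketch: only the training phase counts towards the reported cost."""
    start = time.time()
    model.train_fn(ds_train, device)                      # training phase (placeholder name)
    training_runtime = time.time() - start                # reported as the benchmark's cost

    start = time.time()
    val_accuracy = model.eval_fn(ds_val, device).item()   # evaluation phase
    eval_valid_runtime = time.time() - start              # roughly constant per run, so excluded from the cost

    return val_accuracy, training_runtime, eval_valid_runtime
```
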

@PhMueller PhMueller merged commit 4c4f1d9 into automl:development Jun 1, 2022
PhMueller added a commit to PhMueller/HPOBench that referenced this pull request Feb 21, 2023
Added MO CNN benchmarks from the Bag of Baselines paper

We deviate from the original benchmark in two points:
* we return only the training time as cost, instead of the total elapsed time
* as the objective to minimize we now return `1 - accuracy` instead of `-100 * accuracy`, to achieve better output scaling (see the sketch below)

Co-authored-by: ayushi-3536 <ayushi-3536@github.com>
Co-authored-by: Philipp Müller <muller-phil@gmx.net>
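
For reference, a rough sketch of how these two deviations show up in the returned result dict. The objective name and the keys inside `info` are assumptions, and the variables refer to the quantities from the timing sketch above; only the top-level `function_value` / `cost` / `info` layout follows the usual HPOBench convention.

```python
# Illustrative only -- objective and info key names are assumptions.
result = {
    "function_value": {
        # minimize 1 - accuracy instead of -100 * accuracy,
        # so the value stays in [0, 1] and scales better against other objectives
        "misclassification_rate": 1 - val_accuracy,
    },
    "cost": training_runtime,              # training time only, not the total elapsed time
    "info": {
        "valid_accuracy": val_accuracy,
        "test_accuracy": test_accuracy,
        "eval_valid_runtime": eval_valid_runtime,
    },
}
```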