
Commit af662ac (parent 304d4fe)

update to version 1.1.0

1,955 files changed: +12,739 / -3,156,063 lines


README.md

Lines changed: 33 additions & 81 deletions
@@ -1,98 +1,50 @@
-[![Documentation Status](https://readthedocs.org/projects/deepobs/badge/?version=latest)](http://deepobs.readthedocs.io/?badge=latest)
-
-
 # DeepOBS - A Deep Learning Optimizer Benchmark Suite
 
-DeepOBS is a benchmarking suite that drastically simplifies, automates and improves the evaluation of deep learning optimizers.
-
-It can evaluate the performance of new optimizers on a variety of **real-world test problems** and automatically compare them with **realistic baselines**.
-
-The full documentation is available on readthedocs: https://deepobs.readthedocs.io/
-
-The paper describing DeepOBS has been accepted for ICLR 2019 and can be found here:
-https://openreview.net/forum?id=rJg6ssC5Y7
-
-Currently we provide DeepOBS only for TensorFlow, but plan to provide a PyTorch version soon. In the meantime, PyTorch users can still use parts of DeepOBS such as the data preprocessing scripts or the visualization features.
-
-## Quick Start Guide
-
-### Install Deep OBS
-pip install git+https://github.com/fsschneider/DeepOBS.git
-
-### Download the data
-deepobs_prepare_data.sh
-
-This will automatically download, sort and prepare all the datasets (except ImageNet). It can take a while, as it will download roughly 1 GB.
-(If you already have the data, you could skip this step and always tell Deep OBS where the data is instead.)
-The data is now in a folder called 'data_deepobs'.
-
-You are now ready to run different optimizers on different test problems, you can try for example
-
-deepobs_run_sgd.py mnist.mnist_mlp --num_epochs=2 --lr=1e-1 --bs=128 --nologs
+![DeepOBS](docs/deepobs_banner.png "DeepOBS")
 
-to run SGD on a simple multi-layer perceptron (with a learning rate of 1e-1 and a batch size of 128 for 4 epochs without keeping logs).
+**DeepOBS** is a benchmarking suite that drastically simplifies, automates and
+improves the evaluation of deep learning optimizers.
 
-Of course, the real value of a benchmark lies in evaluating new optimizers:
+It can evaluate the performance of new optimizers on a variety of
+**real-world test problems** and automatically compare them with
+**realistic baselines**.
 
-### Download and edit a run script
-You can download a template run script from there
+DeepOBS automates several steps when benchmarking deep learning optimizers:
 
-https://github.com/fsschneider/DeepOBS/blob/master/scripts/deepobs_run_sgd.py
+- Downloading and preparing data sets.
+- Setting up test problems consisting of contemporary data sets and realistic
+  deep learning architectures.
+- Running the optimizers on multiple test problems and logging relevant
+  metrics.
+- Reporting and visualization the results of the optimizer benchmark.
 
-Now you have a deepobs_run_script.py script in your folder. In order to run your optimizer, you need to change a few things in this script.
-The script takes take of the training, evaluation and logging.
-Let's assume that we want to benchmark the RMSProp optimizer. Then we only have to change Line 129 from
+![DeepOBS Output](docs/deepobs.jpg "DeepOBS_output")
 
-opt = tf.train.GradientDescentOptimizer(lr)
+The code for the current implementation working with **TensorFlow** can be found
+on [Github](https://github.com/fsschneider/DeepOBS).
 
-to
+The full documentation is available on readthedocs:
+https://deepobs.readthedocs.io/
 
-opt = tf.train.RMSPropOptimizer(lr)
-
-Usually the hyperparameters of the optimizers need to be included as well, but for now let's only take the learning rate as a hyperparameter for RMSProp (and if you want change all the 'sgd's in the comments to 'rmsprop'). Let's name this run script now deepobs_run_rmsprop.py
-
-### Run your optimizer
-You can now run your optimizer on a test problem. Let's try it on a noisy quadratic problem:
-
-python deepobs_run_rmsprop.py quadratic.noisy_quadratic --num_epochs=100 --lr=1e-1 --bs=128 --pickle --run_name=RMSProp_1e-1/
-
-(we can repeat this a couple of times with different random seeds. This way, we will get a measure of uncertainty in the benchmark plots)
-
-python deepobs_run_rmsprop.py quadratic.noisy_quadratic --num_epochs=100 --lr=1e-1 --bs=128 --pickle --run_name=RMSProp_1e-1/ --random_seed=43
-python deepobs_run_rmsprop.py quadratic.noisy_quadratic --num_epochs=100 --lr=1e-1 --bs=128 --pickle --run_name=RMSProp_1e-1/ --random_seed=44
-
-You can monitor the training in real-time using Tensorboard
-
-tensorboard --logdir=results
-
-For this example, we will run the above code again, but with a different learning rate. We will call this "second optimizer" RMRProp_1e-2
-
-python deepobs_run_rmsprop.py quadratic.noisy_quadratic --num_epochs=100 --lr=1e-2 --bs=128 --pickle --run_name=RMSProp_1e-2/
-python deepobs_run_rmsprop.py quadratic.noisy_quadratic --num_epochs=100 --lr=1e-2 --bs=128 --pickle --run_name=RMSProp_1e-2/ --random_seed=43
-python deepobs_run_rmsprop.py quadratic.noisy_quadratic --num_epochs=100 --lr=1e-2 --bs=128 --pickle --run_name=RMSProp_1e-2/ --random_seed=44
-
-If you want to you can quickly run both optimizers on another problem
-
-python deepobs_run_rmsprop.py mnist.mnist_mlp --num_epochs=5 --lr=1e-1 --bs=128 --pickle --run_name=RMSProp_1e-1/
-python deepobs_run_rmsprop.py mnist.mnist_mlp --num_epochs=5 --lr=1e-1 --bs=128 --pickle --run_name=RMSProp_1e-1/ --random_seed=43
-python deepobs_run_rmsprop.py mnist.mnist_mlp --num_epochs=5 --lr=1e-1 --bs=128 --pickle --run_name=RMSProp_1e-1/ --random_seed=44
+The paper describing DeepOBS has been accepted for ICLR 2019 and can be found
+here:
+https://openreview.net/forum?id=rJg6ssC5Y7
 
-python deepobs_run_rmsprop.py mnist.mnist_mlp --num_epochs=5 --lr=1e-2 --bs=128 --pickle --run_name=RMSProp_1e-2/
-python deepobs_run_rmsprop.py mnist.mnist_mlp --num_epochs=5 --lr=1e-2 --bs=128 --pickle --run_name=RMSProp_1e-2/ --random_seed=43
-python deepobs_run_rmsprop.py mnist.mnist_mlp --num_epochs=5 --lr=1e-2 --bs=128 --pickle --run_name=RMSProp_1e-2/ --random_seed=44
+We are actively working on a **PyTorch** version and will be releasing it in the
+next months. In the meantime, PyTorch users can still use parts of DeepOBS such
+as the data preprocessing scripts or the visualization features.
 
 
-### Plot Results
-Now we can plot the results of those two "new" optimizers "RMSProp_1e-1" and "RMSProp_1e-2". Since the performance is always relative, we automatically plot the performance against the most popular optimizers (SGD, Momentum, Adam) with the best settings we found after tuning their hyperparameters. Try out:
+## Installation
 
-deepobs_plot_results.py --results_dir=results --log
+pip install git+https://github.com/fsschneider/DeepOBS.git
 
-which shows you the learning curves (loss and accuracy for both test and train dataset, but in the case of optimizing a quadratic, there is no accuracy) on a logarithmic plot.
-Additionally it will print out a table summarizing the performances over all test problems (here we only have one or two).
-If you add the option --saveto=save_dir the plots and a color coded table are saved as .png and ready-to-include .tex-files!
+Note, that the installation process can take a while as it will also
+automatically download all baseline results.
 
-### Estimate runtime overhead
-You can estimate the runtime overhead of the new optimizers compared to SGD like this:
+We tested the package with Python 3.6 and TensorFlow version 1.12. Other
+versions of Python and TensorFlow (>= 1.4.0) might work, and we plan to expand
+compatibility in the future.
 
-deepobs_estimate_runtime.py deepobs_run_rmsprop.py --optimizer_arguments=--lr=1e-2
-It will return an estimate of the overhead of the new optimizer compared to SGD. In our case it should be quite close to 1.0, as RMSProp costs roughly the same as SGD.
+Further tutorials and a suggested protocol for benchmarking deep learning
+optimizers can be found on https://deepobs.readthedocs.io/
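
For context, the optimizer swap described in the removed quick-start section amounts to a one-line change in the template run script. A minimal sketch (TensorFlow 1.x, matching the two lines quoted in the old README; `lr` stands in for the learning rate that the run script normally parses from its --lr flag):

    import tensorflow as tf  # TensorFlow 1.x, as targeted by this version of DeepOBS

    lr = 1e-2  # learning rate; the run script normally reads this from --lr

    # Original line in the SGD template run script:
    # opt = tf.train.GradientDescentOptimizer(lr)

    # One-line change to benchmark RMSProp instead:
    opt = tf.train.RMSPropOptimizer(lr)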

__init__.py

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# -*- coding: utf-8 -*-
+
+from . import deepobs

deepobs/__init__.py

Lines changed: 4 additions & 12 deletions
@@ -1,13 +1,5 @@
-import cifar100
-import cifar10
-import mnist
-import fmnist
-import tolstoi
-import imagenet
-import svhn
-import two_d
-import quadratic
+# -*- coding: utf-8 -*-
 
-import dataset_utils
-import run_utils
-import plot_utils
+from . import tensorflow
+from . import analyzer
+from . import scripts
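
The rewritten deepobs/__init__.py replaces the flat, Python-2-style module imports with three explicit subpackages. A minimal sketch of what the new top level exposes after installation; only the submodule names are taken from this diff, and the one-line descriptions are assumptions based on the README and the analyzer code below:

    import deepobs

    print(deepobs.tensorflow)  # TensorFlow test problems and runners (scope assumed from the README)
    print(deepobs.analyzer)    # result parsing and plotting (see deepobs/analyzer/analyze.py below)
    print(deepobs.scripts)     # command-line helper scripts (scope assumed from the name)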

deepobs/analyzer/__init__.py

Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,2 @@
+from . import analyze_utils
+from . import analyze

deepobs/analyzer/analyze.py

Lines changed: 191 additions & 0 deletions
@@ -0,0 +1,191 @@
+#!/usr/bin/env python
+
+from __future__ import print_function
+
+import matplotlib.pyplot as plt
+import seaborn as sns
+
+import deepobs
+
+sns.set()
+sns.set_style("whitegrid", {
+    'axes.grid': False,
+    'axes.spines.top': False,
+    'axes.spines.right': False
+})
+
+
+def get_best_run(folder_pars):
+    print("Get best run\n\n")
+    for _, testprob in folder_pars.testproblems.items():
+        print("***********************")
+        print("Analyzing", testprob.name)
+        print("***********************")
+        for _, opt in testprob.optimizers.items():
+            # print("Analyzing", opt.name)
+            print("Checked", opt.num_settings, "settings for", opt.name,
+                  "and found the following")
+            setting_final = opt.get_best_setting_final()
+            setting_best = opt.get_best_setting_best()
+            print("Best Setting (Final Value)", setting_final.name,
+                  "with final performance of",
+                  setting_final.aggregate.final_value)
+            print("Best Setting (Best Value)", setting_best.name,
+                  "with best performance of",
+                  setting_best.aggregate.best_value)
+
+
+def plot_lr_sensitivity(folder_pars, baseline_pars=None, mode='final'):
+    print("Plot learning rate sensitivity plot")
+    fig, axis = plt.subplots(2, 4, figsize=(35, 4))
+
+    ax_row = 0
+    for testprob in [
+            "quadratic_deep", "mnist_vae", "fmnist_2c2d", "cifar10_3c3d"
+    ]:
+        if testprob in folder_pars.testproblems:
+            for _, opt in folder_pars.testproblems[testprob].optimizers.items(
+            ):
+                opt.plot_lr_sensitivity(axis[0][ax_row], mode=mode)
+        ax_row += 1
+    if baseline_pars is not None:
+        ax_row = 0
+        for testprob in [
+                "quadratic_deep", "mnist_vae", "fmnist_2c2d", "cifar10_3c3d"
+        ]:
+            if testprob in baseline_pars.testproblems:
+                for _, opt in baseline_pars.testproblems[
+                        testprob].optimizers.items():
+                    opt.plot_lr_sensitivity(axis[0][ax_row], mode=mode)
+            ax_row += 1
+    ax_row = 0
+    for testprob in [
+            "fmnist_vae", "cifar100_allcnnc", "svhn_wrn164", "tolstoi_char_rnn"
+    ]:
+        if testprob in folder_pars.testproblems:
+            for _, opt in folder_pars.testproblems[testprob].optimizers.items(
+            ):
+                opt.plot_lr_sensitivity(axis[1][ax_row], mode=mode)
+        ax_row += 1
+    if baseline_pars is not None:
+        ax_row = 0
+        for testprob in [
+                "fmnist_vae", "cifar100_allcnnc", "svhn_wrn164",
+                "tolstoi_char_rnn"
+        ]:
+            if testprob in baseline_pars.testproblems:
+                for _, opt in baseline_pars.testproblems[
+                        testprob].optimizers.items():
+                    opt.plot_lr_sensitivity(axis[1][ax_row], mode=mode)
+            ax_row += 1
+
+    fig, axis = deepobs.analyzer.analyze_utils.beautify_lr_sensitivity(
+        fig, axis)
+    deepobs.analyzer.analyze_utils.texify_lr_sensitivity(fig, axis)
+    plt.show()
+
+
+def plot_performance(folder_pars, baseline_pars=None, mode="most"):
+    # Small Benchmark
+    fig, axis = plt.subplots(4, 4, sharex='col', figsize=(25, 8))
+
+    ax_col = 0
+    for testprob in [
+            "quadratic_deep", "mnist_vae", "fmnist_2c2d", "cifar10_3c3d"
+    ]:
+        if testprob in folder_pars.testproblems:
+            for _, opt in folder_pars.testproblems[testprob].optimizers.items(
+            ):
+                opt.plot_performance(axis[:, ax_col], mode=mode)
+        ax_col += 1
+    if baseline_pars is not None:
+        ax_col = 0
+        for testprob in [
+                "quadratic_deep", "mnist_vae", "fmnist_2c2d", "cifar10_3c3d"
+        ]:
+            if testprob in baseline_pars.testproblems:
+                for _, opt in baseline_pars.testproblems[
+                        testprob].optimizers.items():
+                    opt.plot_performance(axis[:, ax_col], mode='most')
+            ax_col += 1
+    fig, axis = deepobs.analyzer.analyze_utils.beautify_plot_performance(
+        fig, axis, folder_pars, "small")
+    deepobs.analyzer.analyze_utils.texify_plot_performance(fig, axis, "small")
+    plt.show()
+
+    # Large Benchmark
+    fig, axis = plt.subplots(4, 4, sharex='col', figsize=(25, 8))
+
+    ax_col = 0
+    for testprob in [
+            "fmnist_vae", "cifar100_allcnnc", "svhn_wrn164", "tolstoi_char_rnn"
+    ]:
+        if testprob in folder_pars.testproblems:
+            for _, opt in folder_pars.testproblems[testprob].optimizers.items():
+                opt.plot_performance(axis[:, ax_col], mode=mode)
+        ax_col += 1
+    if baseline_pars is not None:
+        ax_col = 0
+        for testprob in [
+                "fmnist_vae", "cifar100_allcnnc", "svhn_wrn164",
+                "tolstoi_char_rnn"
+        ]:
+            if testprob in baseline_pars.testproblems:
+                for _, opt in baseline_pars.testproblems[
+                        testprob].optimizers.items():
+                    opt.plot_performance(axis[:, ax_col], mode='most')
+            ax_col += 1
+    fig, axis = deepobs.analyzer.analyze_utils.beautify_plot_performance(
+        fig, axis, folder_pars, "large")
+    deepobs.analyzer.analyze_utils.texify_plot_performance(fig, axis, "large")
+    plt.show()
+
+
+def plot_table(folder_pars, baseline_pars=None):
+    print("Plot overall performance table")
+
+    bm_table_small = dict()
+    for testprob in [
+            "quadratic_deep", "mnist_vae", "fmnist_2c2d", "cifar10_3c3d"
+    ]:
+        bm_table_small[testprob] = dict()
+        bm_table_small[testprob]['Performance'] = dict()
+        bm_table_small[testprob]['Speed'] = dict()
+        bm_table_small[testprob]['Tuneability'] = dict()
+        if testprob in folder_pars.testproblems:
+            for _, opt in folder_pars.testproblems[testprob].optimizers.items():
+                bm_table_small[testprob] = opt.get_bm_table(
+                    bm_table_small[testprob])
+        if baseline_pars is not None:
+            if testprob in baseline_pars.testproblems:
+                for _, opt in baseline_pars.testproblems[
+                        testprob].optimizers.items():
+                    bm_table_small[testprob] = opt.get_bm_table(
+                        bm_table_small[testprob])
+    bm_table_small_pd = deepobs.analyzer.analyze_utils.beautify_plot_table(
+        bm_table_small)
+    deepobs.analyzer.analyze_utils.texify_plot_table(bm_table_small_pd,
+                                                     "small")
+
+    bm_table_large = dict()
+    for testprob in [
+            "fmnist_vae", "cifar100_allcnnc", "svhn_wrn164", "tolstoi_char_rnn"
+    ]:
+        bm_table_large[testprob] = dict()
+        bm_table_large[testprob]['Performance'] = dict()
+        bm_table_large[testprob]['Speed'] = dict()
+        bm_table_large[testprob]['Tuneability'] = dict()
+        if testprob in folder_pars.testproblems:
+            for _, opt in folder_pars.testproblems[testprob].optimizers.items():
+                bm_table_large[testprob] = opt.get_bm_table(
+                    bm_table_large[testprob])
+        if baseline_pars is not None:
+            if testprob in baseline_pars.testproblems:
+                for _, opt in baseline_pars.testproblems[
+                        testprob].optimizers.items():
+                    bm_table_large[testprob] = opt.get_bm_table(
+                        bm_table_large[testprob])
+
+    bm_table_large_pd = deepobs.analyzer.analyze_utils.beautify_plot_table(
+        bm_table_large)
+    deepobs.analyzer.analyze_utils.texify_plot_table(bm_table_large_pd,
+                                                     "large")
