Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Example with collecting timestamp of the metrics #970

Merged
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
21 commits
Select commit Hold shift + click to select a range
bebf6c0
Increase Suggestion memLimit
andreyvelich Dec 9, 2019
f1bd3ca
Create getSuggestionConfigData function
andreyvelich Dec 10, 2019
cafd115
Change memLimit for nasrl
andreyvelich Dec 10, 2019
2e57536
Merge remote-tracking branch 'upstream/master' into increase-suggesti…
andreyvelich Dec 10, 2019
f08fd42
Change resources format for katib-config
andreyvelich Dec 11, 2019
c8acc67
Merge remote-tracking branch 'upstream/master' into issue-944-timesta…
andreyvelich Dec 11, 2019
30eaad5
Create example with recording metrics timestamp
andreyvelich Dec 11, 2019
f840692
Merge remote-tracking branch 'upstream/master' into issue-944-timesta…
andreyvelich Dec 12, 2019
1aa3a8e
Add comment line
andreyvelich Dec 12, 2019
d52bd9d
Merge remote-tracking branch 'upstream/master' into issue-944-timesta…
andreyvelich Jan 8, 2020
923902a
Change example from pytorch to mxnet
andreyvelich Jan 8, 2020
9954e6b
Delete find_mxnet file
andreyvelich Jan 8, 2020
d3cbdda
Change mxnet-mnist-timestamp to mxnet-mnist
andreyvelich Jan 10, 2020
68979df
Merge remote-tracking branch 'upstream/master' into issue-944-timesta…
andreyvelich Jan 10, 2020
b3ca4ae
Reduce num epochs in grid
andreyvelich Jan 10, 2020
a714749
Enable autoscaling in CI cluster
andreyvelich Jan 10, 2020
933fb3c
Add max nodes
andreyvelich Jan 10, 2020
6710326
Add num nodes 6
andreyvelich Jan 10, 2020
ccbd475
Increase num nodes
andreyvelich Jan 10, 2020
0c651a5
Change num nodes to 6
andreyvelich Jan 13, 2020
f25c13d
Remove autoscaling
andreyvelich Jan 13, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Change example from pytorch to mxnet
  • Loading branch information
andreyvelich committed Jan 8, 2020
commit 923902acdb9988c2bad52cd7671fde5afddd68cf
6 changes: 0 additions & 6 deletions examples/v1alpha3/metrics-with-timestamp/Dockerfile

This file was deleted.

154 changes: 0 additions & 154 deletions examples/v1alpha3/metrics-with-timestamp/mnist.py

This file was deleted.

Original file line number Diff line number Diff line change
Expand Up @@ -2,14 +2,14 @@ apiVersion: "kubeflow.org/v1alpha3"
kind: Experiment
metadata:
namespace: kubeflow
name: metrics-with-timestamp
name: mxnet-mnist-timestamp
spec:
objective:
type: maximize
goal: 0.99
objectiveMetricName: accuracy
objectiveMetricName: Validation-accuracy
additionalMetricNames:
- loss
- Train-accuracy
algorithm:
algorithmName: random
parallelTrialCount: 3
Expand All @@ -21,11 +21,18 @@ spec:
feasibleSpace:
min: "0.01"
max: "0.03"
- name: --momentum
parameterType: double
- name: --num-layers
parameterType: int
feasibleSpace:
min: "2"
max: "5"
- name: --optimizer
parameterType: categorical
feasibleSpace:
min: "0.3"
max: "0.7"
list:
- sgd
- adam
- ftrl
trialTemplate:
goTemplate:
rawTemplate: |-
Expand All @@ -39,12 +46,11 @@ spec:
spec:
containers:
- name: {{.Trial}}
image: docker.io/andreyvelichkevich/timestamp-metric
image: docker.io/kubeflowkatib/mxnet-mnist-timestamp
imagePullPolicy: Always
command:
- "python"
- "/var/mnist.py"
- "--epochs=3"
- "python3"
- "/opt/mxnet-mnist-timestamp/mnist.py"
{{- with .HyperParameters}}
{{- range .}}
- "{{.Name}}={{.Value}}"
Expand Down
13 changes: 13 additions & 0 deletions examples/v1alpha3/mxnet-mnist-timestamp/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
FROM ubuntu:16.04

RUN apt-get update && \
apt-get install -y wget python3-dev gcc && \
wget https://bootstrap.pypa.io/get-pip.py && \
python3 get-pip.py

RUN pip3 install mxnet

ADD . /opt/mxnet-mnist-timestamp
WORKDIR /opt/mxnet-mnist-timestamp

ENTRYPOINT ["python3", "/opt/mxnet-mnist-timestamp/mnist.py"]
6 changes: 6 additions & 0 deletions examples/v1alpha3/mxnet-mnist-timestamp/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
# Mxnet image classification example
This is Mxnet image classification training container with recording time of the metrics.

It uses random suggestion algorithm and has only simple multilayer perceptron network (mlp).

If you want to read more about this example, visit official [incubator-mxnet](https://github.com/apache/incubator-mxnet/tree/master/example/image-classification) github repository.
Empty file.
24 changes: 24 additions & 0 deletions examples/v1alpha3/mxnet-mnist-timestamp/common/find_mxnet.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

import os, sys
try:
import mxnet as mx
except ImportError:
curr_path = os.path.abspath(os.path.dirname(__file__))
sys.path.append(os.path.join(curr_path, "../../../python"))
import mxnet as mx
Loading