
Python net specification #2086

Merged 3 commits into BVLC:master from python-net-spec on Jun 30, 2015

Conversation

@longjon (Contributor) commented Mar 10, 2015

master edition of #1733. Still rough, but should be usable by the intrepid.

  • Now including an AlexNet (CaffeNet variant) generation example.
  • Fixed an error in uncamel which broke acronym names (e.g., LRN) (thanks @sontran).
  • Now supports fillers (thanks @sontran).

longjon added a commit to longjon/caffe that referenced this pull request Mar 10, 2015
@longjon (Contributor, Author) commented Mar 17, 2015

This should now support repeated Messages as lists of dicts (like param or dummy_data_param's shape; @erictzeng, you asked about this earlier).

I think that means you can now specify any NetParameter as Python code. Once layer naming has been cleaned up a bit (for which I have an idea in mind), I think this will have reached mergeability as a thin wrapper around prototxt.
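For example (a sketch only, using the imports that appear later in this thread; the layer choice and numbers are arbitrary), a repeated message field such as param becomes a list of dicts, each dict filling one ParamSpec, and dummy_data_param's repeated shape works the same way:

from caffe import layers as L, to_proto

# repeated `shape` given as a list of dicts, one per BlobShape
data = L.DummyData(dummy_data_param=dict(shape=[dict(dim=[10, 3, 28, 28])]))
# repeated `param`: one ParamSpec for the weights, one for the bias
conv = L.Convolution(data, kernel_size=3, num_output=16,
                     param=[dict(lr_mult=1), dict(lr_mult=2)])
print(to_proto(conv))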

weiliu89 added a commit to weiliu89/caffe that referenced this pull request Apr 1, 2015
@muupan commented Apr 5, 2015

uncamel('HDF5Data') wrongly returns 'hd_f5_data'.

@muupan commented Apr 5, 2015

And uncamel('PReLU') wrongly returns 'p_relu'.

I think 'HDF5Data' -> 'hdf5_data' is normal uncamelling, but 'PReLU' -> 'prelu' is an exception. Maybe we have to find some other way.

@seanbell

@muupan The layers that break the rule could just be hardcoded in the "uncamel" function, e.g. with a dictionary. Something like this:

import re

_UNCAMEL_EXCEPTIONS = {
    'HDF5Data': 'hdf5_data',
    'PReLU': 'prelu',
}

def uncamel(s):
    """Convert CamelCase to underscore_case, with hardcoded exceptions."""
    return _UNCAMEL_EXCEPTIONS.get(
        s, re.sub('(?!^)([A-Z])(?=[^A-Z])', r'_\1', s).lower())
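For example (a quick doctest-style check of the sketch above; the expected outputs follow directly from the code as written):

>>> uncamel('HDF5Data')
'hdf5_data'
>>> uncamel('PReLU')
'prelu'
>>> uncamel('SoftmaxWithLoss')
'softmax_with_loss'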

@Shaunakde

@longjon I am having an issue using this PR. Running the example: http://nbviewer.ipython.org/github/BVLC/caffe/blob/tutorial/examples/01-learning-lenet.ipynb gives me the following error:

 File "/home/shaunak/caffe-pr2086/examples/wine/classify.py", line 18, in lenet
    n = caffe.NetSpec()

  File "../../python/caffe/layers.py", line 84, in __init__
    super(NetSpec, self).__setattr__('tops', OrderedDict())

TypeError: must be type, not None

Update

I tried adding an import statement for caffe as well and the following happened:

import numpy as np
import matplotlib.pyplot as plt

# Make sure that caffe is on the python path:
caffe_root = '../../'  # this file is expected to be in {caffe_root}/examples/wine
import sys
sys.path.insert(0, caffe_root + 'python')

from pylab import *

import caffe 

from caffe import layers as L
from caffe import params as P

def logreg(hdf5, batch_size):
    # logistic regression: data, matrix multiplication, and 2-class softmax loss
    n = caffe.NetSpec()
    n.data, n.label = L.HDF5Data(batch_size=batch_size, source=hdf5, ntop=2)
    n.ip1 = L.InnerProduct(n.data, num_output=2, weight_filler=dict(type='xavier'))
    n.accuracy = L.Accuracy(n.ip1, n.label)
    n.loss = L.SoftmaxWithLoss(n.ip1, n.label)
    return n.to_proto()

with open('../../examples/hdf5_classification/logreg_auto_train.prototxt', 'w') as f:
    f.write(str(logreg('examples/hdf5_classification/data/train.txt', 10)))

with open('../../examples/hdf5_classification/logreg_auto_test.prototxt', 'w') as f:
    f.write(str(logreg('examples/hdf5_classification/data/test.txt', 10)))

causes this error:

runfile('/home/shaunak/caffe-pr2086/examples/wine/classify.py', wdir='/home/shaunak/caffe-pr2086/examples/wine')
Reloaded modules: caffe, caffe.proto, caffe._caffe, caffe.pycaffe, caffe.detector, caffe.proto.caffe_pb2, caffe.io, caffe.classifier, caffe.layers
Traceback (most recent call last):

  File "<ipython-input-9-694741de221d>", line 1, in <module>
    runfile('/home/shaunak/caffe-pr2086/examples/wine/classify.py', wdir='/home/shaunak/caffe-pr2086/examples/wine')

  File "/home/shaunak/anaconda/lib/python2.7/site-packages/spyderlib/widgets/externalshell/sitecustomize.py", line 682, in runfile
    execfile(filename, namespace)

  File "/home/shaunak/anaconda/lib/python2.7/site-packages/spyderlib/widgets/externalshell/sitecustomize.py", line 78, in execfile
    builtins.execfile(filename, *where)

  File "/home/shaunak/caffe-pr2086/examples/wine/classify.py", line 26, in <module>
    f.write(str(logreg('examples/hdf5_classification/data/train.txt', 10)))

  File "/home/shaunak/caffe-pr2086/examples/wine/classify.py", line 23, in logreg
    return n.to_proto()

  File "../../python/caffe/layers.py", line 97, in to_proto
    top.fn._to_proto(layers, names, autonames)

  File "../../python/caffe/layers.py", line 78, in _to_proto
    assign_proto(layer, k, v)

  File "../../python/caffe/layers.py", line 25, in assign_proto
    setattr(proto, name, val)

AttributeError: 'LayerParameter' object has no attribute 'source'

Discussion: http://stackoverflow.com/questions/29774793/typeerror-python-class

elleryrussell pushed a commit to elleryrussell/caffe that referenced this pull request May 1, 2015
@escorciav

Hi @Shaunakde, you saved me hours of reading code, so I felt that I should help you (sorry if you already noticed this). You should apply seanbell's suggestion to your example. I guess the reason is that HDF5Data is one of the tricky CamelCase layer names.

Thank you @longjon for this tool. I have spent hours debugging prototxt files without noticing minor differences such as layers instead of layer.
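As an illustration (a sketch only, not tested against this revision of the PR): besides patching uncamel, the parameter message can also be named explicitly, following the transform_param=dict(...) pattern used elsewhere in this thread, which sidesteps the name-derivation problem entirely:

# name the parameter message explicitly instead of relying on uncamel
n.data, n.label = L.HDF5Data(ntop=2,
                             hdf5_data_param=dict(source=hdf5, batch_size=batch_size))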

@@ -0,0 +1,54 @@
from caffe import layers as L, params as P, to_proto

I was looking over this and from what I can tell the to_proto function from PR #1733 has moved
into NetSpec here.

In this case NetSpec should be imported and all layer variables should be assigned to it. This would then also print the layers in their correct order.

The alexnet function would look as follows:

def alexnet(lmdb, batch_size=256, include_acc=False):
    # conv_relu, fc_relu, and max_pool are the helpers defined in this caffenet example file
    n = NetSpec()

    n.data, n.label = L.Data(source=lmdb, backend=P.Data.LMDB, batch_size=batch_size, ntop=2,
        transform_param=dict(crop_size=227, mean_value=[104, 117, 123], mirror=True))

    # the net itself
    n.conv1, n.relu1 = conv_relu(n.data, 11, 96, stride=4)
    n.pool1 = max_pool(n.relu1, 3, stride=2)
    n.norm1 = L.LRN(n.pool1, local_size=5, alpha=1e-4, beta=0.75)
    n.conv2, n.relu2 = conv_relu(n.norm1, 5, 256, pad=2, group=2)
    n.pool2 = max_pool(n.relu2, 3, stride=2)
    n.norm2 = L.LRN(n.pool2, local_size=5, alpha=1e-4, beta=0.75)
    n.conv3, n.relu3 = conv_relu(n.norm2, 3, 384, pad=1)
    n.conv4, n.relu4 = conv_relu(n.relu3, 3, 384, pad=1, group=2)
    n.conv5, n.relu5 = conv_relu(n.relu4, 3, 256, pad=1, group=2)
    n.pool5 = max_pool(n.relu5, 3, stride=2)
    n.fc6, n.relu6 = fc_relu(n.pool5, 4096)
    n.drop6 = L.Dropout(n.relu6, in_place=True)
    n.fc7, n.relu7 = fc_relu(n.drop6, 4096)
    n.drop7 = L.Dropout(n.relu7, in_place=True)
    n.fc8 = L.InnerProduct(n.drop7, num_output=1000)
    n.loss = L.SoftmaxWithLoss(n.fc8, n.label)

    if include_acc:
        n.acc = L.Accuracy(n.fc8, n.label)

    return n.to_proto()

BR, Max

@BlGene (Contributor) commented Jun 15, 2015

Hi,

I've been using this PR for a few days and I have been able to write all the models I wanted with it. I am very pleased with it, and thank you for writing it. 👍 @longjon

The PR mentions that it supports fillers, but I don't see how these can be accessed from Python; was this omitted from the PR?

I was wondering if there is an easy way to specify parameters for Python layers; a few ideas came to mind:

  1. Just generating the python layer code with the variables in place.
  2. Extending PythonParameter in order to smuggle a dict through to the python layer:

message PythonParameter {
  optional string module = 1;
  optional string layer = 2;
  repeated string param_keys = 3;
  repeated string param_values = 4;
}

  3. Extending PythonParameter with param_string, so people can put pickled parameters in it or do whatever they want (see the sketch below).

So for this I was wondering if there is an easier way.

Also, in case it was overlooked, it's worthwhile looking at the Theano version of this, called Mariana.
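To illustrate idea 3, a minimal sketch: it assumes a param_string field were added to PythonParameter and exposed to the layer instance (here as a hypothetical self.param_str attribute), neither of which this PR provides. A JSON (or pickled) string would keep the protobuf change down to a single optional field.

import json
import caffe

class ScaleLayer(caffe.Layer):
    """Hypothetical Python layer that reads its options from the smuggled string."""
    def setup(self, bottom, top):
        # e.g. python_param { module: 'scale_layer' layer: 'ScaleLayer'
        #                     param_string: '{"scale": 0.5}' }
        opts = json.loads(self.param_str) if self.param_str else {}
        self.scale = opts.get('scale', 1.0)

    def reshape(self, bottom, top):
        top[0].reshape(*bottom[0].data.shape)

    def forward(self, bottom, top):
        top[0].data[...] = self.scale * bottom[0].data

    def backward(self, top, propagate_down, bottom):
        if propagate_down[0]:
            bottom[0].diff[...] = self.scale * top[0].diff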

@bhack (Contributor) commented Jun 15, 2015

See also Lasagne on Theano.

@BlGene BlGene mentioned this pull request Jun 15, 2015
@longjon (Contributor, Author) commented Jun 16, 2015

Getting ready to update this; here's a list of TODOs:

Cleanups and fixes before first mergeable version, which should be considered basically a "protobuf wrapper/generator":

  • Try a better solution to the implicit layer <-> parameter correspondence (currently the messy uncamel function); the fact that the current protobuf system leaves this unspecified means there is no clean answer, but I have an idea for a hack that's a little more robust.
  • Finish the transition to the NetSpec class (basically a namespace to avoid the previous atrocious hack of abusing locals); this was left as a WIP commit.
  • Minimal tests.
  • Minimal docs/examples (as begun).

Desiderata and future work to come after merge (PRs welcome!):

  • Operator overloading for arithmetic.
  • Update in accord with the solution to "Treat bottoms and params more uniformly?" (#1474).
  • Make Python layer integration more natural (possibly also by updating the way Python layers are specified in protobuf, in accord with "Switch LayerType to string for extensibility", #1685).
  • Provide a direct path from spec -> Net without serializing through a file.
  • Check things that can be checked at specification time (e.g., layer name validity, unused tops).
  • Support silence layers.

@longjon longjon force-pushed the python-net-spec branch 3 times, most recently from 9409178 to 1d9546e Compare June 18, 2015 21:50
@longjon (Contributor, Author) commented Jun 18, 2015

TODOs all done for now; marking this ready for review!

@muupan @seanbell and others, uncamel is gone; the layer-parameter correspondence is now determined through inspection of the caffe_pb2 module. The only assumption is that the parameter type of a layer named X is XParameter, which is true for all existing layers and should remain true.

@Shaunakde, note that we generally aren't able to provide support for PRs, especially in-progress ones. You're welcome to contribute to the development discussion, but otherwise please use caffe-users or other venues.

@BlGene, see the tests for an example of specifying fillers. Parameters for Python layers are a different (though related) issue; see, e.g., #2001.
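For reference, the inspection can be done roughly like this (a sketch of the idea, not necessarily the exact code merged here): enumerate the *_param fields of LayerParameter, read off each field's message type, and strip the 'Parameter' suffix to recover the layer type name.

from caffe.proto import caffe_pb2

def param_name_dict():
    """Map layer type names (e.g. 'Convolution') to parameter field names (e.g. 'convolution_param')."""
    layer = caffe_pb2.LayerParameter()
    # all fields of LayerParameter that hold a layer-specific parameter message
    param_fields = [f.name for f in layer.DESCRIPTOR.fields if f.name.endswith('_param')]
    # the message type of each such field, e.g. 'ConvolutionParameter'
    type_names = [type(getattr(layer, name)).__name__ for name in param_fields]
    # assume the parameter type of a layer named X is XParameter
    return {t[:-len('Parameter')]: name for t, name in zip(type_names, param_fields)}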

def max_pool(bottom, ks, stride=1):
    return L.Pooling(bottom, pool=P.Pooling.MAX, kernel_size=ks, stride=stride)

def alexnet(lmdb, batch_size=256, include_acc=False):

Trivial, but this should be called caffenet since it has our usual inversion.

@shelhamer (Member)

@longjon amended names and the (AttributeError, KeyError) exception handling and pushed. Merging.

shelhamer added a commit that referenced this pull request Jun 30, 2015
@shelhamer shelhamer merged commit 1d6cac2 into BVLC:master Jun 30, 2015
@kjmonaghan

Very helpful, thank you!

How do I go about changing the decay_mult parameter?

@jeffdonahue (Contributor)

e.g., L.Convolution(data, kernel_size=5, num_output=20, param=[dict(decay_mult=0.5)]). Any proto-generated object (in this case a ParamSpec) can be specified using a dict. The brackets [] are needed to make a list of them, since param is a repeated field (to support multiple parameters). You could also explicitly use a ParamSpec object (rather than a dict that gets converted into one) by doing:

from caffe.proto import caffe_pb2

filter_spec = caffe_pb2.ParamSpec()
filter_spec.decay_mult = 0.5
L.Convolution(data, kernel_size=5, num_output=20, param=[filter_spec])

(untested, could be slightly wrong)

twerdster pushed a commit to twerdster/caffe that referenced this pull request Jul 19, 2015
@jmerkow commented Jul 30, 2015

Is it possible to specify dummy data with this?
I've tried about 1000 things to get it to work and can't seem to get it...

Based on caffenet.py, I would expect that it would be something like this:

from __future__ import print_function
import os

import caffe
from caffe import layers as L, params as P, to_proto
from caffe.proto import caffe_pb2

def gen_inputs():
    data = L.DummyData(name="data",ntop=1,shape=dict(dim=[1,2,3]))
    label = L.DummyData(name="label",ntop=1,shape=dict(dim=[1,2,3]))
    return data,label

def caffenet(include_acc=False):
    data,label=gen_inputs()

    loss = L.SoftmaxWithLoss(data, label)
    return to_proto(loss)

def make_net(output_dir='./',net_name='train'):
    fname = os.path.join(output_dir,net_name+'.prototxt')
    with open(fname, 'w') as f:
        print(caffenet(), file=f)

make_net()

this produces:

layer {
  name: "data"
  type: "DummyData"
  top: "DummyData1"
}
layer {
  name: "label"
  type: "DummyData"
  top: "DummyData2"
}
layer {
  name: "SoftmaxWithLoss1"
  type: "SoftmaxWithLoss"
  bottom: "DummyData1"
  bottom: "DummyData2"
  top: "SoftmaxWithLoss1"
}

I've tried various other things using caffe_pb2 and caffe.params.
And I tried assigning dummy_data_param explicitly and with dictionaries:

def gen_inputs():
    data = L.DummyData(name="data",ntop=1,dummy_data_param=dict(shape=dict(dim=[1,2,3])))
    label = L.DummyData(name="label",ntop=1,dummy_data_param=dict(shape=dict(dim=[1,2,3])))
    return data,label
def gen_inputs():
    data_shape = caffe_pb2.BlobShape()
    data_shape.dim = [1,2,3]
    data_param = caffe_pb2.DummyDataParameter()
    data_param.shape = data_shape
    data = L.DummyData(name="data",ntop=1,dummy_data_param=data_param)
    label = L.DummyData(name="label",ntop=1,dummy_data_param=data_param)
    return data,label

Any thoughts?
--Jameson

@BlGene (Contributor) commented Jul 31, 2015

Maybe this? (shape is a repeated field in DummyDataParameter, so it should be given as a list):

from __future__ import print_function
import caffe
from caffe import layers as L, params as P, to_proto
from caffe.proto import caffe_pb2
import os

def gen_inputs():
    data  = L.DummyData(name="data", ntop=1,dummy_data_param=dict(shape=[dict(dim=[1,2,3])]))
    label = L.DummyData(name="label",ntop=1,dummy_data_param=dict(shape=[dict(dim=[1,2,3])]))
    return data,label

def caffenet(include_acc=False):
    data,label=gen_inputs()

    loss = L.SoftmaxWithLoss(data, label)
    return to_proto(loss)

def make_net(output_dir='./',net_name='train'):
    fname = os.path.join(output_dir,net_name+'.prototxt')
    with open(fname, 'w') as f:
        print(caffenet(), file=f)

make_net()

BR, Max

@longjon (Contributor, Author) commented Jul 31, 2015

@BlGene is right. If bad parameters are being silently ignored, however (in current master), that's a bug and you're welcome to open an issue.

@jmerkow commented Jul 31, 2015

I can open it with this as a test case. It may be the name 'shape', which has meaning in some contexts?

@dfagnan commented Oct 9, 2015

@longjon Does this caffenet example get all the weight_filler and bias_filler settings correct? I'm not able to easily see that the default weight_filler is somehow gaussian with std = 0.01 here. Is this true?
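For what it's worth, fillers can be written out explicitly in the spec rather than relying on defaults, reusing the weight_filler=dict(...) pattern shown earlier in this thread (a sketch only; gaussian std=0.01 is the value asked about above, not a claim about the defaults):

n.fc8 = L.InnerProduct(n.drop7, num_output=1000,
                       weight_filler=dict(type='gaussian', std=0.01),
                       bias_filler=dict(type='constant', value=0))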
