TensorFlow Transformer Part-4 #11

phi-dbq · 2017-09-21T00:26:38Z

The implementation of TFTransformer based on previous steps.

1. Move `InputGraph` to its module.

1. Create tf-transformer-part4 from tf-1d-transformer 2. Merge branch 'tf-transformer-part3' into tf-transformer-part4

Signed-off-by: Philip Yang <philip.yang@databricks.com>

codecov-io · 2017-09-21T02:04:03Z

Codecov Report

Merging #11 into tf-transformer-part3 will increase coverage by 1.58%.
The diff coverage is 97.77%.

@@                   Coverage Diff                    @@
##           tf-transformer-part3      #11      +/-   ##
========================================================
+ Coverage                 84.01%   85.59%   +1.58%     
========================================================
  Files                        25       29       +4     
  Lines                      1376     1673     +297     
  Branches                      5       15      +10     
========================================================
+ Hits                       1156     1432     +276     
- Misses                      220      241      +21

Impacted Files	Coverage Δ
python/sparkdl/param/__init__.py	`100% <ø> (ø)`	⬆️
python/sparkdl/__init__.py	`100% <100%> (ø)`	⬆️
python/sparkdl/transformers/tf_tensor.py	`100% <100%> (ø)`
python/sparkdl/graph/input.py	`98.05% <100%> (+0.46%)`	⬆️
python/sparkdl/graph/builder.py	`93.75% <100%> (+0.05%)`	⬆️
python/sparkdl/param/converters.py	`82.5% <80%> (-0.17%)`	⬇️
python/sparkdl/param/shared_params.py	`81.44% <92.3%> (+3.53%)`	⬆️
python/sparkdl/transformers/keras_applications.py	`84.9% <0%> (-6.94%)`	⬇️
...a/com/databricks/sparkdl/DeepImageFeaturizer.scala	`95.38% <0%> (ø)`
...in/scala/com/databricks/sparkdl/ModelFetcher.scala	`97.14% <0%> (ø)`
... and 3 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update decdc8f...af95b74. Read the comment docs.

* adding new tests * remove original test design * cleanup

thunterdb

@phi-dbq this looks good. I just have a few comments that will be quick to address.

thunterdb · 2017-11-21T21:16:06Z

python/sparkdl/graph/input.py

@@ -92,6 +91,50 @@ def __init__(self, graph_def, input_tensor_name_from_signature,
        self.input_tensor_name_from_signature = input_tensor_name_from_signature
        self.output_tensor_name_from_signature = output_tensor_name_from_signature

+    def translateInputMapping(self, input_mapping):


these two functions are either too much or not enough: either you should provide some tests and some doc examples, or not include them. Since they are not used elsewhere, let's put then in a separate PR for now.

Well, they are actually in our API design. Let me add some tests for these guys.

Ok I see it below.

thunterdb · 2017-11-22T01:48:54Z

python/tests/transformers/tf_transformer_test.py

+    for idx in range(100):
+        _dict = {'idx': idx}
+        for colname, _ in _input_mapping.items():
+            _dict[colname] = np.random.randn(_tensor_size).tolist()


let's use something deterministic instead

thunterdb · 2017-11-22T01:51:54Z

python/tests/transformers/tf_transformer_test.py

+from ..tests import SparkDLTestCase
+
+class TFTransformerTests(SparkDLTestCase):
+    def test_graph_novar(self):


great test, we will be able to add some more pretty easily after that

thunterdb · 2017-11-22T01:52:17Z