
TF 2.X & Keras 2.3.1 compatibility #2278

Open
wants to merge 2 commits into master
Conversation

IgnacioAmat

No description provided.

@VictorAtPL

@IgnacioAmat
Are you able to train this model in Tensorflow 2.2.0, or are you still struggling with some incompatibilities?

@IgnacioAmat
Author

@VictorAtPL With the changes I proposed I was able to run the training without any incompatibilities; the model trained well and showed good results.

@VictorAtPL

@IgnacioAmat
Ok, is the training done in eager mode, or is the graph compiled?
If I am eventually able to train on Tensorflow 2.2, I will have to extend the model with new heads. What I will need is good debugging support, and in Tensorflow 2.x that is easier than in Tensorflow 1.x, isn't it?

@IgnacioAmat
Author

@VictorAtPL
As Tensorflow 2.X has eager mode enabled by default, I just trained the model with eager mode enabled.
Yes, Tensorflow 2.X offers far better debugging possibilities than Tensorflow 1.X; it allows debugging normal Python code, for example with pdb.
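For reference, a minimal sketch (not code from this PR) of how to confirm from a Python or pdb prompt that TF 2.x is actually running eagerly and producing EagerTensors:

```python
import tensorflow as tf

# TF 2.x executes eagerly by default, so ops return EagerTensors
# whose values can be inspected directly (e.g. at a pdb breakpoint).
print(tf.executing_eagerly())   # True unless eager execution was disabled

x = tf.constant([1.0, 2.0]) * 2.0
print(type(x).__name__)         # the concrete tensor type
print(x.numpy())                # values are available immediately
```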

@VictorAtPL

VictorAtPL commented Jul 20, 2020

@IgnacioAmat
Okay, but did you use pdb to verify that it's really in eager mode, i.e. you saw EagerTensors with values inside them, or did you just assume it should be eager because nothing in Keras is graph by default?

@IgnacioAmat
Author

@VictorAtPL
Oh sorry, there was a misunderstanding on my part! No, I actually executed the code based on the graph definition, the Keras mode. I assumed it was eager mode since that is enabled by default, but I didn't modify the code to actually execute it eagerly.

@ivanlen

ivanlen commented Aug 10, 2020

I am having some issues with this PR.
Have you checked PR #2115?
Using both PRs together I am able to run everything on TF 2.x and Keras 2.3.1.

@IgnacioAmat IgnacioAmat changed the title TF 2.2.0 & Keras 2.3.1 compatibility TF 2.X & Keras 2.3.1 compatibility Aug 31, 2020
@IgnacioAmat
Author

You were right @ivanlen, I forgot to add those changes for TF 2.X compatibility too. Thanks for the remark!

@RishikMani

You were right @ivanlen, I forgot to add those changes for TF 2.X compatibility too. Thanks for the remark!

Thank you very much. I was finally able to execute the code. Could you also let me know, were you able to start the training? As of now I have 350 training images and 50 validation images of size 256x256 on a 4GB NVIDIA GTX 960, but it cannot train and throws an out-of-memory exception.

@IgnacioAmat
Author

Hi @RishikMani, you have to reduce your batch size to prevent running out of memory, or reduce the image size to a lower resolution.

@ivanlen

ivanlen commented Sep 2, 2020

Hey @RishikMani, ideally you can use a generator; check out:

https://www.tensorflow.org/api_docs/python/tf/keras/utils/Sequence
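A minimal Sequence sketch (illustrative names, not tied to this repo's data loading): batches are loaded on demand instead of holding the whole dataset in memory.

```python
import math
import numpy as np
from tensorflow.keras.utils import Sequence

class ImageSequence(Sequence):
    """Yields (images, masks) one batch at a time."""
    def __init__(self, images, masks, batch_size=2):
        self.images, self.masks = images, masks
        self.batch_size = batch_size

    def __len__(self):
        # number of batches per epoch
        return math.ceil(len(self.images) / self.batch_size)

    def __getitem__(self, idx):
        lo = idx * self.batch_size
        hi = lo + self.batch_size
        return np.array(self.images[lo:hi]), np.array(self.masks[lo:hi])
```

An instance of this class can be passed directly to model.fit in place of an in-memory array.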

@dsalnikov

Hi, I get an error when trying to start training with this PR:

ValueError: The following Variables were created within a Lambda layer (anchors) but are not tracked by said layer: <tf.Variable 'anchors/Variable:0' shape=(1, 36720, 4) dtype=float32> The layer cannot safely ensure proper Variable reuse across multiple calls, and consquently this behavior is disallowed for safety. Lambda layers are not well suited to stateful computation; instead, writing a subclassed Layer is the recommend way to define layers with Variables.

How can I fix this?

@Shakesbeer333

@dsalnikov there is a solution in the issue section
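The fix discussed in the issue section is typically along these lines: replace the Lambda wrapper around the anchors variable in model.py with a subclassed layer, so Keras tracks the variable. A hedged sketch (AnchorsLayer is an illustrative name, not code from this PR):

```python
import tensorflow as tf
from tensorflow.keras import layers as KL

class AnchorsLayer(KL.Layer):
    """Holds the anchors in a tracked, non-trainable variable instead of
    creating it inside a Lambda layer."""
    def __init__(self, anchors, **kwargs):
        super().__init__(**kwargs)
        self.anchors = tf.Variable(anchors, trainable=False)

    def call(self, inputs):
        # The input is ignored; the layer just exposes the stored anchors.
        return self.anchors
```

In model.py the `KL.Lambda(...)` call producing the anchors would then become something like `anchors = AnchorsLayer(anchors, name="anchors")(input_image)`.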

@kimile599

Hi @RishikMani, you have to reduce your batch size to prevent running out of memory, or reduce the image size to a lower resolution.

Hi, I am using Mask R-CNN and lowered the loss to 0.1, sometimes 0.09, but the val_loss is not converging. Any suggestions on that?

@Shakesbeer333

Is this working with CUDA 10.2?

@IgnacioAmat
Author

Yes @Shakesbeer333

@IgnacioAmat
Author

IgnacioAmat commented Sep 16, 2020

Hi @kimile599, your problem may be that you are overfitting your data during training. Have you tried increasing your dataset size or using data augmentation? Check issues #281 and #527 to see if they help with your problem.

@kimile599

Hi @kimile599, your problem may be that you are overfitting your data during training. Have you tried increasing your dataset size or using data augmentation? Check issues #281 and #527 to see if they help with your problem.

Thank you for your reply. I am now doing the augmentation and trying to flatten the loss.

@jvdavim

jvdavim commented Sep 24, 2020

What about updating requirements.txt?

@IgnacioAmat
Author

@jvdavim Are you thinking about raising the minimal versions of both TF and Keras, to something like tensorflow>=2.0 and keras>=2.3.1?

@jvdavim

jvdavim commented Sep 25, 2020

Yes. But I just realized that there is no difference; it will install the latest version anyway.

@jvdavim

jvdavim commented Sep 28, 2020

I got this error:
AttributeError: module 'tensorflow.python.framework.ops' has no attribute '_TensorLike'

tensorflow==2.3.1
keras==2.3.1

@IgnacioAmat
Author

Please check @NMazzatenta's comment in this issue.
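For context, this '_TensorLike' AttributeError typically comes from mixing the standalone keras 2.3.x package with a newer TF release. One commonly suggested fix (a hedged sketch, not the change made in this PR) is to import Keras through TensorFlow instead:

```python
# Before (standalone Keras, breaks against newer TF internals):
#   import keras
#   import keras.backend as K
#   import keras.layers as KL

# After (bundled tf.keras, versioned together with TF):
from tensorflow import keras
import tensorflow.keras.backend as K
import tensorflow.keras.layers as KL
```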

@sirbastiano

@VictorAtPL With the changes I proposed I was able to run the training without any incompatibilities; the model trained well and showed good results.
Got this error running in Colab:
ValueError:
The following Variables were created within a Lambda layer (anchors)
but are not tracked by said layer:
<tf.Variable 'anchors/Variable:0' shape=(8, 4092, 4) dtype=float32>
The layer cannot safely ensure proper Variable reuse across multiple
calls, and consquently this behavior is disallowed for safety. Lambda
layers are not well suited to stateful computation; instead, writing a
subclassed Layer is the recommend way to define layers with
Variables.


@IgnacioAmat
Author

@sirbastiano you seem to be having the same problem as @dsalnikov. You could check this issue to see if it helps, but as @Shakesbeer333 said, the solution is in the issue section.

@sirbastiano

sirbastiano commented Oct 5, 2020 via email

@IgnacioAmat
Author

No, sorry @sirbastiano. Maybe @dsalnikov can give you a hint on how he managed to solve that problem.

@sirbastiano

sirbastiano commented Oct 5, 2020 via email

@IgnacioAmat
Author

IgnacioAmat commented Oct 5, 2020

You have to delete that parameter from the model.py file, as it is enabled by default in config.py, as you can check in this issue.

@sirbastiano

sirbastiano commented Oct 5, 2020 via email

@sirbastiano

sirbastiano commented Oct 8, 2020 via email

@Ademord

Ademord commented Oct 8, 2020

My basic test was to run the /samples notebooks.
The first one, demo, doesn't run.
The other ones throw a lot of errors after the mini-masks section and data_generator.

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-1-aa019bc2cbcd> in <module>
     16 sys.path.append(ROOT_DIR)  # To find local version of the library
     17 from mrcnn import utils
---> 18 import mrcnn.model as modellib
     19 from mrcnn import visualize
     20 # Import COCO config

~/src/maskrcnn-amatt/mrcnn/model.py in <module>
     18 import numpy as np
     19 import tensorflow as tf
---> 20 import keras
     21 import keras.backend as K
     22 import keras.layers as KL

ModuleNotFoundError: No module named 'keras'

@IgnacioAmat
Author

error: 'Dataset' object non subscriptable

Hi @sirbastiano, what I think you are doing while getting this error is indexing an object that doesn't support it. Check that your Dataset object is subscriptable (i.e. implements __getitem__) to avoid this problem. Can you provide some more code and the error traceback?
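Illustrative only (this Dataset is a hypothetical stand-in, not the mrcnn class): an object becomes subscriptable by implementing __getitem__.

```python
class Dataset:
    """Minimal subscriptable container: ds[i] works because of __getitem__."""
    def __init__(self, items):
        self._items = list(items)

    def __getitem__(self, idx):
        return self._items[idx]

    def __len__(self):
        return len(self._items)
```

Without __getitem__, `ds[0]` raises exactly a "'Dataset' object is not subscriptable" TypeError.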

@sirbastiano

sirbastiano commented Oct 14, 2020 via email

@IgnacioAmat
Author

My basic test was to run the /samples notebooks.
The first one, demo, doesn't run.
The other ones throw a lot of errors after the mini-masks section and data_generator.

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-1-aa019bc2cbcd> in <module>
     16 sys.path.append(ROOT_DIR)  # To find local version of the library
     17 from mrcnn import utils
---> 18 import mrcnn.model as modellib
     19 from mrcnn import visualize
     20 # Import COCO config

~/src/maskrcnn-amatt/mrcnn/model.py in <module>
     18 import numpy as np
     19 import tensorflow as tf
---> 20 import keras
     21 import keras.backend as K
     22 import keras.layers as KL

ModuleNotFoundError: No module named 'keras'

Hi @Ademord, this error means that the keras module is not installed in your current environment. Install it with conda (conda install -c conda-forge keras) or with pip (pip install keras).

@wiktor-jurek

I think I may have found another incompatibility in the utils module. Calling utils.compute_ap() ends up performing an np.dot() calculation, but the two given mask arrays have a shape mismatch. I've looked through the utils code, but I can't find a reason why downgrading TF results in a successful operation. This is briefly mentioned in #960, and the suggested advice is to downgrade TF.

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-9-1731946196d5> in <module>
     22     #precision_, recall_, AP_
     23     AP_, precision_, recall_, overlap_ = utils.compute_ap(gt_bbox, gt_class_id, gt_mask,
---> 24                                           r['rois'], r['class_ids'], r['scores'], r['masks'])
     25     #check if the vectors len are equal
     26     print("the actual len of the gt vect is : ", len(gt_tot))

~/project/2_MaskRCNN/mrcnn/utils.py in compute_ap(gt_boxes, gt_class_ids, gt_masks, pred_boxes, pred_class_ids, pred_scores, pred_masks, iou_threshold)
    716         gt_boxes, gt_class_ids, gt_masks,
    717         pred_boxes, pred_class_ids, pred_scores, pred_masks,
--> 718         iou_threshold)
    719 
    720     # Compute precision and recall at each prediction box step

~/project/2_MaskRCNN/mrcnn/utils.py in compute_matches(gt_boxes, gt_class_ids, gt_masks, pred_boxes, pred_class_ids, pred_scores, pred_masks, iou_threshold, score_threshold)
    669 
    670     # Compute IoU overlaps [pred_masks, gt_masks]
--> 671     overlaps = compute_overlaps_masks(pred_masks, gt_masks)
    672 
    673     # Loop through predictions and find matching ground truth boxes

~/project/2_MaskRCNN/mrcnn/utils.py in compute_overlaps_masks(masks1, masks2)
    118 
    119     # intersections and union
--> 120     intersections = np.dot(masks1.T, masks2)
    121     union = area1[:, None] + area2[None, :] - intersections
    122     overlaps = intersections / union

<__array_function__ internals> in dot(*args, **kwargs)

ValueError: shapes (2,65536) and (3136,43) not aligned: 65536 (dim 1) != 3136 (dim 0)
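The mismatch in the traceback is informative: 65536 = 256*256 while 3136 = 56*56, so one mask set is full-resolution and the other is likely still at the 56x56 mini-mask scale. compute_overlaps_masks flattens each [H, W, N] stack and takes a dot product, which only works when both stacks share the same H x W. A simplified paraphrase of the mrcnn.utils function (not a drop-in fix):

```python
import numpy as np

def compute_overlaps_masks(masks1, masks2):
    """IoU overlaps between two mask stacks; both must be [H, W, N]
    at the SAME H x W, otherwise the flattened dot product fails
    exactly as in the traceback above."""
    m1 = np.reshape(masks1 > 0.5, (-1, masks1.shape[-1])).astype(np.float32)
    m2 = np.reshape(masks2 > 0.5, (-1, masks2.shape[-1])).astype(np.float32)
    area1 = np.sum(m1, axis=0)
    area2 = np.sum(m2, axis=0)
    intersections = np.dot(m1.T, m2)                       # [N1, N2]
    union = area1[:, None] + area2[None, :] - intersections
    return intersections / union
```

So the likely remedy is to ensure predicted masks are unmolded back to image resolution (or ground-truth masks resized to match) before calling utils.compute_ap.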

@Ademord

Ademord commented Mar 24, 2021

If anyone is looking for an alternative to this repo: I moved to Detectron2 and saw even better performance in terms of speed and accuracy, so I would recommend it.

@guilhermemarim

Hi everyone! I'm using TF 2.1.0 and Keras 2.3.1 and I got this error:

AttributeError: module 'tensorflow' has no attribute 'random_shuffle'

Can someone help me?
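For reference, tf.random_shuffle was removed in TF 2.x; the op lives at tf.random.shuffle there. A hedged sketch of the rename and a version-tolerant lookup:

```python
import tensorflow as tf

x = tf.constant([1, 2, 3, 4])
shuffled = tf.random.shuffle(x)      # TF 2.x name for the old tf.random_shuffle

# Version-tolerant lookup (the fallback branch only runs on TF 1.x):
shuffle_fn = getattr(tf.random, "shuffle", None)
if shuffle_fn is None:
    shuffle_fn = tf.random_shuffle   # TF 1.x name
```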
