Errors only with mesh #189

Open
@VorlonCD

Hello!

Merry Thanksgiving??

I have 2 Windows machines, both with YOLOv5 .NET and YOLOv5 6.2 enabled. If I point to either one individually without mesh enabled, they work all day long without trouble, so I think that rules out cards, memory, and drivers.

As soon as I enable mesh (no matter which machine is the master) I start to get 1 of 2 errors depending on which module processes the request:

YOLOv5 .NET
{"error":"No File supplied for object detection.","inferenceMs":0,"processMs":0,"analysisRoundTripMs":30000,"success":false,"moduleName":"Object Detection (YOLOv5 .NET)","moduleId":"ObjectDetectionYOLOv5Net","command":"detect","requestId":"0d584a1f-c553-407b-a3f8-ba29139a7238","processedBy":"PCNAME","timestampUTC":"Wed, 27 Nov 2024 15:23:33 GMT"}

YOLOv5 6.2
{"success":false,"error":"Error occurred on the server","moduleId":"ObjectDetectionYOLOv5-6.2","moduleName":"Object Detection (YOLOv5 6.2)","code":500,"command":"detect","requestId":"7b64bd0f-39eb-45f0-ac84-9c67ecf49e53","inferenceDevice":"GPU","analysisRoundTripMs":30000,"processedBy":"PCNAME","timestampUTC":"Wed, 27 Nov 2024 14:11:11 GMT"}
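
To compare the two failures side by side, here is a minimal sketch that pulls a few fields out of the payloads quoted above (trimmed to the fields used here). The helper name `summarize` is hypothetical and not part of CodeProject.AI.

```python
import json

# The two failure payloads from this report, trimmed to the fields compared here.
NET_RESPONSE = (
    '{"error":"No File supplied for object detection.","analysisRoundTripMs":30000,'
    '"success":false,"moduleId":"ObjectDetectionYOLOv5Net","processedBy":"PCNAME"}'
)
PY_RESPONSE = (
    '{"success":false,"error":"Error occurred on the server",'
    '"moduleId":"ObjectDetectionYOLOv5-6.2","code":500,'
    '"analysisRoundTripMs":30000,"processedBy":"PCNAME"}'
)


def summarize(raw):
    """Return the fields that are useful when comparing mesh failures."""
    data = json.loads(raw)
    return {
        "module": data.get("moduleId"),
        "ok": data.get("success"),
        "error": data.get("error"),
        "round_trip_ms": data.get("analysisRoundTripMs"),
        "processed_by": data.get("processedBy"),
    }


summaries = [summarize(NET_RESPONSE), summarize(PY_RESPONSE)]
for summary in summaries:
    print(summary)
```

Both failures report an `analysisRoundTripMs` of exactly 30000. That might point to a timeout somewhere in the mesh path, but that is only a guess from the numbers.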

For 6.2 there is also an error in the server log that doesn't exist for .NET. From Googling around, I believe it essentially means the same thing as above ("File not found"):

Response rec'd from Object Detection (YOLOv5 6.2) command 'detect' (...822dc4)

Object Detection (YOLOv5 6.2):  [AttributeError] : Traceback (most recent call last):
  File "C:\Program Files\CodeProject\AI\modules\ObjectDetectionYOLOv5-6.2\detect.py", line 140, in do_detection
    det                  = detector(img, size=640)
  File "C:\Program Files\CodeProject\AI\runtimes\bin\windows\python37\venv\lib\site-packages\torch\nn\modules\module.py", line 1190, in _call_impl
    return forward_call(*input, **kwargs)
  File "C:\Program Files\CodeProject\AI\runtimes\bin\windows\python37\venv\lib\site-packages\torch\autograd\grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "C:\Program Files\CodeProject\AI\runtimes\bin\windows\python37\venv\lib\site-packages\yolov5\models\common.py", line 689, in forward
    if im.shape[0] < 5:  # image in CHW
AttributeError: 'NoneType' object has no attribute 'shape'
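
The last frame suggests the image reached the model as `None`. A minimal sketch of that failure mode and of a guard in front of the call, assuming nothing about how `detect.py` actually loads the image: `fake_detector` and `safe_detect` are hypothetical stand-ins, not the module's real code.

```python
def fake_detector(im, size=640):
    """Stand-in for the model call; touches im.shape like the frame in the traceback."""
    if im.shape[0] < 5:  # same attribute access that fails in the traceback
        return "CHW"
    return "HWC"


def safe_detect(detector, img):
    """Hypothetical guard: refuse to call the detector when no image was decoded."""
    if img is None:
        return {"success": False, "error": "No image data received"}
    return {"success": True, "layout": detector(img, size=640)}


# Reproduce the failure mode: a missing image reaches the detector as None.
try:
    fake_detector(None)
    reproduced = None
except AttributeError as exc:
    reproduced = str(exc)

print(reproduced)

# With the guard, the same input yields a readable error instead of a traceback.
guarded = safe_detect(fake_detector, None)
print(guarded)
```

If that reading is right, both modules are hitting the same condition (no image data arrived) and only the error reporting differs.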

So it seems like there may be an issue with one server uploading the file to the other server (both are on the same 10gig network).

I'm throwing a lot of 4K, 2 MB files at it, mixed in with some lower-resolution images, so maybe that's a factor in why it doesn't happen on every single call.

I haven't really fleshed this out yet, but I think I need to restart the whole CodeProject.AI service after the error to get the mesh to work correctly again.

What else can we do to troubleshoot this?
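
One way to narrow this down might be to send a fixed set of images with mesh on and off, then tally failures by responding machine and file size. A rough sketch follows. The URL (default port 32168, route `/v1/vision/detection`) and the form field name `image` are assumptions about a default install, and `post_image`, `tally`, and `fake_send` are hypothetical helpers. The demo uses a fake sender with a made-up failure rule so it runs offline.

```python
import json
import urllib.error
import urllib.request
import uuid
from collections import Counter

# Assumption: default CodeProject.AI port and detection route; adjust to your setup.
DEFAULT_URL = "http://localhost:32168/v1/vision/detection"


def post_image(url, name, payload, timeout=60):
    """POST one image as multipart/form-data (field name 'image' is an assumption)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="image"; filename="{name}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    body = head + payload + f"\r\n--{boundary}--\r\n".encode()
    req = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return json.loads(resp.read())
    except urllib.error.HTTPError as err:
        # Error statuses may still carry a JSON body worth recording.
        return json.loads(err.read())


def tally(images, send):
    """Send each (name, bytes) pair and count outcomes per responding machine."""
    counts = Counter()
    failures = []
    for name, payload in images:
        reply = send(name, payload)
        ok = bool(reply.get("success"))
        counts[(reply.get("processedBy"), ok)] += 1
        if not ok:
            failures.append((name, len(payload), reply.get("error")))
    return counts, failures


def fake_send(name, payload):
    """Offline stand-in with a made-up rule: payloads over 1 MB fail."""
    if len(payload) > 1_000_000:
        return {
            "success": False,
            "error": "No File supplied for object detection.",
            "processedBy": "PC-B",
        }
    return {"success": True, "processedBy": "PC-A"}


sample = [("small.jpg", b"x" * 1000), ("large.jpg", b"x" * 2_000_000)]
counts, failures = tally(sample, fake_send)
print(dict(counts))
print(failures)
```

Against a real server, replace `fake_send` with `lambda name, data: post_image(DEFAULT_URL, name, data)` and read the image bytes from disk. Comparing the tallies with mesh enabled and disabled should show whether failures follow file size, the machine in `processedBy`, or neither.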

My system:

  • CodeProject.AI Server version: 2.9.3
  • OS: Windows 10 [NO FIREWALL, out of the box AV, 10gig wired connection between devices]
  • System RAM 128 GB
  • GPU, machine 1: NVIDIA T1000 (8 GB)
  • GPU, machine 2: NVIDIA GeForce RTX 3080 (12 GiB)

Labels

Can't Replicate: Unable to replicate this issue
