[OV JS] Add optical-character-recognition sample notebook #25191

qxprakash · 2024-06-24T19:22:54Z

Details:

added code in node notebook
updated the samples list in readme

Workarounds which still needs to be worked upon

couldn't find the js equivalent method for cv2.getTextSize() (opencv python method) which is used for getting the height and width of the crop text the opencv-wasm package does not have this api , in the current implementation I have written a custom function getTextSize which uses canvas to get the width and height of the text
text-recognition-resnet-fc model IR was larger in size it was around 355MB hence I did not included it in my PR

Please provide Feedback @Aliczi @vishniakov-nikolai

With Regards
Prakash

vishniakov-nikolai

Hi @qxprakash!
Because it's hard to review notebooks, I split changes by group. In the case when target changes needed I refer to cell and line:

Codestyle remarks

Two spaces as offset
Empty line before return
Try to do not exeed 80 symbols in line (long urls and paths may be an exception)
Sort imports: system libs (fs, path, ...), ex packages (canvas, openvino-node, ...), project files (helpers.js, ...)
One empty line at the end of code block
No more than two empty lines in the row
Remove extra comments from code blocks, check that useful comments start from capital letter
Make an order in variable names (recModel, recCompiled, recogInputLayer, etc). Name should answer what's inside. Do not hesitate using long names. In your case there are two models. Avoid common names like compiledModel to do not mix them.
Remove any extra commented code
Remove extra blocks

Common

Specify device as 'AUTO' (will be fixed in the rest of notebooks as well)
Each word in subheaders should start from capital letter (ex: Prepare Image for inference = Prepare Image for Inference)
I propose to fix input image colors (that shows in notebook). Now it shows with replaced channels.
Also, propose to keep original comments from Python notebook in the same places
Use async version of functions like inferAsync instead of infer, etc

Targeted remarks

Prepare Image for inference block, line 18:
There is no need to wrap it as Int32Array

const tensor = new ov.Tensor(ov.element.f32, inputLayer.shape, tensorData);

Define Post-Processing functions block, line 22
Extract function that uses in map
Define Post-Processing functions block, line 31
Why let not const?

vishniakov-nikolai · 2024-07-04T12:13:19Z

Please, use this link to download text recognition model: https://storage.openvinotoolkit.org/repositories/open_model_zoo/public/text-recognition-resnet-fc/

…tale comments and logs

qxprakash · 2024-07-08T21:00:31Z

vishniakov-nikolai · 2024-07-09T13:42:50Z

@qxprakash thank you. Now it works without any addition.
Below list of unresolved remarks:

Codestyle remarks

Comments should start from capital letter
Merge two last code cells or add header for the last one
Use comma for the last parameter of function call in the case when
bracket at the next line after parameter
System modules like fs and path require as node:fs and node:path
Avoid space before comma (saw in comment)
Empty line before return
(exception: when funtion contains only one line with return, there is no need to add empty line)
Prefer function instead of arrow functions in variable (not const fn = () => 'result';)
Two spaces as offset

Code remarks

preprocessImage unclear name for fn, there is no need to put property in
named variable to use it once (ex: resizedImage)
runPreprocessingOnCrop unclear function name

General

Please add similar description that Python sample has (without table of content),
adopt description like in another js notebooks

qxprakash · 2024-07-09T19:58:49Z

vishniakov-nikolai · 2024-07-10T11:54:27Z

Looks good!

Remarks:

Just Download Models
Load a Detection Model
Just Do Inference
Variable boxesWithAnnotations is let and it's value specified inside processBoundingBoxes function.
It's not a good pattern. I propose set it as const inside function and return value from processBoundingBoxes to expose it value to global scope.
Extra offset
Useless comment
"Magic number", please, name it
Semicolon is extra }; after function body, but space after parameters is needed ) {
Propose simplify this part like that:

const detInferRequest = detCompiledModel.createInferRequest();
const detResult = await detInferRequest.inferAsync([tensor]);
const boundingBoxesArray = extractBoundingBoxes(detResult[detOutputLayer]);

// Show original image
displayArrayAsImage(displayImageMat.data,
  displayImageMat.cols,
  displayImageMat.rows,
  display,
);

What's the point to do 2 of these actions by once? Propose to make only coefficient calculation here.

When I told about extract, I meant that you can keep it near, like that:

// Make slice on call:
// multiplyByRatio(ratioX, ratioY, box.slice(0, -1))
// or even better put trimmed set to named variable

// Function to adjust bounding box coordinates by a given ratio
function multiplyByRatio(ratioX, ratioY, box) {
  const scaleShape = (shape, idx) => idx % 2
    ? Math.max(shape * ratioY, 10)
    : shape * ratioX;

  return box.map(scaleShape);
}

Didn't address fully: "Use a comma for the last parameter of a function call when
the bracket is on the next line after the parameter"

Immediate return:

// Function to extract recognition results from the model output
function extractRecognitionResults(output) {
  const outputData = output.getData();
  const outputShape = output.getShape();
  const [batchSize, height, width] = outputShape;

  return setShape(outputData, [height, width]);
}

Use early return:

if (conf <= threshold) return;

// Print labels

Same here
What the point to make this function? I propose extract its functional to the body of the cell.
You don't need variable if you don't use it

qxprakash · 2024-07-11T06:18:59Z

Hi @vishniakov-nikolai thanks for your feedback , I have made the changes suggested by you

I have few things to ask -- Semicolon is extra }; after function body, but space after parameters is needed ) {
you mean , function testFunc(param1, param2 ) , space after param or space after closing backets ?
in extractBoundingBoxes I have named the variable as foldingCoefficient is that okay ?
do I also have to split resizeAndConvertCropToModelInput into two functions ?

please let me know if any other changes are needed

with regards
Prakash

qxprakash · 2024-07-18T16:23:52Z

@vishniakov-nikolai I have fixed the remaning code formatting issues , went through the entire sample again to verify the offset , let me know if any other changes are required.

vishniakov-nikolai

LGTM

vishniakov-nikolai · 2024-07-25T22:38:10Z

build_jenkins

added ocr sample and updated readme

44812c9

qxprakash requested review from a team as code owners June 24, 2024 19:22

qxprakash requested review from zKulesza and removed request for a team June 24, 2024 19:22

github-actions bot added category: samples OpenVINO Runtime Samples category: docs OpenVINO documentation labels Jun 24, 2024

sys-openvino-ci added the ExternalPR External contributor label Jun 24, 2024

vishniakov-nikolai added the gsoc Google Summer of Code related discussion label Jun 24, 2024

vishniakov-nikolai self-assigned this Jun 24, 2024

path fixes

e0e8327

vishniakov-nikolai requested changes Jul 3, 2024

View reviewed changes

qxprakash added 7 commits July 5, 2024 18:05

fixes

24dec7a

download detection model , add utility function scaleShape , remove s…

e6440e9

…tale comments and logs

formatting fixes

788a8b5

name fixes

a0e3362

replaced infer with inferAsync

32b0e07

formatting fixes

ec1ca11

formatting fixes

d4b5202

remove extra line

57a6f6a

qxprakash added 2 commits July 10, 2024 01:11

Codestyle remarks fixes

8fe8d4c

Code remarks fixes

5dfb7eb

suggested fixes

d0a9b06

qxprakash requested a review from vishniakov-nikolai July 11, 2024 06:20

formatting fixes

e70b89e

qxprakash changed the title ~~[OV JS] Add optical-character-recognition sample~~ [OV JS] Add optical-character-recognition sample notebook Jul 19, 2024

remove the first cell error

00c750a

vishniakov-nikolai approved these changes Jul 22, 2024

View reviewed changes

qxprakash and others added 2 commits July 25, 2024 00:37

imports formatting

15a5d43

Merge branch 'master' into ocr-sample

b5e3411

vishniakov-nikolai enabled auto-merge July 25, 2024 18:53

vishniakov-nikolai added this pull request to the merge queue Jul 26, 2024

Merged via the queue into openvinotoolkit:master with commit 4a5bd43 Jul 26, 2024
123 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OV JS] Add optical-character-recognition sample notebook #25191

[OV JS] Add optical-character-recognition sample notebook #25191

qxprakash commented Jun 24, 2024 •

edited

Loading

vishniakov-nikolai left a comment

vishniakov-nikolai commented Jul 4, 2024

qxprakash commented Jul 8, 2024

vishniakov-nikolai commented Jul 9, 2024

qxprakash commented Jul 9, 2024

vishniakov-nikolai commented Jul 10, 2024

qxprakash commented Jul 11, 2024

qxprakash commented Jul 18, 2024

vishniakov-nikolai left a comment

vishniakov-nikolai commented Jul 25, 2024

[OV JS] Add optical-character-recognition sample notebook #25191

[OV JS] Add optical-character-recognition sample notebook #25191

Conversation

qxprakash commented Jun 24, 2024 • edited Loading

Details:

Workarounds which still needs to be worked upon

vishniakov-nikolai left a comment

Choose a reason for hiding this comment

Codestyle remarks

Common

Targeted remarks

vishniakov-nikolai commented Jul 4, 2024

qxprakash commented Jul 8, 2024

Codestyle remarks

vishniakov-nikolai commented Jul 9, 2024

Codestyle remarks

Code remarks

General

qxprakash commented Jul 9, 2024

Codestyle Remarks

Code remarks

General

vishniakov-nikolai commented Jul 10, 2024

qxprakash commented Jul 11, 2024

qxprakash commented Jul 18, 2024

vishniakov-nikolai left a comment

Choose a reason for hiding this comment

vishniakov-nikolai commented Jul 25, 2024

qxprakash commented Jun 24, 2024 •

edited

Loading