Skip to content

Commit

Permalink
More readme updates
Browse files Browse the repository at this point in the history
  • Loading branch information
rafelafrance committed Mar 10, 2024
1 parent 427a81e commit d0581e9
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ cd /path/to/label_finder
make install
```

You will also need to install [YOLO7](https://github.com/WongKinYiu/yolov7). You may follow the instructions given there. I find that if you clone the repository, set up a virtual environment for that repository, and install the requirements into that virtual environment things work just fine.
You will also need to install [YOLO7](https://github.com/WongKinYiu/yolov7). You may follow the instructions given there. I find that if you clone the repository, set up a virtual environment for that repository, and install the requirements into that virtual environment, things work just fine.

Every time you want to run any scripts in a new terminal session you will need to activate the virtual environment, once, before running them.

Expand Down Expand Up @@ -114,9 +114,9 @@ If the sheet is named: `248106.jpg`, then a label may be named `248106_Typewritt

### Optional: Filter typewritten labels

This moves all labels that are classified as "Typewritten" into a separate directory. The OCR works best on typewritten labels or barcodes with printing. It will do a fair job when labels with handwriting or a mix of typewriting & handwriting if they are neatly printed.
This moves all labels that are classified as "Typewritten" into a separate directory. The OCR works best on typewritten labels or barcodes with printing. It will do a fair job with handwritten labels if the handwriting is neatly printed.

I have noticed that the current example YOLO model tends to have a fair number of false positives but close to zero false negatives. Manually pruning the false positives is much easier than sorting all labels. YMMV.
I have noticed that the current example YOLO model (v0.2.0) tends to have a fair number of false positives but close to zero false negatives. Manually pruning the false positives is much easier than sorting all labels. YMMV.

#### Example

Expand Down
2 changes: 1 addition & 1 deletion finder/pylib/box_calc.py
Original file line number Diff line number Diff line change
Expand Up @@ -60,7 +60,7 @@ def find_box_groups(boxes: npt.NDArray, threshold: float = 0.8) -> npt.ArrayLike
overlapping[curr] = group

found = True # Do we need to look for more overlapping boxes
start = 0 # Used to skip repeated searches
start = 0 # Skip already searched boxes

# Every time we find new matches we need to check the new ones against the rest
while found:
Expand Down

0 comments on commit d0581e9

Please sign in to comment.