Skip to content

zen-ham/Search-OCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Digests all your image files into a pickle cluster with a fast searchengine GUI:

 day 5 release!


Technical details:

The software is able to read a wide range of file types, many more then PIL supports for example. File types such as SVG and EPS are supported, along with almost every single other image format.

For quick indexing I implemented a recursive subdirectory file list function in my library that uses multithreading, multiprocessing (ray), and cache based optimizations, in order to index millions of files in ~2.0 seconds, to keep the boot time of the software small.

For optimizations in the search engine and in the actual processing pipeline of the files, the ray wrapper in my library was used extensively, this brought search times and rendering from 20+ seconds to ~0.5 seconds.

About

Local OCR indexer and search engine, digests all image files on the system into a pickle cluster and allows key word or sentence searching in real time across 100K+ files.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages