OCROperator

This tool provides functionality to automatically read pdf's that were previously created without text recognition, extract the text and pass this data to the actions that will process the data that.

Tech Stack

Framework:

Server:

Demo

Run Locally to Develop

Clone the project

  git clone https://github.com/generalsle1n/OCROperator

Go to the project directory

  cd OCROperator\OCROperator

Install dependencies

  dotnet restore

Start the server

  dotnet run

Installation for Production Usage

Publish the project with the following settings

Config: Release
TargetFramework: .Net 6
Deployment: SelfContained
Singlefile: false
ReadyToRun: false
Remove not used Code: false

  sc.exe create "OCROperator" binpath="C:\Path\To\OCROperator.exe"

Roadmap

Add an Microsoft Busines Central Action
Add more watchers (Database, Mailbox, eg.)

Lessons Learned

Used mostly Interfaces to allowes multiple classes that can define actions over an user controlled config file

Feedback

If you have any feedback, please open an issue or an pull request 😀

Features

This tool searchs asynchron for pdf and metadata that are specifid in an watcher config. If the tool finds some the tools process the pdf with the ocr enginge tesseract to get the text and then the file with the text is transferd to an action which can do any stuff with the pdf (Upload to an ticketsystem, erp, crm or so on) Currently it only works in connection with papercut mf, because papercut generate an metadata json which is processed

Watchers

Currently there is only one Watcher implemented:

FileSystem

This watchers looks in an folder for the pdf

Actions

FileToZammad

This action trys to extract an ticketnumber and then upload the pdf to this ticket. If no ticket is found then it create an empty ticket

The setting is splitted by an ;

Zammad URL
API Token
UserID (In the example the 1)

So the string must look like: https://zammadserver.com;SECRET;1

FileToUser

This actions send the extracted text to the user mail that is specifid in the metadata json from papercut. The mail settings are specifid via the mailfactory solution

FAQ

Can i add by my own an custom action?

Yeah sure --> If you think its good and create an pull request to merge it

Will there be an implementation without papercut?

Currently its not planned, but if you want to implement it just open an pull request too

Configuration

To run this project, you will need to add the following config variables to your appsettings.json file. All important settings are in "Watchers" There is an example config in the repo

Connectors

Destination: The Path where the watcher should look to get the pdf string

SuffixMetadata: The suffix pattern to look for the metadata string

ActionType: The binary Type for the action, possible values: string

OCROperator.Models.Interface.Action.FileToFixedEmail
OCROperator.Models.Interface.Action.FileToUserEmail
OCROperator.Models.Interface.Action.FileToZammad

ActionSettings: Enter the custom settings for the action int

Type: The binary Type for the watcher , possible values: string

OCROperator.Models.Interface.FileSystem

Language: Enter the OCR Langaue, possible values: string

deu (German)
eng (English)
spa (spanish)

HoldPDF: Decide if the pdf after the process is fisnihed should be hold and not deleted bool

MailFactory

SMTPServer: Enter the name of the smtp server which should be used

Port: Enter the port for the smtp server

GenerateFrom: The from mail, which should be used

Authors

@Niels Schuler

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
OCROperator		OCROperator
blob		blob
.gitattributes		.gitattributes
.gitignore		.gitignore
OCROperator.sln		OCROperator.sln
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCROperator

Tech Stack

Demo

Run Locally to Develop

Installation for Production Usage

Roadmap

Lessons Learned

Feedback

Features

Watchers

FileSystem

Actions

FileToZammad

FileToUser

FAQ

Can i add by my own an custom action?

Will there be an implementation without papercut?

Configuration

Connectors

MailFactory

Authors

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

generalsle1n/OCROperator

Folders and files

Latest commit

History

Repository files navigation

OCROperator

Tech Stack

Demo

Run Locally to Develop

Installation for Production Usage

Roadmap

Lessons Learned

Feedback

Features

Watchers

FileSystem

Actions

FileToZammad

FileToUser

FAQ

Can i add by my own an custom action?

Will there be an implementation without papercut?

Configuration

Connectors

MailFactory

Authors

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages