Skip to content

Latest commit

 

History

History
50 lines (28 loc) · 3.34 KB

README.md

File metadata and controls

50 lines (28 loc) · 3.34 KB

Form Recognizer Solution Accelerator

Accelerate your Form Recognizer solution to production with this Solution Accelerator, which leverages an Azure Function and a set of Logic Apps to split multi-page PDF files to single-page PDF files and sends individual PDF files to the REST API endpoint of a trained custom document model in Form Recognizer.

Architecture

This solution implements two capabilities that are commonly required when working with a trained custom document model:

  1. Splitting multi-page PDF documents into individual, single-page PDF documents
  2. Analyzing the results of documents sent to the Form Recognizer REST API endpoint of a trained custom document model

Please reference this blog post for detailed, step-by-step instructions for how to implement this solution. We are also actively working on organizing the same step-by-step instructions in this repository.


Using the below button, six Azure services will be deployed:

  • Storage account
  • Function app
  • App Service plan
  • Form Recognizer
  • Logic app (x2)

Deploy to Azure

Download sample data from this repository and upload it into the new containers you create.

Open the Form Recognizer Studio and train a custom document model.

Deploy open-source Python code to your Function App to split multi-page PDF files.

Create a Logic App to call your Azure Function App and save individual PDF files based on a multi-page PDF file input.

Leverage the REST API endpoint of a trained custom document model in Form Recognizer.

Upload a multi-page PDF file and verify that the first Logic App produces single-page PDF files. Then, verify that the second Logic App sends each file to the custom model endpoint in Form Recognizer and saves the resulting JSON.