This solution is intended to be used as a reference architecture and it is not a production ready application.
This version allows you to automate the redaction of PDF documents using Google Cloud using serverless technology such as CloudRun, Workflows, Data Loss Prevention (DLP), and more.
It supports multi-page PDFs, it processes each page individually for optimal performance, assembles back the PDF keeping search and highlighting capabilities, and writes the DLP findings into a BigQuery table.