Computer Science student focused on Data Automation, Audit Analytics, and Data Quality.
I build data automation workflows that turn messy PDFs, spreadsheets, CSVs, and internal databases into auditable validations, exception reports, and traceable business controls.
| Focus | Working With |
|---|---|
| Data automation | Python, pandas, openpyxl, PDF and spreadsheet extraction |
| Data validation | SQL, PostgreSQL, SQLite, data quality checks, business rules |
| Engineering workflow | Git, GitHub, documentation, reproducible scripts |
| Learning next | Docker, dbt, FastAPI, REST APIs, AWS basics |
- Continuous audit routines for benefits, payroll-related checks, and internal controls.
- Python and SQL workflows for extracting, cleaning, matching, and validating operational data.
- Exception reports that help business teams review inconsistencies with traceability.
- A path from audit analytics into analytics engineering, data quality, and data engineering.
Currently working as an Audit & Risk / Compliance intern at a hospitality company, focused on continuous auditing, benefits validation, and process automation.
| Now | Next |
|---|---|
| Advanced SQL | dbt |
| PostgreSQL hands-on | REST APIs / FastAPI |
| Professional Git workflow | AWS basics |
| Docker basics | Public synthetic data project |
continuous-audit-toolkit is the public project I plan to build from fully synthetic data.
It will simulate the kind of operational data mess that shows up in real companies:
- inconsistent identifiers, names, dates, and value formats;
- changing vendor file layouts across months;
- duplicated records, missing eligible people, total rows, and false positives;
- standardized outputs for validation, review status, and audit traceability.
Audit Analytics -> Analytics Engineering -> Data Operations -> Data Engineering
I want to work on technical data systems that make business processes more reliable: pipelines, validation layers, data quality checks, automation tools, and traceable reporting workflows.
- Recife, Brazil
- B.Sc. Computer Science at CESAR School, expected December 2027
- Portuguese native, English fluent, German A2.2
- LinkedIn: linkedin.com/in/yuri--cavalcanti
