AI Agent Factory: Intelligent Document Processing Platform (Internship Project Showcase)

This repository serves as a detailed showcase of the comprehensive AI Agent and Retrieval-Augmented Generation (RAG) platform I co-developed during my summer 2025 internship at Abysalto. The platform is designed to create, configure, and deploy specialized AI agents capable of intelligent document processing and complex reasoning.

Note: The source code for this project is proprietary to Abysalto and cannot be shared publicly. This repository instead documents the project's architecture, my technical contributions, and the results achieved.

---

My Role & Key Contributions

Official team structure from our final presentation, highlighting my role on the RAG Studio team.

As an AI Academy Intern, I worked across the full development lifecycle, from architectural design to hands-on implementation, with a focus on building a system that was scalable, reliable, and extensible as well as functional.

System Architecture: I took a leading role in key architectural decisions, championing the use of the Strategy design pattern to create a modular system. This allowed for pluggable components for RAG methods, document chunking, and evaluation, which accelerated our experimentation cycles by 3x.
RAG System Implementation: I designed and implemented an extensible Retrieval-Augmented Generation (RAG) system, developing several of the nine specialized retrieval methods. This core system achieved 95%+ retrieval precision in our internal testing benchmarks.
API & Microservices: I developed robust microservices using Python and FastAPI, creating over 50 RESTful endpoints for comprehensive system control, from document ingestion to agent interaction.
Database Optimization: I optimized PostgreSQL queries and designed dynamic table generation logic that automatically adapted to different embedding model dimensions, ensuring efficient vector operations with pgvector.
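To illustrate the dynamic table generation described above, here is a minimal sketch of how a table definition could be derived from an embedding model's dimension. The function name, table naming scheme, and column layout are hypothetical and are not taken from the proprietary code; only the `vector(n)` column type comes from pgvector itself.

```python
import re


def build_embedding_table_ddl(model_name: str, dimension: int) -> str:
    """Return DDL for a chunk table whose vector column matches the model's dimension.

    Hypothetical helper: the real platform's schema and naming are not public.
    """
    # Derive a safe, lowercase identifier from the model name.
    suffix = re.sub(r"[^a-z0-9]+", "_", model_name.lower()).strip("_")
    if not suffix:
        raise ValueError("model_name must contain at least one letter or digit")
    if dimension <= 0:
        raise ValueError("dimension must be a positive integer")

    table = f"chunks_{suffix}"
    # pgvector declares fixed-size vector columns as vector(<dimension>).
    return (
        f"CREATE TABLE IF NOT EXISTS {table} (\n"
        "    id BIGSERIAL PRIMARY KEY,\n"
        "    document_id BIGINT NOT NULL,\n"
        "    content TEXT NOT NULL,\n"
        f"    embedding vector({dimension}) NOT NULL\n"
        ");"
    )


if __name__ == "__main__":
    # Example: a 1536-dimensional embedding model.
    print(build_embedding_table_ddl("text-embedding-3-small", 1536))
```

Generating one table per model keeps each vector column at a fixed size, so switching embedding models does not require altering existing tables.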

Technical Deep Dive: Architecture & Design

The platform combines a three-tier system architecture, the Strategy design pattern, and a modular RAG pipeline. The sections below cover each in turn.

1. High-Level System Architecture

The system follows a standard three-tier architecture, with a clear separation between the frontend, a robust backend handling all business logic, and the database. The backend also acts as an orchestration layer for external services like the OpenAI API.

2. The Strategy Design Pattern

A core architectural decision was the extensive use of the Strategy pattern (Oblikovni obrazac Strategija). In the diagram, the main Service (Servis) uses a common Interface (Sučelje) to interact with interchangeable strategies (Strategija1, 2, 3). This decoupled the core logic from the specific implementations of our RAG and chunking methods, so a method could be swapped or added without changing the service that calls it.

Diagram illustrating how the Strategy pattern was applied to the core components of the RAG system.
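The Service, Interface, and Strategy roles from the diagram can be sketched in Python as follows. This is an illustrative example applied to document chunking; the class names and the two chunking strategies are assumptions chosen for the sketch, not the platform's actual classes.

```python
from typing import Protocol


class ChunkingStrategy(Protocol):
    """The common Interface (Sučelje) every strategy must satisfy."""

    def chunk(self, text: str) -> list[str]: ...


class FixedSizeChunking:
    """Hypothetical strategy: split text into pieces of at most `size` characters."""

    def __init__(self, size: int) -> None:
        if size <= 0:
            raise ValueError("size must be positive")
        self.size = size

    def chunk(self, text: str) -> list[str]:
        return [text[i:i + self.size] for i in range(0, len(text), self.size)]


class ParagraphChunking:
    """Hypothetical strategy: split text on blank lines."""

    def chunk(self, text: str) -> list[str]:
        return [part.strip() for part in text.split("\n\n") if part.strip()]


class IngestionService:
    """The Service (Servis): depends only on the interface, never on a concrete strategy."""

    def __init__(self, strategy: ChunkingStrategy) -> None:
        self._strategy = strategy

    def set_strategy(self, strategy: ChunkingStrategy) -> None:
        # Swap the behaviour at runtime without touching the service logic.
        self._strategy = strategy

    def ingest(self, text: str) -> list[str]:
        return self._strategy.chunk(text)


if __name__ == "__main__":
    service = IngestionService(FixedSizeChunking(size=4))
    print(service.ingest("abcdefghij"))

    service.set_strategy(ParagraphChunking())
    print(service.ingest("First paragraph.\n\nSecond paragraph."))
```

Adding a new chunking or retrieval method under this design means writing one new class that satisfies the interface; the service itself stays unchanged.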

3. Advanced Retrieval-Augmented Generation (RAG) System

The core of the project was its sophisticated and highly modular RAG system. The diagram below, from our final presentation, illustrates the complete data lifecycle from document ingestion to agent response.

This diagram, titled 'Our Glossary' (Naš rječnik), shows the full workflow. A File is processed via a selected Chunking method (Chunking metoda) and stored in the database, whose schema (Shema baze podataka) is created for the chosen Embedding model. On retrieval, a query is handled by a specific RAG Method (Rag metoda), and its performance is measured by an Evaluation (Evaluacija) process. The entire system is orchestrated by an Agent, which can also leverage external tools (Alat).

This architecture allowed for a high degree of control and optimization at each step:

Multi-Vector Embeddings: We stored separate embeddings for document content, generated summaries, and potential questions. This approach, visible in the Shema baze podataka (Database Schema) component, enabled more nuanced and accurate semantic search.
Specialized Retrieval Methods: The system featured nine distinct retrieval strategies (RAG metode), allowing users to select the optimal method for their specific use case. These included strategies like Multi-Query RAG (using LLMs for query expansion), Hybrid Meta RAG (combining semantic search with keyword filtering), and Graph-Walk RAG (exploring relationships between document chunks).
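As a rough illustration of the multi-vector idea, the sketch below scores each chunk by comparing the query against its content, summary, and question embeddings and keeping the best match. Taking the maximum similarity is an assumption made for this example; the source does not state how the platform combined the three embeddings, and the `ChunkRecord` type and function names are hypothetical. In production this comparison would run inside PostgreSQL with pgvector rather than in Python.

```python
import math
from dataclasses import dataclass

Vector = list[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class ChunkRecord:
    """Hypothetical record holding three embeddings per chunk."""

    chunk_id: str
    content_vec: Vector
    summary_vec: Vector
    question_vec: Vector


def multi_vector_search(
    query_vec: Vector, records: list[ChunkRecord], top_k: int = 3
) -> list[tuple[str, float]]:
    """Rank chunks by their best similarity across all three embeddings."""
    scored = []
    for record in records:
        # Assumption: a chunk is as relevant as its best-matching embedding.
        best = max(
            cosine(query_vec, vec)
            for vec in (record.content_vec, record.summary_vec, record.question_vec)
        )
        scored.append((record.chunk_id, best))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    records = [
        ChunkRecord("c1", [1.0, 0.0], [1.0, 0.0], [1.0, 0.0]),
        ChunkRecord("c2", [0.0, 1.0], [0.0, 1.0], [0.0, 1.0]),
        # c3 matches the query only through its "potential question" embedding.
        ChunkRecord("c3", [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]),
    ]
    for chunk_id, score in multi_vector_search([1.0, 0.0], records, top_k=2):
        print(chunk_id, round(score, 3))
```

The third record shows the benefit: a chunk whose raw content is far from the query can still be retrieved because a generated question about it is close.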

Project in Action: User Interface

While my focus was on the backend, our team developed a comprehensive UI for configuration and testing. Below is a screenshot of the Evaluation dashboard, which allowed us to benchmark the performance and relevance of different RAG method configurations.

The test results dashboard, showing performance metrics for different RAG methods.
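For readers unfamiliar with retrieval benchmarking, one common metric is precision at k: the fraction of the top k retrieved chunks that are actually relevant. The sketch below shows that standard definition; it is an assumption that the dashboard used a metric of this kind, since the source does not specify how its scores were computed, and the chunk IDs are made up.

```python
def precision_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """Fraction of the top-k retrieved chunk IDs that appear in the relevant set."""
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = retrieved[:k]
    hits = sum(1 for chunk_id in top_k if chunk_id in relevant)
    # Dividing by k (not len(top_k)) penalises methods that return too few results.
    return hits / k


if __name__ == "__main__":
    # Hypothetical ranked output of one RAG method and the labelled ground truth.
    retrieved = ["a", "b", "c", "d"]
    relevant = {"a", "c", "x"}
    print(precision_at_k(retrieved, relevant, k=4))
```

Running the same labelled queries through each RAG method and comparing the resulting scores gives a like-for-like basis for choosing between configurations.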

The "Agent Playground" user interface, which was powered by the backend services I developed. It demonstrates the core chat functionality with a test agent in Croatian, where the user asks "Kako si danas?" ("How are you today?").

---

Tech Stack

Backend: Python, FastAPI, SQLAlchemy
AI/ML: LangChain, LangGraph, OpenAI API
Database: PostgreSQL with pgvector extension
Frontend: React, Bootstrap
Infrastructure: Docker
