AI Study

A study tool that converts PDFs into interactive flashcards using RAG and Google Gemini.

Overview

AI Study leverages Google Gemini and Retrieval-Augmented Generation (RAG) to transform PDF documents into personalized study materials. By ingesting content and retrieving relevant context, it generates high-quality multiple-choice questions for effective self-testing.

Key Capabilities:

RAG-Powered Generation: Chunks and embeds PDFs to ground AI generation in your specific source material.
Smart Flashcards: Automatically identifies key concepts and creates questions with explanations.
Study Mode: Interactive testing with instant feedback and progress tracking.

Note: Currently supports text-based PDFs. Image-based PDFs require OCR integration (future enhancement).

🏗 System Architecture & RAG Data Flow

The system uses a multi-agent architecture powered by Google Vertex AI.

Architecture Diagrams

Diagrams: System Architecture, Agent Details, Data Flow, Workflow Sequence, RAG Sequence.

RAG Implementation Details

The core of AI Study is its RAG pipeline, ensuring accuracy and relevance:

Ingestion: Uploaded PDFs are processed by the PDFs Module, which extracts the text and splits it into semantic chunks.
Embedding: Chunks are embedded using Vertex AI text-embedding-004 and stored in PostgreSQL with pgvector (via Prisma).
Retrieval: When generating questions or answering chat queries, the system performs a cosine similarity search to find the most relevant chunks.
Generation: Retrieved chunks are injected into the prompt context for Gemini 2.5 Flash, enabling the model to answer based on the document's specific content.
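
To make the retrieval step concrete, here is a minimal in-memory sketch of chunking and cosine-similarity ranking. It is an illustration under assumptions, not the project's code: the names `chunkText`, `cosineSimilarity`, and `topK` are hypothetical, the word-window chunker is a simple stand-in for semantic chunking, and embeddings are plain number arrays rather than Vertex AI vectors stored in pgvector.

```typescript
// Illustrative only: the real pipeline embeds with Vertex AI and searches in
// pgvector. Here embeddings are plain arrays and everything runs in memory.

interface EmbeddedChunk {
  text: string;
  embedding: number[];
}

// Split text into fixed-size word windows that overlap by `overlap` words.
// (A simple stand-in for the semantic chunking described above.)
function chunkText(text: string, size: number, overlap: number): string[] {
  if (size <= 0 || overlap < 0 || overlap >= size) {
    throw new Error("size must be > 0 and 0 <= overlap < size");
  }
  const words = text.split(/\s+/).filter((w) => w.length > 0);
  const chunks: string[] = [];
  for (let start = 0; start < words.length; start += size - overlap) {
    chunks.push(words.slice(start, start + size).join(" "));
    if (start + size >= words.length) break; // last window reached the end
  }
  return chunks;
}

// Cosine similarity: dot(a, b) / (|a| * |b|). Returns 0 for a zero vector.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Rank chunks by similarity to the query embedding and keep the best k.
function topK(query: number[], chunks: EmbeddedChunk[], k: number): EmbeddedChunk[] {
  return chunks
    .map((chunk) => ({ chunk, score: cosineSimilarity(query, chunk.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((scored) => scored.chunk);
}
```

In pgvector, the `<=>` operator returns cosine distance (1 minus cosine similarity), so ordering by it in ascending order gives the same ranking as `topK` above.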

Multi-Agent Workflow

Content Analyzer: Scans the PDF (or retrieved chunks) to identify learning objectives.
Question Generator: Creates questions targeting those objectives, referencing specific content.
Quality Analyzer: Validates the educational value and accuracy of generated cards.
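
The three stages above can be pictured as a sequential pipeline. The sketch below is hypothetical: the `Flashcard` shape, the `Agents` interface, and `runWorkflow` are assumptions made for illustration, and `hasValidShape` checks structure only, whereas the real Quality Analyzer judges educational value and accuracy.

```typescript
// Hypothetical card shape; the project's actual schema is not shown here.
interface Flashcard {
  objective: string;
  question: string;
  options: string[];
  answerIndex: number;
  explanation: string;
}

// Each stage is async so that a model call could sit behind it.
interface Agents {
  analyzeContent: (chunks: string[]) => Promise<string[]>;
  generateQuestions: (objectives: string[], chunks: string[]) => Promise<Flashcard[]>;
  isAcceptable: (card: Flashcard) => Promise<boolean>;
}

// Structural sanity check: non-empty text, at least two options, and an
// answer index that points at one of them.
function hasValidShape(card: Flashcard): boolean {
  return (
    card.question.trim().length > 0 &&
    card.explanation.trim().length > 0 &&
    card.options.length >= 2 &&
    Number.isInteger(card.answerIndex) &&
    card.answerIndex >= 0 &&
    card.answerIndex < card.options.length
  );
}

// Content Analyzer -> Question Generator -> Quality Analyzer, in order.
// Cards rejected by the last stage are dropped from the result.
async function runWorkflow(chunks: string[], agents: Agents): Promise<Flashcard[]> {
  const objectives = await agents.analyzeContent(chunks);
  const candidates = await agents.generateQuestions(objectives, chunks);
  const verdicts = await Promise.all(candidates.map((card) => agents.isAcceptable(card)));
  return candidates.filter((_, index) => verdicts[index]);
}
```

Passing the agents in as functions keeps the orchestration testable with stubs, independent of any model client.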

Background Job Processing

AI Study uses BullMQ with Redis for reliable background processing:

PDF Ingestion: Asynchronous processing of uploaded PDFs (text extraction, chunking, embedding)
Flashcard Generation: Background generation of questions with progress tracking
Job Monitoring: Track job status via /queue/:queueName/jobs/:jobId endpoint
Automatic Retries: Failed jobs retry with exponential backoff (3 attempts)
WebSocket Updates: Real-time progress notifications to frontend

Production: Uses Upstash Redis (serverless) for zero-maintenance job queues.
Local Dev: Connects to local Redis instance for testing.
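
As an illustration of the retry policy described above, here is a plain TypeScript sketch of three attempts with exponential backoff. It is not BullMQ's implementation or this project's configuration: `backoffDelayMs`, `runWithRetries`, and the base delay are assumptions. In BullMQ itself, the equivalent behavior is configured per job through the `attempts` and `backoff: { type: 'exponential', delay }` options.

```typescript
// Delay before retry number `retry` (1-based): base, 2*base, 4*base, ...
function backoffDelayMs(retry: number, baseDelayMs: number): number {
  return baseDelayMs * 2 ** (retry - 1);
}

interface RetryOptions {
  attempts: number; // total tries, including the first one (3 in this README)
  baseDelayMs: number; // assumed value; the README does not state the delay
  sleep?: (ms: number) => Promise<void>; // injectable so tests need not wait
}

const defaultSleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

// Run `task` until it succeeds or the attempts are used up, waiting longer
// after each failure. The last error is rethrown if every attempt fails.
async function runWithRetries<T>(
  task: (attempt: number) => Promise<T>,
  options: RetryOptions,
): Promise<T> {
  const sleep = options.sleep ?? defaultSleep;
  let lastError: unknown = new Error("attempts must be at least 1");
  for (let attempt = 1; attempt <= options.attempts; attempt++) {
    try {
      return await task(attempt);
    } catch (error) {
      lastError = error;
      if (attempt < options.attempts) {
        await sleep(backoffDelayMs(attempt, options.baseDelayMs));
      }
    }
  }
  throw lastError;
}
```

With `attempts: 3` and a base delay of one second, a job that keeps failing waits one second, then two seconds, and is then marked as failed.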

Tech Stack

Backend: NestJS, Prisma, PostgreSQL (with pgvector), Google Vertex AI, BullMQ + Redis.
Frontend: Next.js, TypeScript, Radix UI.
Infrastructure: Google Cloud Run, Cloud Storage, Cloud Build, Upstash Redis.

🚀 Quick Start

Prerequisites

Node.js 18+ & npm
PostgreSQL (local or cloud)
Redis (local or Upstash for production)
Google Cloud Project (Vertex AI, Storage, Cloud Build enabled)

Installation

Clone & Install:

```
git clone <repository-url>
cd ai-study
npm install
```

Environment Setup: Copy .env.example to .env and packages/api/.env, then populate with your Google Cloud credentials and DB URL.
Database Init:
```
cd packages/api
npx prisma migrate dev
```
Redis Setup (Local Development):
```
# Install Redis (macOS, via Homebrew)
brew install redis

# Start Redis as a background service
brew services start redis
```
For production, configure Upstash Redis credentials in GCP Secret Manager (see deployment section).
Run:
```
./run.sh
```
This starts the backend (:3000) and the frontend (:3001) with auto-generated SDKs.

📚 Documentation & Support

API Swagger: http://localhost:3000/api
Database Studio: npm run start:studio
Job Queue Monitoring: Check job status at GET /queue/:queueName/jobs/:jobId
Implementation Details: See PHASE_5_IMPLEMENTATION.md for background processing architecture
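
A small client-side sketch of the job status lookup follows. The URL pattern comes from the endpoint above; everything else is an assumption: the helper names `jobStatusUrl` and `getJobStatus` are hypothetical, the response is returned as `unknown` because its shape is not documented here, and the HTTP function is passed in so the sketch does not depend on a particular client.

```typescript
// Build the status URL for GET /queue/:queueName/jobs/:jobId.
// Path segments are URL-encoded; a trailing slash on the base is tolerated.
function jobStatusUrl(baseUrl: string, queueName: string, jobId: string): string {
  const base = baseUrl.replace(/\/+$/, "");
  return `${base}/queue/${encodeURIComponent(queueName)}/jobs/${encodeURIComponent(jobId)}`;
}

// Minimal subset of a fetch-style response that this sketch relies on.
interface HttpResponse {
  ok: boolean;
  status: number;
  json(): Promise<unknown>;
}

type HttpGet = (url: string) => Promise<HttpResponse>;

// Fetch the job status. The payload shape is not documented in this README,
// so it is returned untyped for the caller to inspect.
async function getJobStatus(
  get: HttpGet,
  baseUrl: string,
  queueName: string,
  jobId: string,
): Promise<unknown> {
  const response = await get(jobStatusUrl(baseUrl, queueName, jobId));
  if (!response.ok) {
    throw new Error(`Job status request failed with HTTP ${response.status}`);
  }
  return response.json();
}
```

On Node.js 18+, the global `fetch` can be passed as `get`, for example `getJobStatus(fetch, "http://localhost:3000", "pdf-ingestion", "42")`, where the queue name and job ID are placeholders.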

🚢 Deployment

Automated CI/CD via GitHub Actions with Google Cloud integration:

Workload Identity Federation for secure authentication
Google Cloud Secrets for environment variables and credentials
Docker multi-platform builds pushed to Artifact Registry
Cloud Run for serverless API and frontend hosting
Automatic database migrations with Prisma on deploy

Push to the main branch to trigger a deployment.

📝 License

MIT License - see LICENSE file.
