Pixly - Your AI Gaming Assistant 🎮

Pixly is a desktop overlay that acts as your gaming assistant, combining AI chat with automated, privacy-friendly screenshot capture and a game-specific Retrieval-Augmented Generation (RAG) knowledge base. Pixly detects what game you're playing, retrieves relevant, curated knowledge (wikis, user-supplied YouTube descriptions, and forum posts) via a local vector database, and grounds Gemini responses on those sources.

Make sure to star our repository, your support is much appreciated.

Important

🎃 Hacktoberfest 2025 Participant Please make sure to star this repo.

🤝 Contributing, Setup and Install

We welcome Hacktoberfest 2025 contributors! Whether you're adding new games to the knowledge base, improving the UI, or enhancing AI capabilities, your contributions matter.

📖 For Contributing Visit CONTRIBUTING.md
⚙️ For Setup and Installation visit INSTALL.md

🎮 What Pixly Does

🤖 Intelligent, game-focused chat using Google Gemini with a "Game Expert" system prompt
🎯 Contextual help based on your active game (process detection and/or user message)
📸 Optional screenshot-powered context for visual analysis
🔍 RAG pipeline over per-game CSV knowledge with local vector search (Chroma)
💻 Modern desktop overlay for chatting, settings, and screenshot gallery

🏗️ Architecture Overview

Pixly is organized into three main layers: UI Overlay, Backend API, and AI/RAG services, all running locally.

1) UI Overlay (`overlay.py`)

CustomTkinter-based floating overlay, always-on-top, draggable
Chat window with typing indicator and styled messages (user vs assistant)
Settings window to manage screenshot capture and set the Google API key (persisted to .env via backend)
Screenshot gallery with View and Delete actions

2) Backend API (`backend/`)

FastAPI server exposes HTTP endpoints on 127.0.0.1:8000
Responsibilities:
- Route chat requests to Gemini
- Manage screenshots (start/stop capture, list, view, delete)
- Detect current game (process + manual keywords)
- Manage per-game knowledge ingestion and vectorization
- Provide vector search and knowledge stats
- Manage API key configuration (.env persistence + live reconfigure)

Key modules:

backend/backend.py: API endpoints and routing
backend/chatbot.py: Gemini client configuration, runtime reconfigure, and chat logic (with RAG context injection)
backend/screenshot.py: Encrypted screenshot capture and storage; database operations; delete support
backend/game_detection.py: Game detection via running processes, recent screenshots, and user message keywords
backend/knowledge_manager.py: CSV ingestion and text extraction (wiki + forum; YouTube entries use user-provided description)
backend/vector_service.py: Chroma persistent client, collection management, chunking, embeddings, and semantic queries

3) AI & RAG Layer

Model: Google Gemini 2.5 Flash Lite for responses
System prompt (PROMPTS.txt) defines "Game Expert" persona and instructs grounding answers in retrieved snippets (WIKI / YOUTUBE / FORUM) with URLs
Vector DB: Chroma (persistent on disk in vector_db/)
Embeddings: sentence-transformers by default (configurable); text is chunked and embedded per content piece
Retrieval: top-k relevant chunks by cosine similarity; included as context in the prompt

📚 Knowledge Base & Data Flow

Pixly's knowledge is curated per game via CSV files that live in games_info/. The CSV schema is simple and contributor-friendly:

wiki,wiki_desc,youtube,yt_desc,forum,forum_desc

wiki: URL to a relevant wiki page; Pixly extracts textual content
wiki_desc: Contributor-provided description of the wiki URL
youtube: URL to a relevant video; Pixly does not auto-transcribe; it uses the contributor-provided description
yt_desc: Contributor-provided description of the YouTube URL
forum: URL to a relevant forum/thread; Pixly extracts textual content
forum_desc: Contributor-provided description of the forum URL

Processing pipeline per game:

Load CSV for the game (e.g., games_info/minecraft.csv)
Extract text from wiki and forum URLs; keep YouTube descriptions as-is
Clean and chunk text into manageable segments (e.g., ~512 tokens)
Generate embeddings for each chunk and persist into Chroma collections
On chat, detect game and run a semantic search to retrieve top snippets, then ground Gemini's response on those

Vector DB collections are organized by game and source type, e.g. minecraft_wiki, minecraft_youtube, minecraft_forum.

🎯 Game Detection

Pixly uses a layered strategy to infer the current game:

Process Detection: Scans running processes for known executables
Screenshot Context: Uses recent screenshot metadata (app/window) when available
Manual Override: Detects game mentions in the user's message (e.g., "I'm playing Minecraft")

The detection result is passed into the RAG layer to scope retrieval to the active game's knowledge base.

🔌 API Surface (Selected)

POST /chat: Chat with Gemini; auto-detects game; augments prompt with retrieved snippets
POST /screenshots/start?interval=30: Start periodic capture
POST /screenshots/stop: Stop capture
GET /screenshots/recent?limit=10&application=...: List recent screenshots (metadata)
GET /screenshots/{id}: Fetch a screenshot's image data (base64)
DELETE /screenshots/{id}: Delete a screenshot entry
POST /games/detect: Detect current game (optionally pass message for keyword hints)
GET /games/list: Enumerate detection-supported games, CSV-available games, and games with vectors
GET /games/{game}/knowledge/validate: Validate CSV schema
POST /games/{game}/knowledge/process: Ingest CSV and build vectors in Chroma
POST /games/{game}/knowledge/search: Vector search within a game (query, content_types, limit)
GET /games/{game}/knowledge/stats: Document counts per source type
GET /settings/api-key: Report whether the Gemini API key is configured (masked preview)
POST /settings/api-key: Persist API key to .env and live-reconfigure the chatbot

📁 Project Structure

pixly/
├── backend/             
│   ├── backend.py                # FastAPI app initialization
├── routers/                      # Contains all the API Routers
|   ├── chat.py                   # Stores chat endpoints
|   ├── game_detection.py         # Stores game detection and vector search endpoints
|   ├── screenshot.py             # Stores screenshot endpoints
|   ├── setting.py                # Stores settings endpoints
├── services/                     # Contains all the backend services.
│   ├── chatbot.py                # Gemini integration, RAG-aware chat, runtime reconfigure
│   ├── screenshot.py             # Encrypted screenshot capture, DB ops, delete support
│   ├── game_detection.py         # Process/message/screenshot-based game detection
│   ├── knowledge_manager.py      # CSV ingestion and content extraction (wiki/forum)
│   └── vector_service.py         # Chroma collections, embeddings, and search
├── schemas/                      # Contains the schemas for the various requests
|   ├── chat.py
|   ├── game_detection.py
|   ├── knowledge_search.py
|   ├── settings.py
├── overlay.py                    # CustomTkinter overlay (chat, settings, screenshot viewer)
├── games_info/                   # Per-game CSVs (e.g., minecraft.csv)
├── vector_db/                    # Chroma persistent storage
├── PROMPTS.txt                   # System persona + RAG grounding instructions
├── run.py                        # Backend server launcher
├── pyproject.toml                # Dependencies and metadata
├── screenshots.db                # Encrypted screenshot database (auto-created)
├── screenshot_key.key            # Encryption key (auto-generated)
└── README.md                     # Project documentation

🛠️ Technology Stack

UI/Frontend: CustomTkinter (modern theming and widgets for Python GUIs)
API/Backend: FastAPI (async Python web framework) + Uvicorn (ASGI server)
AI: Google Gemini 2.5 Flash Lite via google-generativeai
RAG: Chroma (persistent local vector DB) + sentence-transformers (embeddings)
Data: CSV-based per-game knowledge; SQLite for screenshots; Fernet for encryption
System: psutil + pywin32 for Windows process/window info; Pillow for imaging

Notes:

The embedding model is configurable; by default we use a sentence-transformers model suitable for local inference. The system can be switched to a different embedder (e.g., Mistral embeddings) with minor changes in vector_service.py.
The persona and grounding behavior are controlled by PROMPTS.txt so Gemini cites sources from retrieved snippets and focuses answers on gaming topics.

⚙️ How Components Work Together

The overlay sends chat requests to the backend.
The backend detects the current game and queries Chroma for relevant snippets (wiki, YouTube description, forum).
Retrieved snippets are added to the prompt to ground Gemini's response.
If a screenshot is provided, it's included as a multimodal input to Gemini for visual context.
The overlay displays a typing indicator while waiting and distinguishes user vs assistant messages for readability.

🔒 Security & Privacy

Local-first design: screenshots, vectors, and CSVs are stored on your machine
Encrypted screenshot blobs at rest using Fernet (AES)
API key managed locally via the settings UI and .env persistence
No telemetry or external data collection

📄 License

MIT License — see LICENSE.

🙏 Acknowledgments

Google Gemini for AI capabilities
CustomTkinter for modern GUI components
FastAPI for a robust backend framework
The Hacktoberfest 2025 community for open-source collaboration
All our amazing contributors who make this project possible!

Made with ❤️ for Hacktoberfest 2025

If you find this project helpful, please ⭐ star it on GitHub!

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
.github		.github
backend		backend
games_info		games_info
routers		routers
schemas		schemas
services		services
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
INSTALL.md		INSTALL.md
PROMPTS.txt		PROMPTS.txt
README.md		README.md
docker-compose.yml		docker-compose.yml
overlay.py		overlay.py
pyproject.toml		pyproject.toml
run.py		run.py
setup.bat		setup.bat
test_system.py		test_system.py
test_system_BMW.py		test_system_BMW.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Pixly - Your AI Gaming Assistant 🎮

📋 Table of Contents

🤝 Contributing, Setup and Install

🎮 What Pixly Does