🎨 ChalkAI

Transform rough sketches into professional diagrams with AI

An intelligent whiteboard application that uses Google Gemini to refine hand-drawn sketches into clean, publication-ready diagrams.

Built by Abhishek & Jaydeep

🎥 Demo

ChalkAI-Demo.mp4

✨ Features

Feature	Description
🖌️ Interactive Whiteboard	Full-featured drawing canvas powered by tldraw with support for freehand drawing, shapes, arrows, and more
🤖 AI-Powered Refinement	Leverages Google Gemini 2.5 Flash to transform rough sketches into professional diagrams
🎤 Voice Input	Describe your diagram intent using voice commands with automatic idle detection
⚡ Quick Enhance	One-click enhancement to instantly improve any sketch
🎯 Selection-Aware	Refine specific parts of your canvas by selecting shapes before generating
⌨️ Keyboard Shortcuts	Accept suggestions with `Tab`, reject with `Esc` for rapid iteration
🎨 Glass Morphism UI	Modern, sleek interface with smooth animations powered by Framer Motion

🚀 Quick Start

Prerequisites

Node.js 18+ or Bun runtime
Google Gemini API Key (Get one here)

Installation

# Clone the repository
git clone https://github.com/yourusername/ChalkAI.git
cd ChalkAI/frontend

# Install dependencies
npm install
# or
bun install

Environment Setup

Create a .env.local file in the frontend directory:

GOOGLE_GENERATIVE_AI_API_KEY=your_api_key_here

Run Development Server

npm run dev
# or
bun run dev

Open http://localhost:3000 in your browser.

📖 How to Use

1️⃣	Draw — Sketch your diagram idea on the whiteboard
2️⃣	Describe — Type or speak what your diagram represents
3️⃣	Generate — Click the `+` button or use the ✨ Quick Enhance
4️⃣	Review — See the AI-generated diagram in the preview panel
5️⃣	Accept — Press `Tab` to replace your sketch with the refined diagram

Voice Input 🎤

Click the microphone button to start listening
Describe your diagram verbally
Stop speaking for 4.5 seconds or click the mic again
The AI automatically generates based on your voice description

🏗️ Architecture

graph TD
    A[User Draws Sketch] --> B[tldraw Canvas]
    B --> C[Export to PNG]
    C --> D["/api/complete-diagram"]
    D --> E["Google Gemini 2.5 Flash"]
    E --> F[Generated PNG]
    F --> G[Preview Panel]
    G -->|Tab| H[Accept & Replace]
    G -->|Esc| I[Reject & Keep Original]

📂 Project Structure

frontend/
├── app/
│   ├── api/
│   │   └── complete-diagram/
│   │       └── route.ts        # AI processing endpoint
│   ├── page.tsx                # Main application page
│   ├── layout.tsx              # Root layout with metadata
│   └── globals.css             # Global styles & theme
├── components/
│   ├── ui/                     # Reusable UI components
│   │   └── button.tsx          # Button component with variants
│   ├── whiteboard.tsx          # tldraw canvas wrapper
│   └── intent-input.tsx        # Text input for diagram intent
├── lib/
│   └── utils.ts                # Utility functions (cn helper)
└── types/
    └── speech-recognition.d.ts # TypeScript declarations

🛠️ Tech Stack

Category	Technology
Framework	Next.js 16 (App Router)
UI Library	React 19
Language	TypeScript 5
Styling	Tailwind CSS 4
Animations	Framer Motion
Whiteboard	tldraw v4
AI Integration	Vercel AI SDK + Google Gemini
Components	Custom + shadcn/ui patterns

🔌 API Reference

`POST /api/complete-diagram`

Generates a refined diagram from a sketch and text description.

Request Body:

{
  "prompt": "A flowchart showing user authentication process",
  "image_data": "base64_encoded_png_data"
}

Response:

{
  "image_data": "base64_encoded_generated_png"
}

Error Response:

{
  "error": "Error message",
  "details": "Additional error details"
}

⌨️ Keyboard Shortcuts

Shortcut	Action
`Tab`	Accept the AI suggestion
`Esc`	Reject the AI suggestion
tldraw shortcuts	All standard tldraw shortcuts work

🧩 Key Components

`<Whiteboard />`

The core canvas component wrapping tldraw. Handles:

Canvas mounting and export functionality
Selection-aware image export
AI suggestion acceptance (Tab key)
Drawing activity detection for voice input

`<IntentInput />`

A minimal text input for describing diagram intent with:

120 character limit
Clean, borderless design
Submit on Enter

📦 Scripts

# Development server with hot reload
npm run dev

# Production build
npm run build

# Start production server
npm run start

# Run ESLint
npm run lint

🔧 Troubleshooting

❌ API Key Issues

Ensure .env.local is in the frontend directory
Restart the dev server after modifying environment variables
Verify your API key is valid at Google AI Studio

❌ LightningCSS Binary Error (Windows)

If you see an error about lightningcss.win32-x64-msvc.node:

npm install lightningcss-win32-x64-msvc
Copy-Item "node_modules\lightningcss-win32-x64-msvc\lightningcss.win32-x64-msvc.node" "node_modules\lightningcss\lightningcss.win32-x64-msvc.node"
Remove-Item -Recurse -Force .next
npm run dev

❌ No Image Generated

Make sure you've drawn something on the canvas
Check that your Gemini API key has access to image generation models
Look at browser console and terminal for detailed error messages

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

📄 License

MIT License — feel free to use this project for personal or commercial purposes.

Made with ❤️ using Next.js, tldraw, and Google Gemini

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
frontend		frontend
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎨 ChalkAI

🎥 Demo

✨ Features

🚀 Quick Start

Prerequisites

Installation

Environment Setup

Run Development Server

📖 How to Use

Voice Input 🎤

🏗️ Architecture

📂 Project Structure

🛠️ Tech Stack

🔌 API Reference

`POST /api/complete-diagram`

⌨️ Keyboard Shortcuts

🧩 Key Components

`<Whiteboard />`

`<IntentInput />`

📦 Scripts

🔧 Troubleshooting

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

jaydxxp/ChalkAI

Folders and files

Latest commit

History

Repository files navigation

🎨 ChalkAI

🎥 Demo

✨ Features

🚀 Quick Start

Prerequisites

Installation

Environment Setup

Run Development Server

📖 How to Use

Voice Input 🎤

🏗️ Architecture

📂 Project Structure

🛠️ Tech Stack

🔌 API Reference

POST /api/complete-diagram

⌨️ Keyboard Shortcuts

🧩 Key Components

<Whiteboard />

<IntentInput />

📦 Scripts

🔧 Troubleshooting

🤝 Contributing

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

`POST /api/complete-diagram`

`<Whiteboard />`

`<IntentInput />`

Packages