Skip to content

scottsus/deep-clone

Repository files navigation

🎙️ Deep Clone

I cloned my voice, gave it a knowledge base, and have it answer intro chats on my behalf.

deep-clone-04-20-25

Why?

Will someone use my voice to trick my mom? Hopefully not. Just wanna play around with multimodal AI, adding spice to audio by having it engage in conversation, referencing some data source, and having it talk to my friends.

Quick Start

  1. Clone and install dependencies
yarn install
  1. Add environment variables - see .env.example in each package.

  2. Visit http://localhost:3000 and watch the magic unfold

Architecture

apps/web

Standard Next.js web app. Frontend creates a Daily.co room, makes some Postgres writes, and faciliates exchange of audio between client and server.

apps/fastapi

Dockerized FastAPI app. Makes calls to various AI providers like OpenAI for GPT 4o, ElevenLabs for TTS, Deepgram for realtime audio transcription.

packages/database

Use Postgres - for obvious reasons, and Prisma ORM because it offers both node and python clients, so the same schema can be reused across multiple packages.

Email notification shenanigans

About

Conversational voice cloning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •