Talking AI

Talking AI is a simple Node.js application that lets you upload an MP3 file, convert the speech to text with OpenAI's Whisper API, generate an answer with OpenAI GPT, and convert that answer back into speech for playback. The app has a basic front end and demonstrates a clear chain of AI-based interactions: starting from voice, moving through natural language understanding, and returning to voice.

Purpose

The aim of this project is to demonstrate how voice-driven AI pipelines can be built with modern tools like Whisper and GPT. It helps developers explore how human-like interaction can be achieved with audio input and output. The project can be extended for use cases such as voice assistants, learning tools, and customer support bots, especially for Turkish or multilingual audiences.

Features

Upload MP3 audio files
Transcribe speech to Turkish text using OpenAI Whisper
Generate a response based on the transcribed text using OpenAI GPT
Convert the response back to voice using OpenAI Text-to-Speech
Simple UI to trigger each step manually
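
The features above form a three-step chain. The app triggers each step manually from the UI, but the same chain can be sketched as a single function for illustration. This is a minimal sketch, not the project's actual code: `runVoicePipeline` and the three injected step functions (`transcribe`, `generateAnswer`, `synthesize`) are hypothetical names, and injecting them keeps the example free of real API calls.

```javascript
// Sketch of the voice -> text -> answer -> voice chain.
// The step functions are injected (hypothetical names), so this code
// makes no API calls itself and can be exercised with stubs.
async function runVoicePipeline(audioBuffer, steps) {
  const { transcribe, generateAnswer, synthesize } = steps;

  // Step 1: speech to text (Whisper in the real app).
  const transcript = await transcribe(audioBuffer);
  if (typeof transcript !== 'string' || transcript.trim() === '') {
    throw new Error('Transcription was empty; nothing to answer.');
  }

  // Step 2: text to answer (GPT in the real app).
  const answer = await generateAnswer(transcript);

  // Step 3: answer to speech (Text-to-Speech in the real app).
  const speech = await synthesize(answer);

  return { transcript, answer, speech };
}
```

Returning all three intermediate results, rather than only the audio, makes it easy for a UI to show the transcript and the answer text alongside the playback.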

Installation

Clone the repository:

git clone https://github.com/rastmob/talking-ai.git
cd talking-ai

Install dependencies:
```
npm install
```
Create a .env file in the root directory and add your OpenAI API key:
```
OPENAI_API_KEY=your-api-key-here
```
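
A missing or empty key usually only shows up when the first API call fails, so a small startup check can help. The sketch below assumes the values in `.env` end up in `process.env` (for example through the `dotenv` package; the README does not say how the project loads them). `requireEnv` is a hypothetical helper, not part of the project.

```javascript
// Hypothetical helper: returns the named environment variable,
// or throws a descriptive error if it is missing or blank.
function requireEnv(name, env = process.env) {
  const value = env[name];
  if (typeof value !== 'string' || value.trim() === '') {
    throw new Error(`Missing ${name}. Add it to your .env file.`);
  }
  return value.trim();
}
```

Calling `requireEnv('OPENAI_API_KEY')` once at the top of the server entry point would surface a missing key at startup rather than on the first upload.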
Start the server:
```
npm start
```
Open your browser and go to http://localhost:3000

Tech Stack

Node.js + Express
OpenAI Whisper API (Speech to Text)
OpenAI GPT API (Text Generation)
OpenAI TTS API (Text to Speech)
HTML, CSS, JavaScript (Client Side)
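
To make the three OpenAI calls concrete, here is a rough sketch of the HTTP requests behind them, using the public REST endpoints and the `fetch`, `FormData`, and `Blob` globals built into Node.js 18+. The project may well use the official `openai` SDK instead. The function names, the model names (`whisper-1`, `gpt-4o-mini`, `tts-1`), the `alloy` voice, and the `input.mp3` file name are assumptions chosen for illustration, not values taken from the project.

```javascript
const OPENAI_BASE_URL = 'https://api.openai.com/v1';

// Speech to text: multipart upload of the MP3 to the transcription endpoint.
function buildTranscriptionRequest(mp3Buffer, apiKey) {
  const form = new FormData();
  form.append('file', new Blob([mp3Buffer], { type: 'audio/mpeg' }), 'input.mp3');
  form.append('model', 'whisper-1');   // assumed model
  form.append('language', 'tr');       // ISO-639-1 code for Turkish
  return {
    url: `${OPENAI_BASE_URL}/audio/transcriptions`,
    // No Content-Type header here: fetch sets the multipart boundary itself.
    options: { method: 'POST', headers: { Authorization: `Bearer ${apiKey}` }, body: form },
  };
}

// Text generation: the transcript is sent as a single user message.
function buildChatRequest(transcript, apiKey) {
  return {
    url: `${OPENAI_BASE_URL}/chat/completions`,
    options: {
      method: 'POST',
      headers: { Authorization: `Bearer ${apiKey}`, 'Content-Type': 'application/json' },
      body: JSON.stringify({
        model: 'gpt-4o-mini',          // assumed model
        messages: [{ role: 'user', content: transcript }],
      }),
    },
  };
}

// Text to speech: the response body of this request is binary audio.
function buildSpeechRequest(text, apiKey) {
  return {
    url: `${OPENAI_BASE_URL}/audio/speech`,
    options: {
      method: 'POST',
      headers: { Authorization: `Bearer ${apiKey}`, 'Content-Type': 'application/json' },
      body: JSON.stringify({ model: 'tts-1', voice: 'alloy', input: text }), // assumed model and voice
    },
  };
}
```

Each builder returns a `{ url, options }` pair that can be passed straight to `fetch(url, options)`. The transcription response is JSON with a `text` field, the chat response carries the answer in `choices[0].message.content`, and the speech response can be read with `arrayBuffer()` and written to disk as an audio file.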

Project Structure

talking-ai/
├── public/              # Frontend HTML & JS
├── responses/           # Generated audio files
├── uploads/             # Uploaded MP3s
├── server.js            # Express backend
├── .env                 # Environment variables (not committed)
└── package.json
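
The `uploads/` and `responses/` directories hold runtime files rather than source code, so a fresh clone may not contain them. One way to handle that, sketched here under the assumption that the server creates them itself at startup (the README does not confirm this), is a small helper. `ensureRuntimeDirs` is a hypothetical name.

```javascript
const fs = require('node:fs');
const path = require('node:path');

// Hypothetical helper: creates uploads/ and responses/ under rootDir
// if they are missing and returns their resolved paths.
function ensureRuntimeDirs(rootDir) {
  const dirs = {
    uploads: path.resolve(rootDir, 'uploads'),
    responses: path.resolve(rootDir, 'responses'),
  };
  for (const dir of Object.values(dirs)) {
    // recursive: true means no error is thrown if the directory already exists.
    fs.mkdirSync(dir, { recursive: true });
  }
  return dirs;
}
```

In a server entry point this could be called as `ensureRuntimeDirs(__dirname)` before any upload route is registered.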

Developed by

This project is developed and maintained by Rast Mobile, an innovative software company that specializes in mobile development, AI integrations, and custom web platforms.

Contact & Profiles:

Website: https://rastmobile.com
LinkedIn: linkedin.com/company/rastmobile
GitHub: @rastmob
Author: @mobilerast, mehmet.alp@rastmobile.com

License

This project is licensed under the MIT License.
