This is a Next.js application that provides a user interface for interacting with OpenAI's audio APIs, including text-to-speech and speech-to-text.
- Text-to-speech: Convert text into audio using OpenAI's text-to-speech API.
- Speech-to-text (WIP): Transcribe audio into text using OpenAI's speech-to-text API.
- History: Keep track of your previous conversions and easily access them again.
- Audio playback: Play back the generated audio directly in the browser.
- Download: Download the generated audio files for offline use.
- Clone the repository:
git clone https://github.com/KelvinCampelo/open-aiudio-client
- Install dependencies:
npm install
- Start the development server:
npm run dev
- Open your browser and navigate to http://localhost:3000 to see the application in action.
- Fill in your OpenAI API key: Enter your API key in the designated field. You can obtain an API key from the OpenAI website.
- Select an API: Choose either "Text-to-speech" or "Speech-to-text" (WIP) from the navigation menu.
- Enter your input:
- For Text-to-speech, provide the text you want to convert.
- For Speech-to-text, upload an audio file. (WIP)
- Configure settings: Customize the voice, speed, and other settings for your conversion.
- Generate audio: Click the "Generate" button to start the conversion process.
- Play or download: Listen to the generated audio or download it for offline use.
Contributions are welcome! Here's how you can contribute:
- Report issues: If you encounter any bugs or have suggestions for improvements, please open an issue on GitHub.
- Submit pull requests: If you have code changes or new features to add, please submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for more information.
- OpenAI for providing the powerful audio APIs.
- Next.js for providing a robust framework for building web applications.
- Tailwind CSS for providing a utility-first CSS framework.
For any questions or feedback, please contact kelvsousa@gmail.com