This is a monorepo project that includes both the frontend and backend for the AI Shorts Video Generator. The application allows users to generate short videos using AI, with features such as video creation, preview, and export. The frontend is built with Next.js, while the backend is built with ASP.NET Core and integrates with various AI services.
- Features
- Requirements
- Installation
- Environment Variables
- Development
- Docker
- API Endpoints
- Dependencies
- Usage
- Video Creation: Users can create short videos by selecting topics, styles, and durations.
- Video Preview: After generating a video, users can preview it with generated captions and images.
- Export Video: Users can export the generated video once the rendering is completed.
- Theming: Supports light and dark themes using
next-themes
. - Responsive Design: Built with Tailwind CSS for responsive and modern UI.
- Content Generation: Create video content based on user input using Google's Gemini API.
- Audio Synthesis: Convert text input to speech using Google Cloud Text-to-Speech API.
- Caption Generation: Generate captions for audio or video files using AssemblyAI.
- Image Generation: Generate images from text prompts using Cloudflare's AI API.
- Video Storage: Save and retrieve video data, including associated content and captions.
- Node.js (v22.x)
- .NET 9 SDK
- PostgreSQL for the database. I used Neon Serverless Postgres
- Cloudinary account and API key
- Cloudflare Workers AI account and API key. I used flux-1-schnell Model
- AssemblyAI API key
-
Clone the repository to your local machine:
git clone https://github.com/yourusername/AiShortsVideoGenerator.git cd AiShortsVideoGenerator
-
Install frontend dependencies:
cd frontend npm install
-
Install backend dependencies:
cd ../backend dotnet restore
Configure the following environment variables for both frontend and backend:
NEXT_PUBLIC_API_URL=http://localhost:5211
REMOTION_AWS_SERVE_URL=<Your AWS Serve URL for Remotion>
Check the Remotion setup lambda guide to get the REMOTION_AWS_SERVE_URL
variable.
{
"Logging": {
"LogLevel": {
"Default": "Information",
"Microsoft.AspNetCore": "Warning"
}
},
"GoogleApi": {
"GeminiKey": "your-gemini-api-key",
"TextToSpeechKey": "your-text-to-speech-api-key"
},
"AssemblyAi": {
"ApiKey": "your-assemblyai-api-key"
},
"CloudinaryUrl": "your-cloudinary-url",
"Cloudflare": {
"ApiKey": "your-cloudflare-api-key",
"AccountId": "your-cloudflare-account-id"
},
"ConnectionStrings": {
"DefaultConnection": "your-postgresql-connection-string"
}
}
To run the frontend application in development mode:
cd frontend
npm run dev
To run the backend application:
cd backend
dotnet ef database update
dotnet run
To run the application in Docker:
Configure the following environment variables:
(.env.local
)
NEXT_PUBLIC_API_URL=http://localhost:8080
Then run:
docker compose up
- GET /videos: Retrieve a list of all videos stored in the database.
- POST /generate-content: Generate video content from user input using Google's Gemini API.
- POST /generate-audio: Convert text input to audio (MP3) using Google Cloud Text-to-Speech API.
- POST /generate-captions: Generate captions for an audio or video file using AssemblyAI.
- POST /generate-image: Generate an image from a text prompt using Cloudflare's AI API.
- POST /save-video: Save a video record to the database.
- PUT /videos/{id}: Update video Output File and Render Id.
- DELETE /videos/{id}: Delete video from the database.
- Next.js: React framework for building server-side rendered applications.
- TailwindCSS: Utility-first CSS framework for styling the app.
- Remotion: Library for creating videos programmatically with React.
- Axios: HTTP client for making requests to the API.
- shadcn: A collection of re-usable components that you can copy and paste into your apps.
- Google.Cloud.TextToSpeech.V1: Google Cloud Text-to-Speech client.
- AssemblyAI: Client library for AssemblyAI services.
- CloudinaryDotNet: Cloudinary SDK for uploading media.
- Npgsql.EntityFrameworkCore.PostgreSQL: PostgreSQL support for Entity Framework Core.
- Dashboard: Users can view and manage generated short videos.
- Create New Video: Users can define the video topic, style, and duration, and the app will generate the video content, audio, captions, and images using AI.
- Video Preview & Export: After the video is generated, users can preview it in the dialog and export it once rendered.