An experimental demonstration of using MCP Servers for Genmedia in Gemini CLI with planned iterative ideation and media generation.
It uses Imagen 4 and Veo 3 models through MCP Servers for Genmedia.
It also edits images using gemini-2.5-flash-image-preview
model (aka nano-🍌) for character and scene consistency.
Demo video: TODO
-
Install gcloud CLI and Gemini CLI.
If you didn't use Google Cloud before, get a free trial.
If
gcloud auth print-identity-token -q
CLI command result is empty, rungcloud auth login --update-adc
. -
Enable Vertex AI APIs in your project.
NOTE: From here, you can ask Gemini CLI to perform steps 2 to 5 for you.
Simply run gemini
, choose Vertex AI authentication option, and ask it to perform steps 2 to 5 from the instructions below. Sometimes, it may need your help 🙂
-
Clone Vertex AI Creative Studio repository inside this repo's clone:
git clone https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio
-
Install MCP Servers for Genmedia.
-
Create a Cloud Storage Bucket, and give
Vertex AI Service Agent
service account read and write access to the bucket.Vertex AI Service Agent is
service-PROJECT_NUMBER@gcp-sa-aiplatform.iam.gserviceaccount.com
account PROJECT_NUMBER is your project's number. You can give it access onPermissions
tab of your bucket's page. Project number can be found here. -
Clone this repository (finally!):
git clone https://github.com/vladkol/gemini-cli-media-generation cd gemini-cli-media-generation
-
Create
.env
file with your Google Cloud Project Id, location/region, and the bucket name. Use.env.template
as a starter. -
Configure MCP Servers in Gemini CLI. Use .gemini/settings.sample.json as a starter (we only need Imagen, Veo and AVTool servers).
-
Delete or rename the following sub-directories:
generated_images
generated_videos
prompts
They are here as an example.
-
Run a new session of Gemini CLI -
gemini
-
Ask it to create a video based on your idea!
-
Sign up for an Accelerate AI with Cloud Run developer workshop near you! 🚀
This project is a demonstration of vibe coding intended to provide an example with verification through human feedback. The goal is to show an idea for further development. It is not intended for use in a production environment.
All media assets and code in this repository are AI generated.