Nova AI - Voice-First Conversational Assistant

A sophisticated, cross-platform conversational AI assistant built with Flutter, leveraging Google's Gemini and Vertex AI APIs for real-time chat and generative image creation.

About The Project

Nova AI is a voice-first mobile assistant designed to provide a seamless and intuitive user experience. The application features a dual-AI backend, intelligently routing user prompts to the appropriate service. For conversational queries, it utilizes the powerful Google Gemini API to deliver fluid, context-aware responses, which are then vocalized using a native text-to-speech engine.

For creative requests, the app identifies the user's intent to generate art and calls the Google Vertex AI (Imagen) API to create stunning images from text prompts. The architecture is built to handle asynchronous operations gracefully, providing users with immediate visual feedback (loading states) to ensure a smooth, non-blocking UI.

Features

✅ Voice-First Interface: Start conversations and give commands entirely through voice.
✅ Dual AI Backend: Intelligently routes prompts to either Gemini for chat or Vertex AI for image generation.
✅ Text-to-Image Generation: Create high-quality images from text descriptions.
✅ Conversational Experience: The AI's responses are spoken aloud using a native TTS engine.
✅ Responsive & Performant UI: Asynchronous architecture with loading indicators prevents UI freezing ("jank") during API calls.
✅ Intuitive Controls: Users can interrupt the assistant's speech by tapping the microphone, and a dedicated button allows for easy conversation resets.

Tech Stack & Architecture

Core Framework: Flutter & Dart
State Management: setState for managing UI state including loading, listening, and result states.
AI & Cloud Services:
- Google Gemini API: For advanced conversational text generation.
- Google Vertex AI (Imagen Model): For text-to-image generation.
- Google Cloud Platform (GCP): For API management, billing, and service enablement.
Key Flutter Packages:
- speech_to_text: For native speech recognition.
- flutter_tts: For native text-to-speech synthesis.
- http: For handling advanced API requests to Vertex AI.
- google_generative_ai: For streamlined interaction with the Gemini API.
Development Tools: VS Code, Android Studio, Android Debug Bridge (ADB), Google Cloud CLI.

Key Challenges & Learnings

This project was a deep dive into building a production-quality application and involved overcoming several real-world challenges:

Complex Cloud Authentication: The biggest challenge was integrating with Vertex AI, which requires OAuth 2.0 access tokens, unlike the simpler API Key used by the Gemini API. This required learning to use the gcloud CLI to generate temporary development credentials, providing a deep understanding of Google Cloud's multi-layered security and authentication systems.
Cloud Service Configuration: Successfully navigated the Google Cloud Console to debug PERMISSION_DENIED errors by enabling the correct APIs (Vertex AI) and linking a billing account, which are critical, non-code-related skills for any cloud developer.
UI Performance Optimization: Encountered and solved the "Skipped Frames" issue by re-architecting the API call logic. By implementing a loading state (_isLoading) that updates the UI before the asynchronous network call, the main thread was kept free, resulting in a smooth, responsive user experience without any freezing.

Setup and Installation

To run this project locally, follow these steps:

Clone the repository:

git clone https://github.com/[YOUR_GITHUB_USERNAME]/nova_ai.git
cd nova_ai

Install Flutter dependencies:
```
flutter pub get
```
Configure Credentials (Crucial Step): This project requires credentials for Google Cloud services.
- Gemini API Key:
  - In the lib/ folder, create a file named secrets.dart.
  - Add your Gemini API key to this file:
```
// lib/secrets.dart
const geminiApiKey = 'YOUR_GEMINI_API_KEY_HERE';
```
- Vertex AI Access Token (For Development):
  - Make sure you have the Google Cloud CLI installed.
  - Authenticate with your Google account and provide all the access:
```
gcloud auth application-default login
```
  - Print your temporary access token:
```
gcloud auth application-default print-access-token
```
  - Copy the entire token string.
  - In lib/service.dart, paste this token into the tempAccessToken variable.
  - NOTE: This token expires after about an hour and needs to be regenerated for development.
Update Project ID:
- In lib/service.dart, find the _projectId variable and replace the placeholder with your actual Google Cloud Project ID.
Run the application:
```
flutter run
```

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

Debangshu Mounas - debangshumounas.dev@gmail.com

Project Link: https://github.com/DMounas/nova_ai

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
android		android
assets		assets
demo		demo
ios		ios
lib		lib
linux		linux
macos		macos
test		test
web		web
windows		windows
.gitattributes		.gitattributes
.gitignore		.gitignore
.metadata		.metadata
Nova_ai.gif		Nova_ai.gif
README.md		README.md
analysis_options.yaml		analysis_options.yaml
pubspec.lock		pubspec.lock
pubspec.yaml		pubspec.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nova AI - Voice-First Conversational Assistant

About The Project

Features

Tech Stack & Architecture

Key Challenges & Learnings

Setup and Installation

License

Contact

About

Uh oh!

Releases

Packages

Languages

DMounas/nova_ai

Folders and files

Latest commit

History

Repository files navigation

Nova AI - Voice-First Conversational Assistant

About The Project

Features

Tech Stack & Architecture

Key Challenges & Learnings

Setup and Installation

License

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages