A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Updated Nov 27, 2025 - Python
Deploy your private Gemini application for free with one click, supporting Gemini 1.5 and Gemini 2.0 models.
A desktop application that extracts YouTube playlist transcripts and enhances them using Google's Gemini AI models. The output is a book in any language you want.
Vanilla JS web interface for the Gemini 2.0 flash-exp Multimodal API with text, audio, camera, and screen inputs, audio responses, and function calling
Co-create PowerPoint slide decks with AI
Simplified Gemini for Claude Code.
🦀 A Pure Rust Framework For Building AGI (WIP).
A lightweight Python API wrapper and CLI for Google’s Gemini language models.
Autospec is an open-source AI agent that takes a web app URL, autonomously QAs it, and saves its passing specs as E2E test code
The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.
Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash
Gemini Pro: An AI-powered Telegram bot script for generating text and image-based responses using Gemini AI
An AI Discord Bot leveraging Google's Gemini 1.5 Model & Prodia!
A multi-agent application which provides a comprehensive financial/market analysis of any company
This project enables real-time streaming of audio (and optionally video or screen captures) from your local device to Google Gemini using the Live API. It allows you to interact with Gemini through both text and voice, supporting conversational AI responses.
AI agent for creating personalized digests of research papers
AI-powered flashcard generator built with React and Google Gemini. Create and customize quiz content seamlessly for an interactive learning experience.
Google Gemini Voice/Vision Assistant with the gemini-1.5-pro / gemini-1.5-flash models! #Gemini 1.5 Flash #Gemini 1.5 Pro
This repository contains a transformer-based model for real-time American Sign Language (ASL) recognition. The model leverages transformer architecture to interpret ASL gestures and utilizes the Gemini-Pro LLM API for constructing sentences from recognized ASL signs.