Skip to content
#

multi-modal-ai

Here are 22 public repositories matching this topic...

CollabAI is an open-source & self-hosted AI operation platform for small and medium-sized businesses. It’s a customizable & team-centric platform where you can have access to custom AI agents tailored to your business needs.

  • Updated Aug 26, 2025
  • JavaScript

RAPTOR (Rapid AI-Powered Text and Object Recognition) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis, semantic search, and actionable insights. RAPTOR reducing manual tagging by 85% and making content discovery 10x faster.

  • Updated Jan 15, 2026
  • Python

Sheldon AI Assistant is a powerful and versatile Discord bot and web application that enhances user interaction and automates tasks within your Discord server and beyond. With advanced AI models, seamless integrations, a intuitive web interface, and a wide range of features, Sheldon is your go-to assistant for an engaging and productive experience.

  • Updated Aug 30, 2025

Advanced AI architecture integrating multi-modal reasoning, dynamic token optimization, and self-reflective learning loops. Designed for high efficiency, deep contextual understanding, and adaptive general intelligence across vision, language, and logic tasks—pushing beyond conventional transformer limits.

  • Updated Oct 19, 2025

Powerful framework for building applications with Large Language Models (LLMs), enabling seamless integration with memory, agents, and external data sources.

  • Updated Feb 13, 2025
  • Jupyter Notebook

Text-Vision-Agent is an AI-powered assistant that generates images from text descriptions and provides detailed image descriptions. It combines image generation using FluxPipeline with vision-based language models like ChatOllama, enabling seamless text-to-image and image interpretation interactions.

  • Updated Feb 16, 2025
  • Python

Color-based semantic routing for Apache Kafka - Tag events with RGB hex codes for flexible consumer-side filtering. Eliminates topic proliferation and enables dynamic routing without payload deserialization. Python reference implementation with validated 5x speedup over content-based routing.

  • Updated Nov 15, 2025
  • Python

Sales Forge is a high performance, real time voice interaction platform designed to train sales representatives through adaptive AI personas. It provides a low latency, immersive roleplay experience that simulates real world sales challenges.

  • Updated Jan 20, 2026
  • Python

Explore The AI Agent Index: a comprehensive study of AI agent development, conversational AI, and virtual assistants. Learn best practices, emerging trends, and AI agent optimization strategies for customer support automation, lead generation AI, and enterprise AI integration.

  • Updated Jan 15, 2026

Improve this page

Add a description, image, and links to the multi-modal-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-modal-ai topic, visit your repo's landing page and select "manage topics."

Learn more