Build software better, together

sjinnovation / CollabAI

CollabAI is an open-source & self-hosted AI operation platform for small and medium-sized businesses. It’s a customizable & team-centric platform where you can have access to custom AI agents tailored to your business needs.

openai claude gemini-api ai-platform openai-api claude-ai gpt4-api claude-api gpt-4-1106-preview openai-assistant-api collaborativeai openai-assistant selfhostedai multi-modal-ai gpt4o custom-ai-agents ai-for-agency ai-for-non-profit

Updated Aug 26, 2025
JavaScript

DHT-AI-Studio / RAPTOR

Star

RAPTOR (Rapid AI-Powered Text and Object Recognition) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis, semantic search, and actionable insights. RAPTOR reducing manual tagging by 85% and making content discovery 10x faster.

nlp machine-learning microservices ai computer-vision deep-learning artificial-intelligence semantic-search ai-framework audio-processing content-analysis digital-asset-management video-analysis vector-database ai-automation llm multi-modal-ai content-intelligence ai-orchestration

Updated Nov 21, 2025
Python

CoRAL-ASU / weaver

Star

(EMNLP 2025) Weaver: A modular agentic pipeline that dynamically combines SQL and LLMs for advanced table-based question answering

machine-learning natural-language-processing sql database research question-answering sql-agent table-qa ai-agent llm reasoning-agent multi-modal-ai ott-qa table-reasoning finqa wikitablequestions tabfact

Updated Jul 19, 2025
HTML

tomoima525 / daily-diary

Star

Pipecat(Daily.co) x Gemini hackathon project. Talk to your agent and make your daily life memorable!

voice-assistant gemini-ai image-generation-ai multi-modal-ai pipecat-ai

Updated Nov 14, 2025
TypeScript

developtheweb / sheldon-ai-showcase

Star

Sheldon AI Assistant is a powerful and versatile Discord bot and web application that enhances user interaction and automates tasks within your Discord server and beyond. With advanced AI models, seamless integrations, a intuitive web interface, and a wide range of features, Sheldon is your go-to assistant for an engaging and productive experience.

nodejs webgl threejs typescript discord-bot artificial-intelligence discord-js conversational-ai ai-chatbot ai-assistant multi-modal-ai spacial-computing

Updated Aug 30, 2025

dheeraj966 / NEXT_GEN-AI-MODEL---REVOLUTION-AI

Star

Advanced AI architecture integrating multi-modal reasoning, dynamic token optimization, and self-reflective learning loops. Designed for high efficiency, deep contextual understanding, and adaptive general intelligence across vision, language, and logic tasks—pushing beyond conventional transformer limits.

agi multi-modal-ai adaptive-ai agentic-ai next-gen-ai trending-agi relevant-agi-2025 self-reflective-architectures benchmark-outperformer

Updated Oct 19, 2025

levrex / EHR-Clustering-RA

Star

Cast different EHR (electronic health record) layers to a shared latent space to identify patient subtypes

machine-learning-algorithms ehr cluster-analysis clustering-algorithm clinical-research clinical-data unsupervised-machine-learning ehr-phenotyping rheumatoid-arthritis multi-modal-ai

Updated Oct 28, 2025
Jupyter Notebook

fenilsonani / rag-document-qa

Star

Enterprise-grade RAG system featuring dual online/offline operation, multi-modal document processing, and advanced AI capabilities including knowledge graph construction and hybrid search for intelligent document analysis.

knowledge-graph streamlit hybrid-search document-intelligence langchain chromadb retrieval-augmented-generation enterprise-ai multi-modal-ai offline-ml

Updated Aug 6, 2025
Python

Md-Emon-Hasan / LangChain

Star

Powerful framework for building applications with Large Language Models (LLMs), enabling seamless integration with memory, agents, and external data sources.

Updated Feb 13, 2025
Jupyter Notebook

ShivamMishra1603 / video-xplore

Star

AI video analysis + web research in one tool. Upload videos, ask questions, get comprehensive insights with current web data.

multi-modal-ai agentic-ai google-gemini-api

Updated Aug 30, 2025
Python

nickcottrell / vrgb-kafka

Star

Color-based semantic routing for Apache Kafka - Tag events with RGB hex codes for flexible consumer-side filtering. Eliminates topic proliferation and enables dynamic routing without payload deserialization. Python reference implementation with validated 5x speedup over content-based routing.

python distributed-systems machine-learning kafka stream-processing apache-kafka real-time-processing message-broker event-routing multi-modal-ai

Updated Nov 15, 2025
Python

mwasifanwar / ChronoPredict

Star

Multi-modal system analyzing social media, news, art, and music to predict emerging cultural movements and artistic trends years before they mainstream.

creative-ai cultural-analytics social-dynamics trend-prediction multi-modal-ai multi-modal-ai-analysis

Updated Nov 11, 2025
Python

Aish-p / Text-Vision-Agent

Star

Text-Vision-Agent is an AI-powered assistant that generates images from text descriptions and provides detailed image descriptions. It combines image generation using FluxPipeline with vision-based language models like ChatOllama, enabling seamless text-to-image and image interpretation interactions.

generative-ai multi-modal-ai nlp-and-vision-integration chatollama fluxpipeline image-generation-and-description

Updated Feb 16, 2025
Python

delegatexai / practical-ai-agents

Star

A curated list of AI agents (open-source & proprietary) that solve real-world problems. Updated regularly!

open-source machine-learning automation artificial-intelligence ai-agents ai-agents-framework llm-agents multi-modal-ai practical-ai-guide ai-agents-directory

Updated Apr 28, 2025

Lipeka / Multi-modal-Recommendation-System

Star

A multi-modal recommender system that suggests books or music based on: Voice input, Audio song recognition, Typed queries, Real-time weather in your city

python deep-learning artificial-intelligence speech-recognition gradio rag large-language-models multi-modal-ai