- Zagreb, Croatia
- in/nenad-mandi%C4%87-96a5a03
Stars
🤗 smolagents: a barebones library for agents. Agents write Python code to call tools and orchestrate other agents.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
A comprehensive list of YouTube channels and other resources.
SynthLang is a hyper-efficient prompt language designed to optimize interactions with Large Language Models (LLMs) like GPT-4o by leveraging logographical scripts and symbolic constructs.
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
A generative world for general-purpose robotics & embodied AI learning.
Automatic SQL injection and database takeover tool
Official ComfyUI repository of Hellomeme
Python tool for converting files and office documents to Markdown.
Memory-Guided Diffusion for Expressive Talking Video Generation
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
PalisadeResearch / intercode
Forked from princeton-nlp/intercode
https://arxiv.org/abs/2412.02776
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
A course on aligning smol models.
Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models
Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs
You like pytorch? You like micrograd? You love tinygrad! ❤️
Simplest POC talking with GPT (Whisper - GPT - ElevenAI)
A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local LLM via llama.cpp or OpenAI API. Includes clipboard integrat…
A minimal and universal controller for FLUX.1.
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation