💻✈️📷🍻🐱⚽️🥾🏕️⛰️

Organizations

@bitbearstudio

Starred repositories

🦍 Kong is a Jira CLI at terminal velocity

Go 10 Updated Jul 3, 2023

Workflow Engine for Kubernetes

Go 15,426 3,260 Updated Mar 11, 2025

AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine

HCL 281 217 Updated Mar 12, 2025

an R-Tree library for Go

Go 623 124 Updated Dec 20, 2024

Build Conversational AI in minutes ⚡️

TypeScript 8,821 1,181 Updated Mar 12, 2025

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

Rust 49,719 1,951 Updated Mar 12, 2025

The Kubernetes Package Manager

Go 27,573 7,202 Updated Mar 11, 2025

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 54,068 5,277 Updated Jan 21, 2025

Open source codebase powering the HuggingChat app

TypeScript 8,378 1,250 Updated Mar 12, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,826 233 Updated Mar 3, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,005 253 Updated Mar 6, 2025

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,839 221 Updated Sep 30, 2023

The Triton TensorRT-LLM Backend

Python 797 116 Updated Mar 11, 2025

Fast inference engine for Transformer models

C++ 3,661 329 Updated Feb 25, 2025

LLMPerf is a library for validating and benchmarking LLMs

Python 810 140 Updated Dec 9, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,686 1,147 Updated Mar 11, 2025

Django Channels HTTP/WebSocket server

Python 2,463 276 Updated Jan 22, 2025

An ASGI web server, for Python. 🦄

Python 8,958 771 Updated Mar 9, 2025

Declarative Continuous Deployment for Kubernetes

Go 18,890 5,797 Updated Mar 12, 2025

Progressive Delivery for Kubernetes

Go 2,909 921 Updated Mar 11, 2025

Netflix's Hystrix latency and fault tolerance library, for Go

Go 4,285 478 Updated Feb 24, 2024

The first real AI developer

Python 32,472 3,302 Updated Mar 4, 2025

✨ Textbase is a simple framework for building AI chatbots. ✨

Python 1,272 351 Updated Nov 27, 2023

Rich is a Python library for rich text and beautiful formatting in the terminal.

Python 51,188 1,798 Updated Dec 2, 2024

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.

Python 27,715 854 Updated Mar 12, 2025

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

Go 30,995 2,333 Updated Mar 12, 2025

Run your favourite LLMs locally on macOS from Swift

Swift 82 1 Updated Jun 8, 2023

Chat with your favourite LLaMA models in a native macOS app

Swift 1,495 60 Updated Jun 9, 2023

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 42,844 5,524 Updated Mar 10, 2025

French instruction-following and chat models

Jupyter Notebook 504 47 Updated Dec 5, 2024