Skip to content
View artemkaa's full-sized avatar

Block or report artemkaa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

ml

36 repositories

ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

Python 5,880 671 Updated Mar 16, 2025

Machine Learning Toolkit for Kubernetes

TypeScript 14,757 2,476 Updated Feb 20, 2025

Open source platform for the machine learning lifecycle

Python 19,797 4,398 Updated Mar 18, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 39,199 14,798 Updated Mar 17, 2025

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

HTML 4,486 837 Updated Mar 17, 2025

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 18,639 1,737 Updated Mar 17, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 42,085 5,755 Updated Mar 17, 2025

Data Platform demo

Python 12 1 Updated Feb 11, 2025

Easy token price estimates for 400+ LLMs. TokenOps.

Python 1,603 73 Updated Mar 4, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

Go 133,445 11,020 Updated Mar 18, 2025

ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing

74 5 Updated Aug 5, 2024

LLM inference in C/C++

C++ 76,738 11,112 Updated Mar 18, 2025

A Nix flake for many AI projects

Nix 709 76 Updated Feb 15, 2025

🪐 1-click Kubeflow using ArgoCD

Shell 64 14 Updated Aug 8, 2024

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 1,500 59 Updated May 13, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,746 6,299 Updated Mar 17, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,519 729 Updated Dec 17, 2024

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 5,741 456 Updated Nov 24, 2024

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer

TypeScript 28,078 1,645 Updated Mar 17, 2025

Best practices & guides on how to write distributed pytorch training code

Python 369 28 Updated Feb 24, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 19,850 2,320 Updated Mar 17, 2025

Fully open reproduction of DeepSeek-R1

Python 22,915 2,075 Updated Mar 16, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 12,040 1,275 Updated Mar 18, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,987 177 Updated Mar 13, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 78,275 9,380 Updated Jan 4, 2025

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 51,849 5,690 Updated Mar 18, 2025

Jupyter Interactive Notebook

Jupyter Notebook 12,112 5,149 Updated Mar 17, 2025

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 42,903 5,532 Updated Mar 17, 2025