Stars
RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.
This repository contains a 90-day cybersecurity study plan, along with resources and materials for learning various cybersecurity concepts and technologies. The plan is organized into daily tasks, …
This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” (ModelBest's "Little Steel Cannon") focuses on achieving exceptional performance on the edge.
LoMRF is an open-source implementation of Markov Logic Networks
Demo code and other handouts for students of our FastAPI Web Apps course.
Reference implementation for DPO (Direct Preference Optimization)
An open platform for enhancing the capability of LLMs in workflow orchestration.
Python tool for converting files and office documents to Markdown.
A generative world for general-purpose robotics & embodied AI learning.
End-to-end Generative Optimization for AI Agents
A curated list of awesome discrete diffusion model resources.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
Anthropic's educational courses
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models
Analyzes system log messages by constructing a DAG with the PC algorithm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
The Fastest State-of-the-Art Static Embeddings in the World
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Robust recipes to align language models with human and AI preferences
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) o…