  • AI Singapore
  • Singapore


Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 5,234 562 Updated Feb 1, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,178 405 Updated Jan 30, 2025
Python 335 26 Updated Jan 27, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 762 54 Updated Jan 24, 2025
Python 100 10 Updated Jan 23, 2025

Sky-T1: Train your own O1 preview model within $450

Python 2,311 251 Updated Jan 26, 2025
Jupyter Notebook 147 9 Updated Dec 2, 2024

"Every Author as First Author" paper from SIGTBD 2023, about superimposing author names in a stack

TeX 110 Updated Jan 4, 2025

Synthetic Data curation for post-training and structured data extraction

Python 615 45 Updated Feb 1, 2025

Concurrent Python made simple

Python 971 18 Updated Jan 28, 2025

LOTUS: A semantic query engine for fast and easy LLM-powered data processing

Python 1,002 81 Updated Jan 29, 2025

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Go 28,162 2,944 Updated Jan 30, 2025

Grokking Deep Reinforcement Learning

Jupyter Notebook 854 238 Updated Feb 4, 2022

Python library for audio and music analysis

Python 7,339 972 Updated Jan 15, 2025

TUFS Asian Language Parallel Corpus

TeX 50 13 Updated May 1, 2023

AI Observability & Evaluation

Jupyter Notebook 4,605 340 Updated Feb 1, 2025

Scrape papers from OpenReview using OpenReview API

Python 30 7 Updated Aug 14, 2023

🙌 OpenHands: Code Less, Make More

Python 45,024 4,984 Updated Feb 1, 2025

Tracking entropy flow in the world of tokens

Python 7 Updated Nov 30, 2024
Cuda 57 5 Updated Dec 27, 2024

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 674 52 Updated Jan 24, 2025

Code repository for the paper - "Matryoshka Representation Learning"

Jupyter Notebook 449 22 Updated Feb 19, 2024

State-of-the-art LLM-based translation models.

Ruby 479 38 Updated Jan 24, 2025

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 472 42 Updated Jan 31, 2025

Soft-Transformers For Continual Learning

2 Updated Dec 4, 2024

A high-quality, one-stop open-source data extraction tool that converts PDF to Markdown and JSON.

Python 25,104 1,891 Updated Jan 27, 2025

The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.

Jupyter Notebook 56 4 Updated Jan 25, 2025
Python 5 1 Updated May 5, 2024