Skip to content
View stat-eklee's full-sized avatar
🏢
Working from Company
🏢
Working from Company

Block or report stat-eklee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train transformer language models with reinforcement learning.

Python 12,383 1,670 Updated Mar 7, 2025

LIMO: Less is More for Reasoning

Python 825 36 Updated Feb 24, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,825 5,090 Updated Jan 22, 2025
Python 2 1 Updated Dec 12, 2024

Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook 137 22 Updated Sep 21, 2024

Clinical text summarization by adapting large language models

Python 134 28 Updated Jul 31, 2024

The official Meta Llama 3 GitHub site

Python 28,478 3,308 Updated Jan 26, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

3,788 231 Updated Feb 19, 2025

Federated Optimization in Heterogeneous Networks (MLSys '20)

Python 671 159 Updated Mar 24, 2023

자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가

Python 31 3 Updated May 31, 2024

A framework for few-shot evaluation of language models.

Python 8,189 2,181 Updated Mar 10, 2025

Flower: A Friendly Federated AI Framework

Python 5,528 949 Updated Mar 10, 2025

This repository contains two datasets with multi-turn adversarial conversations generated by human agents interacting with a dialog model and rated for safety by two corresponding diverse rater pools.

25 4 Updated Jul 16, 2024

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,365 71 Updated Apr 11, 2024
Python 217 19 Updated Jun 11, 2024

[NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.

Jupyter Notebook 22 1 Updated Jul 26, 2023

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 457 28 Updated Feb 5, 2025

✏️ 기술 면접 스터디 Cheat Sheet

216 18 Updated Nov 24, 2023

☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM

581 72 Updated May 1, 2024

Transformer related optimization, including BERT, GPT

C++ 6,074 900 Updated Mar 27, 2024

Korean BART

Python 454 96 Updated Oct 3, 2024

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Python 446 62 Updated Sep 6, 2023

Assignments for CS294-112.

Python 1,583 1,042 Updated Mar 24, 2023

For experiments involving instruct gpt. Currently used for documenting open research questions.

71 4 Updated Nov 8, 2022

🙋 핵심을 질문하다. 그리고 용감하게 대답하다. 국내 IT기업부터 실리콘밸리까지 "현직자가 해설해주는 기술면접"

4,134 308 Updated Mar 24, 2023
1 Updated Nov 18, 2021

🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋

478 45 Updated Nov 7, 2022

파이썬 알고리즘 / 코딩테스트 스터디

Python 5 8 Updated Jun 8, 2022
Next