Skip to content
View hankyul2's full-sized avatar
🐛
Bugging since 2019/04/30
🐛
Bugging since 2019/04/30
  • ajou university
  • suwon, KR
  • 08:27 (UTC +09:00)

Highlights

  • Pro

Organizations

@Algostu @SWCapstone2021

Block or report hankyul2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

Large-Language-Model
9 repositories

The official PyTorch implementation of Google's Gemma models

Python 5,325 516 Updated Jul 31, 2024

Cramming the training of a (BERT-type) language model into limited compute.

Python 1,304 99 Updated Jun 13, 2024

Code for the accepted CVPR 2023 workshop paper.

Python 1 Updated Apr 15, 2023

Mamba SSM architecture

Python 13,689 1,173 Updated Dec 6, 2024

Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

Assembly 543 44 Updated Dec 28, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,737 3,259 Updated Aug 12, 2024

Ongoing research training transformer models at scale

Python 11,009 2,459 Updated Jan 5, 2025

Distributed preprocessing and data loading for language datasets

Python 39 10 Updated Apr 10, 2024
Python 6 2 Updated Jun 17, 2024