FlaxGPT is a minimal Flax implementation of a GPT (decoder-only transformer) model. The entire code fits in a single notebook, which makes it well suited for hacking and educational purposes.
Open the main flax_gpt Colab and start hacking.
- Implement the GPT model using Flax (a minimal decoder-block sketch follows this list)
- Load / convert the LLaMA2-7B checkpoint for prediction
- Implement a KV cache for prediction (see the KV-cache sketch after this list)
- Pretraining (example)
- Finetuning (example)
- LoRA finetuning
- Quantization
- Distributed training (TPUs)
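To give a flavor of what the notebook covers, here is a minimal sketch of a causal decoder block in `flax.linen`. The module and hyperparameter names (`DecoderBlock`, `d_model`, `n_heads`) are illustrative assumptions, not necessarily the notebook's actual API:

```python
# Minimal sketch of a pre-norm causal decoder block in Flax.
# Names and defaults here are hypothetical, not FlaxGPT's exact API.
import jax
import jax.numpy as jnp
import flax.linen as nn


class DecoderBlock(nn.Module):
    d_model: int = 256   # embedding width (illustrative default)
    n_heads: int = 4     # attention heads (illustrative default)

    @nn.compact
    def __call__(self, x):
        # x: (batch, seq_len, d_model)
        seq_len = x.shape[1]
        # Causal mask: position i may only attend to positions <= i.
        mask = nn.make_causal_mask(jnp.ones((x.shape[0], seq_len)))
        # Pre-norm self-attention with a residual connection.
        h = nn.LayerNorm()(x)
        h = nn.MultiHeadDotProductAttention(num_heads=self.n_heads)(h, h, mask=mask)
        x = x + h
        # Pre-norm MLP with a residual connection.
        h = nn.LayerNorm()(x)
        h = nn.Dense(4 * self.d_model)(h)
        h = nn.gelu(h)
        h = nn.Dense(self.d_model)(h)
        return x + h


# Usage: initialize parameters and run a forward pass.
block = DecoderBlock()
x = jnp.zeros((1, 16, 256))
params = block.init(jax.random.PRNGKey(0), x)
y = block.apply(params, x)  # (1, 16, 256)
```

A full GPT stacks several such blocks between a token/position embedding and a final projection to vocabulary logits.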
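The KV-cache item above refers to the standard trick for fast autoregressive decoding: keys and values of already-generated tokens are stored so each step only computes attention for the newest token. Below is a minimal functional sketch with preallocated buffers; the helper names (`init_cache`, `update_cache`) are hypothetical, not the notebook's actual API:

```python
# Minimal sketch of a KV cache with fixed-size preallocated buffers.
# Helper names are hypothetical, not FlaxGPT's exact API.
import jax.numpy as jnp


def init_cache(batch, max_len, n_heads, head_dim):
    # Preallocated key/value buffers plus the next write position.
    shape = (batch, max_len, n_heads, head_dim)
    return {"k": jnp.zeros(shape), "v": jnp.zeros(shape), "pos": 0}


def update_cache(cache, k_new, v_new):
    # k_new, v_new: (batch, n_heads, head_dim) for the newest token.
    pos = cache["pos"]
    return {
        "k": cache["k"].at[:, pos].set(k_new),
        "v": cache["v"].at[:, pos].set(v_new),
        "pos": pos + 1,
    }
```

At decode step `pos`, attention for the new token is computed against `cache["k"][:, :pos + 1]` and `cache["v"][:, :pos + 1]` only. Under `jit`, where `pos` is traced, one would typically keep the full buffers and mask out unwritten positions instead of slicing.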
Here are some tutorials on how I implemented GPT from scratch.
- GPT From Scratch Using Flax explains how I created FlaxGPT, step by step.
- GPT From Scratch Using Jax: check this one out if you prefer a more "hardcore" implementation that uses only low-level JAX.