@FasterDecoding

FasterDecoding

Think deeper, decode faster

Pinned

  1. Medusa (Public)

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    Jupyter Notebook · 2.5k stars · 175 forks

Repositories

Showing 5 of 5 repositories
