Skip to content

Attention Sinks Perf Boost #78

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 25 commits into from
Aug 9, 2025
Merged

Conversation

LucasWilkinson
Copy link
Collaborator

@LucasWilkinson LucasWilkinson commented Aug 7, 2025

Shout-out to @jayhshah (the performance wizard 🪄) for the implementation

PR

reasoning-effort: low
model: openai/gpt-oss-20b

Results (n=5)
mean_chars: 53.7662878788
stderr_chars: 1.6426099883
mean_score: 0.5571969697
stderr_score: 0.0072987006


Main

reasoning-effort: low
model: openai/gpt-oss-20b

Results (n=5)
mean_chars: 53.2223484848
stderr_chars: 1.8772311021
mean_score: 0.5637626263
stderr_score: 0.0020261119

@jayhshah jayhshah force-pushed the lwilkinson/aux-fast-api-clean branch from 994e966 to deb7484 Compare August 8, 2025 00:46
@LucasWilkinson LucasWilkinson force-pushed the lwilkinson/aux-fast-api-clean branch from deb7484 to e1f506c Compare August 8, 2025 01:07
jayhshah and others added 22 commits August 8, 2025 01:08
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
…in cmakelists

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
@LucasWilkinson LucasWilkinson force-pushed the lwilkinson/aux-fast-api-clean branch from e1f506c to 4bca7a3 Compare August 8, 2025 01:08
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
@LucasWilkinson LucasWilkinson changed the title [WIP] Attention Sinks Perf Boost Attention Sinks Perf Boost Aug 9, 2025
@LucasWilkinson LucasWilkinson marked this pull request as ready for review August 9, 2025 03:58
…-api-clean

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
@LucasWilkinson LucasWilkinson merged commit 2d3b750 into main Aug 9, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants