Pull requests: andrewkchan/deepseek.cpp
add --bsize arg to convert.py to allow configuring f8e5m2 block size
#16
by andrewkchan
was merged May 26, 2025
Refactor weights from raw pointers to tensor structs
#12
by andrewkchan
was merged May 15, 2025
Add multi-latent attention, profiling instrumentation, other perf fixes
#8
by andrewkchan
was merged May 2, 2025
Add deepseek v3 support and blockwise quantization
#1
by andrewkchan
was merged Feb 6, 2025