Pull requests: pytorch/FBGEMM
Fix specialization issue in keyed_jagged_index_select_dim1_forward_cuda
cla signed
fb-exported
#3578
opened Jan 16, 2025 by
Microve
Enable fast FP8 GEMM for memory bound
cla signed
fb-exported
#3577
opened Jan 16, 2025 by
jiawenliu64
more fp8 tuning for decode and not need to pad
cla signed
fb-exported
#3576
opened Jan 15, 2025 by
mxz297
Updates and fixes to tensor_accessor.h
cla signed
fb-exported
module: rocm
#3571
opened Jan 14, 2025 by
q10
Unifying TBE API using List (Backend)
cla signed
fb-exported
#3563
opened Jan 11, 2025 by
spcyppt
Refactor FP8 grouped GEMM with dynamic and static versions
cla signed
fb-exported
#3561
opened Jan 10, 2025 by
jiawenliu64
Support FP8 grouped GEMM with rowwise scaling
cla signed
fb-exported
#3560
opened Jan 10, 2025 by
jiawenliu64
Add support for int32_t indices in TBE training (2I/N)
cla signed
fb-exported
module: rocm
#3556
opened Jan 7, 2025 by
q10
Switch dynamic FP8 grouped gemm to accept tensor inputs
cla signed
fb-exported
#3552
opened Jan 6, 2025 by
jwfromm
Add support for int32_t indices in TBE training (2H/N)
cla signed
fb-exported
module: rocm
#3539
opened Jan 3, 2025 by
q10
env variable to select rounding mode
cla signed
fb-exported
#3515
opened Dec 19, 2024 by
hhyuanf
Back out "Manual loop unroll for rocm inference"
ciflow/rocm
cla signed
fb-exported
module: rocm
#3506
opened Dec 15, 2024 by
brad-mengchi
migrate "jagged_flash_attention"
cla signed
fb-exported
#3490
opened Dec 10, 2024 by
brad-mengchi
Optimized backward pass for ROCm devices (#3367)
ciflow/rocm
cla signed
fb-exported
module: rocm
#3468
opened Dec 6, 2024 by
q10
Use GEMM kernel for KleidiAI to accelerate FP16Benchmark
cla signed
#3440
opened Dec 3, 2024 by
milpuz01
Make check_feature_gate_key PT2 compatible
cla signed
fb-exported
#3426
opened Nov 30, 2024 by
sryap
Make check_feature_gate_key PT2 compatible
cla signed
fb-exported
#3425
opened Nov 30, 2024 by
sryap