Pull requests: pytorch/FBGEMM
Build and optimize BF16 grouped GEMM on blackwell
cla signed
fb-exported
#4353
opened Jun 15, 2025 by
jiawenliu64
Add FP32 support for routing_score dtype
cla signed
fb-exported
#4352
opened Jun 15, 2025 by
jianyuh
Migrate jagged tensor kernels to FBGEMM_LAUNCH_KERNEL, pt 2
cla signed
fb-exported
#4350
opened Jun 13, 2025 by
q10
Add CudaEvents Barrier before MemCpy V33
cla signed
fb-exported
#4348
opened Jun 13, 2025 by
Jason-KChen
Add make directory to filestore abstraction
cla signed
fb-exported
#4346
opened Jun 13, 2025 by
gchalump
[fbgemm_gpu] ROCm fixes for CI
ciflow/rocm
cla signed
module: rocm
#4345
opened Jun 13, 2025 by
q10
kvzch inference python operator
cla signed
fb-exported
#4344
opened Jun 13, 2025 by
chenyuzhcy
kv embedding inference cache wrapper
cla signed
fb-exported
#4343
opened Jun 13, 2025 by
chenyuzhcy
add ckpt and restore with feature evict metaheader
cla signed
#4342
opened Jun 13, 2025 by
lalala-2
Implement a stat library for fbgemm embedding
cla signed
fb-exported
#4339
opened Jun 13, 2025 by
Kaiweitu
Fix int_nbit inference int8 nobag kernel meta function
cla signed
fb-exported
#4333
opened Jun 12, 2025 by
spcyppt
Tune FP8 grouped GEMM for Llama4 shapes
cla signed
fb-exported
#4326
opened Jun 11, 2025 by
jiawenliu64
Fix output dtype issue in merge_pooled_embeddings when input tensors are all empty
cla signed
fb-exported
#4325
opened Jun 11, 2025 by
842974287
NVFP4 quantization emulation kernels as reference
cla signed
fb-exported
#4324
opened Jun 11, 2025 by
summerdengfb
Use local counter for TBE boundary check warnings to improve performance
cla signed
fb-exported
#4316
opened Jun 10, 2025 by
yoyoyocmu
Support prefetch pipeline in bounds_check_indices
cla signed
fb-exported
#4312
opened Jun 9, 2025 by
sryap
[fbgemm_gpu] TBE microbenchmark upgrades
cla signed
module: rocm
#4307
opened Jun 9, 2025 by
q10
tbe cpu nobag dispatch and backward pass kernel impl
cla signed
fb-exported
#4303
opened Jun 9, 2025 by
yabalaban
tbe cpu nobag dispatch and forward pass kernel impl
cla signed
fb-exported
#4302
opened Jun 9, 2025 by
yabalaban
put feature_evict definition in cpp file
cla signed
fb-exported
#4294
opened Jun 8, 2025 by
chenyuzhcy
put dram_kv_embedding_cache definition in cpp file
cla signed
fb-exported
#4293
opened Jun 8, 2025 by
chenyuzhcy