Pull requests: pytorch/FBGEMM
Pull requests list
Add external qparams parameters to dequantize_int4_cache API
cla signed
fb-exported
#4062
opened May 1, 2025 by
PatriceVignola
Allow merge_pooled_embedding take in device without index
cla signed
fb-exported
#4061
opened May 1, 2025 by
ZhengkaiZ
Integrate compat with Trunk and enable disabling BufferOps
cla signed
fb-exported
#4060
opened May 1, 2025 by
njriasan
fbgemm_gpu.experimental.gen_ai.moe.silu_mul_quant.
cla signed
fb-exported
#4059
opened Apr 30, 2025 by
levendlee
Add logic to stream weights in EmbeddingKVDB
cla signed
fb-exported
#4058
opened Apr 30, 2025 by
chouxi
Add enable_raw_embedding_streaming from TBE config to EmbeddingKVDB
cla signed
fb-exported
#4053
opened Apr 30, 2025 by
chouxi
Simplify weight row cache load and evict routines
cla signed
fb-exported
#4050
opened Apr 30, 2025 by
q10
Add more parameter specializations for autovec TBE kernels
cla signed
fb-exported
#4047
opened Apr 30, 2025 by
excelle08
Report TBE data configuration with EEG-based indices (squash stack from D73450767)
cla signed
fb-exported
#4046
opened Apr 30, 2025 by
gchalump
Fix int32_t to auto for code around WeightRow
cla signed
fb-exported
#4045
opened Apr 30, 2025 by
q10
integrate dramKV with kvtensorwrapper
cla signed
fb-exported
#4043
opened Apr 29, 2025 by
steven1327
torchrec support on kvzch emb lookup module
cla signed
fb-exported
#4035
opened Apr 28, 2025 by
duduyi2013
support zero collision tables in ssd operator
cla signed
fb-exported
#4033
opened Apr 28, 2025 by
duduyi2013
Enable NaN checks on tensor arguments to kernel launches
cla signed
fb-exported
#4029
opened Apr 26, 2025 by
q10
update hipify_torch submodule for version 2
cla signed
#4028
opened Apr 26, 2025 by
jeffdaily
Add keep_orig_idx_per_feature parameter to block_bucketize_sparse_features kernel
cla signed
fb-exported
#4027
opened Apr 25, 2025 by
emlin
Migrate make_pta_acc_format() away from old macros, v2
cla signed
fb-exported
#4026
opened Apr 25, 2025 by
q10
Clean up WeightRow in preparation for optimizer state offloading
cla signed
fb-exported
#4021
opened Apr 24, 2025 by
q10
fix build that excludes a bunch of features
cla signed
fb-exported
#4019
opened Apr 24, 2025 by
q10
Report TBE data configuration with EEG-based indices estimation
cla signed
fb-exported
#4018
opened Apr 24, 2025 by
gchalump
Gen modes: Remove -Wno-mismatched-tags
cla signed
fb-exported
#4011
opened Apr 23, 2025 by
q10
Back out "Add a workaround for stochastic rounding for AMD GPUs"
cla signed
fb-exported
module: rocm
#3977
opened Apr 17, 2025 by
ionuthristodorescu
Back out "Cleanups to StochasticRoundingRNGState"
cla signed
fb-exported
module: rocm
#3976
opened Apr 17, 2025 by
ionuthristodorescu