-
Notifications
You must be signed in to change notification settings - Fork 696
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add sliding window attention to splitk gen kernel
cla signed
fb-exported
meta-exported
#5231
opened Dec 16, 2025 by
Aya-ZIbra
Loading…
Update FBGEMM versioning to 1.5.0
cla signed
fb-exported
meta-exported
module: rocm
#5230
opened Dec 16, 2025 by
q10
Loading…
fmha_bwd_convert fix
cla signed
fb-exported
meta-exported
#5229
opened Dec 16, 2025 by
Aya-ZIbra
Loading…
support object cache in ssd l2 cache and add more unit tests
cla signed
fb-exported
meta-exported
#5228
opened Dec 16, 2025 by
zhaojuanmao
Loading…
Re-organize integration scripts
cla signed
fb-exported
meta-exported
#5227
opened Dec 15, 2025 by
q10
Loading…
Add split-K heuristic for decode attention
cla signed
fb-exported
meta-exported
#5225
opened Dec 15, 2025 by
Aya-ZIbra
Loading…
Optimizing 4-bit dequant to FP32 on AArch64 using vectorized intrinsics in EmbeddingSpMDMAutovec
cla signed
#5224
opened Dec 15, 2025 by
marma01
Loading…
Upgrade GitHub Actions to latest versions
cla signed
#5223
opened Dec 13, 2025 by
salmanmkc
Loading…
Upgrade GitHub Actions for Node 24 compatibility
cla signed
module: rocm
#5222
opened Dec 13, 2025 by
salmanmkc
Loading…
Change to TORCH_CHECK_VALUE for sparse ops
cla signed
fb-exported
meta-exported
#5215
opened Dec 11, 2025 by
spcyppt
Loading…
Tune max segment length per cta in triton table batched embeddings, and expose the param via cli
cla signed
fb-exported
meta-exported
#5212
opened Dec 10, 2025 by
OmarPavel
Loading…
Update heuristic to support variant batch sizes
cla signed
fb-exported
meta-exported
#5211
opened Dec 10, 2025 by
zjing14
Loading…
Use H100 runners for OSS CI
cla signed
fb-exported
meta-exported
#5205
opened Dec 9, 2025 by
q10
Loading…
Modifying clear_all_staged_data to accomadate KV Tensor Deletion
cla signed
fb-exported
meta-exported
#5202
opened Dec 9, 2025 by
Raahul46
Loading…
creating delete_rocksdb_checkpoint_dir function under KV Tensor
cla signed
fb-exported
meta-exported
#5201
opened Dec 9, 2025 by
Raahul46
Loading…
Adding returnKVTensorMetaData flag to Staging Read Strategy
cla signed
fb-exported
meta-exported
#5200
opened Dec 9, 2025 by
Raahul46
Loading…
Fix jagged_to_padded_dense autograd
cla signed
fb-exported
meta-exported
#5191
opened Dec 8, 2025 by
yunjiangster
Loading…
Add warp parallelism to populate_bucketized_permute
cla signed
fb-exported
meta-exported
#5189
opened Dec 8, 2025 by
AlbertDachiChen
Loading…
[fbgemm_gpu] Test out building against non-a SM archs
cla signed
#5188
opened Dec 5, 2025 by
q10
Loading…
Compilation flag for pytorch
cla signed
fb-exported
meta-exported
#5187
opened Dec 5, 2025 by
dsjohns2
Loading…
Improve robustness: Add PackAMatrix warning and VBE metadata validation
cla signed
#5186
opened Dec 5, 2025 by
Jitterx69
Loading…
Add aarch64-specific EmbeddingSpMDM8Bit
cla signed
fb-exported
meta-exported
#5180
opened Dec 2, 2025 by
Nicoshev
Loading…
Add CUDA implementation for fb::masked_select_jagged_1d()
cla signed
fb-exported
meta-exported
#5179
opened Dec 2, 2025 by
mfkaplan
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-12-13.