Optimize `_length_per_key_from_stride_per_key` with segment sum csr when appropriate (pytorch#1699)

Summary:
Pull Request resolved: pytorch#1699

`segment_sum_csr` outperforms the torch split/cat ops under certain conditions. However, there is performance degradation when:
1. the number of segments is small
2. there are many elements in each segment to sum

Reviewed By: AlbertDachiChen

Differential Revision: D53020207

fbshipit-source-id: 3068479af2068922e4d9e393908c2a90982eb43f
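
For illustration only, here is a minimal pure-Python sketch of the two equivalent ways to reduce per-stride lengths into one total per key. It is not the code from this commit: the function names `lengths_per_key_split` and `lengths_per_key_csr` are hypothetical, plain lists stand in for tensors, and the real `segment_sum_csr` kernel is not called. The sketch only shows that both strategies produce the same result, so the choice between them is purely a performance decision.

```python
from itertools import accumulate
from typing import List


def lengths_per_key_split(lengths: List[int], stride_per_key: List[int]) -> List[int]:
    """Split-style baseline: cut `lengths` into one chunk per key, sum each chunk."""
    totals = []
    start = 0
    for stride in stride_per_key:
        totals.append(sum(lengths[start:start + stride]))
        start += stride
    return totals


def lengths_per_key_csr(lengths: List[int], stride_per_key: List[int]) -> List[int]:
    """CSR-style: build segment offsets once, then reduce each offset range."""
    # offsets[i] .. offsets[i + 1] delimits the values that belong to key i.
    offsets = [0, *accumulate(stride_per_key)]
    return [
        sum(lengths[offsets[i]:offsets[i + 1]])
        for i in range(len(stride_per_key))
    ]


# Three keys with strides 2, 1 and 3 over six per-stride lengths.
example_lengths = [1, 2, 3, 4, 5, 6]
example_strides = [2, 1, 3]
print(lengths_per_key_split(example_lengths, example_strides))  # [3, 3, 15]
print(lengths_per_key_csr(example_lengths, example_strides))    # [3, 3, 15]
```

The commit message does not state the thresholds at which the segment-sum path is used, so none are shown here.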