count_nonzero

## 🚀 Feature
An efficient implementation for counting nonzero elements

## Pitch

In some situations (e.g. MaskRCNN) you don't need the exact positions of the nonzero elements that `torch.nonzero` returns, only how many there are, and this count is computed quite frequently. So far, every workaround I tried is faster than retrieving the indices of the elements and taking their length.

Some may want a differentiable count of these values, which effectively rules out the current `nonzero` method.

Related links:
It was previously mentioned on Discuss ([1](https://discuss.pytorch.org/t/count-nonzeros-element-along-an-axis/19462), [2](https://discuss.pytorch.org/t/find-non-zero-elements-in-a-tensor/4493/5)) and in #14848 and #15190.

## Alternatives

```python
import torch
x = torch.randint(2, [20, 1080, 1920]) # e.g. 20 binary mask maps in int64

[len(torch.nonzero(x[i])) for i in range(len(x))]  # 311 ms ± 11.5 ms per loop (mean ± std. dev. of 10 runs, 10 loops each)
torch.sum(x != 0, dim=[1,2])                       # 136 ms ± 13.6 ms per loop (mean ± std. dev. of 10 runs, 10 loops each)
torch.sum(x.clamp(0, 1), dim=[1,2])                # 49.9 ms ± 5.34 ms per loop (mean ± std. dev. of 10 runs, 10 loops each)
```
On a 1080Ti, these times are `10.9 ms, 3.54 ms, 2.68 ms` respectively (with `torch.cuda.synchronize` called before and after each operation).
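As a sanity check on the workarounds above, here is a minimal sketch that confirms all three give the same per-mask counts on a small binary tensor. The tensor shapes and the names `via_nonzero`, `via_compare`, `via_clamp` and `y` are only illustrative. It also shows a caveat of the `clamp` variant: it matches the true count only when the values are non-negative integers.

```python
import torch

# Small stand-in for the binary masks above (shape is illustrative).
x = torch.randint(2, [4, 8, 8])

# The three workarounds, computed per mask.
via_nonzero = torch.tensor([len(torch.nonzero(x[i])) for i in range(len(x))])
via_compare = torch.sum(x != 0, dim=[1, 2])
via_clamp = torch.sum(x.clamp(0, 1), dim=[1, 2])

# For 0/1 masks all three agree.
assert torch.equal(via_nonzero, via_compare)
assert torch.equal(via_nonzero, via_clamp)

# Caveat: clamp(0, 1) maps negative values to 0, so they are not counted.
y = torch.tensor([[-3, 0, 2, 5]])
count_compare = torch.sum(y != 0, dim=1)        # 3 nonzero entries
count_clamp = torch.sum(y.clamp(0, 1), dim=1)   # 2, the -3 is dropped
print(count_compare.item(), count_clamp.item())
```

So the `clamp` version is only a drop-in replacement for mask-like inputs; for general tensors the `x != 0` comparison is the safer workaround.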

## Additional context

A few other non-trivial things that came up while I was looking for the fastest way:

1. `torch.clamp_max()` and `torch.clamp_min()` are about 5x slower than `torch.clamp()` (times on `x`: 261 ms ± 78 ms, 202 ms ± 17.8 ms, and 47.1 ms ± 437 µs, respectively).
2. There's no significant difference if I use `uint8` or `int64` dtype
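For anyone who wants to re-check observations like these, below is a rough timing sketch. `time_op` is a hypothetical helper, not part of PyTorch, and the tensor is much smaller than the one used above, so the absolute numbers will not match the ones reported here. The `torch.cuda.synchronize` calls only matter when the tensors live on a GPU; they are harmless for the CPU tensor used in this sketch.

```python
import time
import torch

def time_op(fn, repeats=10):
    """Return the mean wall-clock time of fn() in milliseconds (hypothetical helper)."""
    use_cuda = torch.cuda.is_available()
    fn()  # warm-up call so one-time setup costs are not measured
    if use_cuda:
        torch.cuda.synchronize()  # wait for pending GPU work before starting the clock
    start = time.perf_counter()
    for _ in range(repeats):
        fn()
    if use_cuda:
        torch.cuda.synchronize()  # wait for the timed GPU work to finish
    return (time.perf_counter() - start) / repeats * 1e3

# Illustrative input: far smaller than the [20, 1080, 1920] tensor in the issue.
x = torch.randint(2, [2, 108, 192])

# Compare the same workaround on uint8 and int64 inputs.
for dtype in (torch.uint8, torch.int64):
    xd = x.to(dtype)
    ms = time_op(lambda: torch.sum(xd != 0, dim=[1, 2]))
    print(dtype, f"{ms:.3f} ms")
```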


Thanks @gchanan for asking to report this issue, hope this will help others.

cc @VitalyFedyunin @ngimel @mruberry
