Clarify semantics of `urKernelSuggestMaxCooperativeGroupCountExp`

CC @0x12CC @nrspruit 

In the discussion at https://github.com/oneapi-src/unified-runtime/pull/1246#issuecomment-1894446658, `urKernelSuggestMaxCooperativeGroupCountExp` was described as mapping to `cudaOccupancyMaxActiveBlocksPerMultiprocessor`, which takes a kernel and other parameters and returns the maximum number of blocks that can execute simultaneously on a single streaming multiprocessor (SM).

However, I found this in the Level Zero documentation:

"Use [zeKernelSuggestMaxCooperativeGroupCount](https://spec.oneapi.io/level-zero/latest/core/api.html#ze__api_8h_1af0b050a6cc08132ef84a8618942ce125) to recommend max group count for device for cooperative functions that device supports."

The word "device" implies that `urKernelSuggestMaxCooperativeGroupCountExp` returns the maximum number of blocks that can execute simultaneously on a whole device. A device consists of multiple SMs, so in that case the maximum number of blocks that can execute simultaneously on one SM must be multiplied by the number of SMs in the device.
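To make the per-SM versus per-device distinction concrete, here is a minimal sketch of the arithmetic. The helper name `maxCooperativeGroupsPerDevice` is hypothetical and is not part of Unified Runtime or CUDA. It assumes the per-SM count comes from `cudaOccupancyMaxActiveBlocksPerMultiprocessor` and the SM count from querying `cudaDevAttrMultiProcessorCount` on the target device.

```cpp
#include <cstdint>

// Hypothetical helper, for illustration only.
// blocksPerSM: assumed to be the value produced by
//              cudaOccupancyMaxActiveBlocksPerMultiprocessor for the kernel.
// smCount:     assumed to be the device's cudaDevAttrMultiProcessorCount,
//              which requires knowing the device the kernel will run on.
constexpr std::uint32_t maxCooperativeGroupsPerDevice(std::uint32_t blocksPerSM,
                                                      std::uint32_t smCount) {
    // Per-device limit = per-SM limit times the number of SMs.
    return blocksPerSM * smCount;
}
```

For example, a kernel limited to 2 resident blocks per SM on a device with 80 SMs would get `maxCooperativeGroupsPerDevice(2, 80)`, i.e. 160 groups under the per-device reading, versus 2 under the per-SM reading.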

The number of SMs can only be retrieved by querying the device the kernel will run on. That device is not passed to `urKernelSuggestMaxCooperativeGroupCountExp`, nor can it be inferred from any of the other parameters.
Therefore, there are two possibilities:

- If the intended semantics are the max number of blocks per device, the interface needs to be changed.
- If the intended semantics are the max number of blocks per SM, the documentation should be clarified IMO.

