[SYCL][Doc] Add slm_per_subslice query for Xe GPUs #16376

Draft: wants to merge 1 commit into base `sycl`.
43 changes: 43 additions & 0 deletions sycl/doc/extensions/supported/sycl_ext_intel_device_info.md
@@ -19,6 +19,7 @@ The Feature Test Macro SYCL\_EXT\_INTEL\_DEVICE\_INFO will be defined as one of
| 4 | Free device memory query is supported |
| 5 | Device ID is supported |
| 6 | Memory clock rate and bus width queries are supported |
| 7 | SLM per subslice query is supported |



@@ -505,6 +506,48 @@ Then the memory bus width can be obtained using the standard get\_info() interfa
auto MemoryBusWidth = dev.get_info<ext::intel::info::device::memory_bus_width>();
}

# Intel GPU SLM per Subslice #

A new device descriptor will be added which will provide the amount of shared
local memory (SLM) per subslice on an Intel GPU.

**Note:** The amount of SLM per subslice may be greater than the amount of
memory available to a single work-group, and can be used to reason about
occupancy.


## Version ##

The extension supports this query in version 7 and later.


## Device Information Descriptors ##

| Device Descriptors | Return Type | Description |
| ------------------ | ----------- | ----------- |
| ext\:\:intel\:\:info\:\:device\:\:gpu\_slm\_per\_subslice | uint64\_t | Returns the amount of SLM per subslice, in bytes. |


## Aspects ##

A new aspect, ext\_intel\_gpu\_slm\_per\_subslice, will be added.


## Error Condition ##

Throws a synchronous `exception` with the `errc::feature_not_supported` error
code if the device does not have `aspect::ext_intel_gpu_slm_per_subslice`.

## Example Usage ##

The amount of SLM per subslice can be obtained using the standard get\_info()
interface.

    if (dev.has(aspect::ext_intel_gpu_slm_per_subslice)) {
      auto slmBytes = dev.get_info<ext::intel::info::device::gpu_slm_per_subslice>();
    }
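
The note above says the per-subslice figure can be used to reason about occupancy. A rough sketch of that arithmetic follows, assuming SLM is the only limiting resource; real occupancy also depends on other hardware limits such as thread and register availability. The helper name `maxWorkGroupsBySlm` is hypothetical and not part of the extension, and the byte counts in the compile-time check are illustrative values, not figures for any particular device.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper, not part of the extension: an upper bound on how many
// work-groups can be resident on one subslice at the same time, considering
// SLM alone. Other hardware limits may reduce the real number further.
constexpr std::uint64_t maxWorkGroupsBySlm(std::uint64_t slmPerSubsliceBytes,
                                           std::uint64_t slmPerWorkGroupBytes) {
  // A kernel that uses no SLM is not limited by SLM at all.
  return slmPerWorkGroupBytes == 0
             ? std::numeric_limits<std::uint64_t>::max()
             : slmPerSubsliceBytes / slmPerWorkGroupBytes;
}

// Illustrative values (assumed): 128 KiB of SLM per subslice and a kernel
// that allocates 48 KiB per work-group leave room for two work-groups.
static_assert(maxWorkGroupsBySlm(128 * 1024, 48 * 1024) == 2,
              "two 48 KiB work-groups fit in 128 KiB");
```

In a SYCL program, the first argument would be the value returned by the `gpu_slm_per_subslice` query, guarded by the aspect check as in the usage example above. A result of 0 means a single work-group asks for more SLM than the subslice provides.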


# Deprecated queries #

The table below lists deprecated queries that will soon be removed, along with their replacements: