Simplify copy and cast kernels #1165

oleksandr-pavlyk · 2023-04-08T23:24:19Z

This PR reworks the copy-and-cast functionality by simplifying the underlying functors and removing the specialized implementation for 2D arrays based on CIndexer_array, since it is only marginally faster than the general strided implementation based on CIndexer_vector.

This PR also adds a special implementation for copy-with-casting of contiguous arrays, which makes type casting of contiguous arrays several times faster.
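To illustrate why a dedicated contiguous path can be cheaper than the general strided one, here is a minimal pure-Python sketch. It is not the SYCL kernel code from this PR; the names `strided_offset`, `copy_and_cast_strided` and `copy_and_cast_contig` are hypothetical and only model the indexing work involved.

```python
from typing import Callable, MutableSequence, Sequence


def strided_offset(flat_index: int, shape: Sequence[int], strides: Sequence[int]) -> int:
    """Unravel a C-order flat index and return the element displacement.

    Hypothetical helper: models the per-element work of a general strided
    indexer (one divmod per dimension).
    """
    offset = 0
    for dim, stride in zip(reversed(shape), reversed(strides)):
        flat_index, i = divmod(flat_index, dim)
        offset += i * stride
    return offset


def copy_and_cast_strided(
    src: Sequence,
    src_strides: Sequence[int],
    dst: MutableSequence,
    dst_strides: Sequence[int],
    shape: Sequence[int],
    cast: Callable,
) -> None:
    """General path: both offsets are recomputed from the multi-index."""
    n = 1
    for dim in shape:
        n *= dim
    for k in range(n):
        src_off = strided_offset(k, shape, src_strides)
        dst_off = strided_offset(k, shape, dst_strides)
        dst[dst_off] = cast(src[src_off])


def copy_and_cast_contig(src: Sequence, dst: MutableSequence, cast: Callable) -> None:
    """Contiguous path: the flat index is the offset, no unraveling needed."""
    for k in range(len(src)):
        dst[k] = cast(src[k])


# Usage: cast a C-contiguous 2x3 int array into a column-major float buffer,
# then do the same cast between two contiguous buffers.
src = list(range(6))
dst_f_order = [0.0] * 6
copy_and_cast_strided(src, (3, 1), dst_f_order, (1, 2), (2, 3), float)
dst_contig = [0.0] * 6
copy_and_cast_contig(src, dst_contig, float)
```

In the sketch the contiguous path skips the per-dimension `divmod` work entirely, which is the kind of saving the contiguous implementation is after.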

Have you provided a meaningful PR description?
Have you added a test, reproducer or referred to an issue with a reproducer?
Have you tested your changes locally for CPU and GPU devices?
Have you made sure that new changes do not introduce compiler warnings?
Have you checked performance impact of proposed changes?
If this PR is a work in progress, are you opening the PR as a draft?


Example where it helps:

```
In [1]: import dpctl, dpctl.tensor as dpt

In [2]: x = dpt.arange(1234*7873, dtype=dpt.int32)

In [3]: xx = dpt.permute_dims(dpt.reshape(x, (2, 617, 7873)), (1,2,0))

In [4]: yy = dpt.permute_dims(dpt.reshape(dpt.empty_like(x, dtype="f4"), (2, 617, 7873)), (1,2,0))

In [5]: %timeit yy[...] = xx
1.07 ms ± 93.8 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)
```

In master the time is about 2.8 ms on Iris Xe.

coveralls · 2023-04-08T23:39:20Z

Coverage: 83.224%. Remained the same when pulling 6933675 on simplify-copy-and-cast-kernels into 8f2ba46 on master.

github-actions · 2023-04-08T23:40:20Z

View rendered docs @ https://intelpython.github.io/dpctl/pulls/1165/index.html

github-actions · 2023-04-09T00:05:31Z

Array API standard conformance tests for dpctl=0.14.3dev0=py310h76be34b_98 ran successfully.
Passed: 46
Failed: 788
Skipped: 282

github-actions · 2023-04-10T23:15:08Z

Array API standard conformance tests for dpctl=0.14.3dev0=py310h76be34b_105 ran successfully.
Passed: 47
Failed: 787
Skipped: 282

oleksandr-pavlyk · 2023-04-11T00:47:57Z

@ndgrigorian Good catch!

ndgrigorian

LGTM

github-actions · 2023-04-11T11:33:55Z

Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. 🤞

github-actions · 2023-04-11T12:10:49Z

Array API standard conformance tests for dpctl=0.14.3dev0=py310h76be34b_105 ran successfully.
Passed: 47
Failed: 787
Skipped: 282

oleksandr-pavlyk and others added 4 commits April 6, 2023 07:07

Simplified copy-and-cast kernels

bb39d3c

Functors should store typed pointers instead of typeless ones. The CastFnT effectively becomes a trivial call to convert_impl in its call operator. Also added a few data movement optimizations.
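The typed-versus-typeless distinction can be modelled in Python with the standard `array` and `struct` modules. This is only an analogy to the C++ change, under the assumption that "typeless" means raw byte pointers reinterpreted at each access; `convert_impl`, `CastFn`, `copy_typeless` and `copy_typed` below are hypothetical stand-ins, not the dpctl sources.

```python
import struct
from array import array


def convert_impl(value, dst_typecode: str):
    """Hypothetical stand-in for convert_impl: convert a single scalar."""
    return float(value) if dst_typecode in ("f", "d") else int(value)


class CastFn:
    """Functor whose call operator only forwards to convert_impl."""

    def __init__(self, dst_typecode: str) -> None:
        self.dst_typecode = dst_typecode

    def __call__(self, value):
        return convert_impl(value, self.dst_typecode)


def copy_typeless(src_bytes: bytes, dst_bytes: bytearray, n: int, cast) -> None:
    """Typeless style: raw bytes, each access scales the index by the
    item size and reinterprets the bytes (int32 -> float32 here)."""
    src_size = struct.calcsize("i")
    dst_size = struct.calcsize("f")
    for k in range(n):
        (value,) = struct.unpack_from("i", src_bytes, k * src_size)
        struct.pack_into("f", dst_bytes, k * dst_size, cast(value))


def copy_typed(src: array, dst: array, cast) -> None:
    """Typed style: the containers know their element type, so the loop
    body is just an indexed load, a cast, and an indexed store."""
    for k in range(len(src)):
        dst[k] = cast(src[k])


# Usage: both styles produce the same float32 values.
src = array("i", [1, -2, 3])
cast = CastFn("f")

dst_typed = array("f", [0.0] * len(src))
copy_typed(src, dst_typed, cast)

raw = bytearray(len(src) * struct.calcsize("f"))
copy_typeless(src.tobytes(), raw, len(src), cast)
dst_raw = array("f")
dst_raw.frombytes(bytes(raw))
```

With typed storage the byte-offset arithmetic and reinterpretation disappear from the loop body, which is the simplification this commit message describes.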

Remove nd==2 special casing, as experiments show it does not bring pe…

6f7807a

…rformance gains

Copy and cast kernel for contiguous data added

2669110

oleksandr-pavlyk requested a review from ndgrigorian April 8, 2023 23:24

Fixed type dispatching for contiguous copy and cast

6933675

- CopyAndCastContigFactory changed to reflect that contiguous copying and casting is now possible for more than strictly different data types
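
One way to picture the dispatching this commit touches is a two-dimensional table indexed by source and destination type, filled in by a factory. The sketch below is a Python analogy under the assumption that the fix lets the factory return an implementation for every type pair, including identical types; `TYPE_CODES`, `contig_factory`, `dispatch_table` and `copy_and_cast` are hypothetical names, not dpctl's.

```python
from array import array

# Hypothetical subset of supported element types (array module typecodes).
TYPE_CODES = ("b", "i", "f", "d")


def contig_factory(src_code: str, dst_code: str):
    """Hypothetical factory: build a contiguous copy-and-cast function
    for one (source type, destination type) pair, same-type pairs included."""
    to_dst = float if dst_code in ("f", "d") else int

    def impl(src: array, dst: array) -> None:
        for k in range(len(src)):
            dst[k] = to_dst(src[k])

    return impl


# Table indexed as dispatch_table[src_type_id][dst_type_id].
dispatch_table = [
    [contig_factory(s, d) for d in TYPE_CODES] for s in TYPE_CODES
]


def copy_and_cast(src: array, dst: array) -> None:
    """Look up the implementation by the runtime type ids and call it."""
    src_id = TYPE_CODES.index(src.typecode)
    dst_id = TYPE_CODES.index(dst.typecode)
    dispatch_table[src_id][dst_id](src, dst)


# Usage: different types, then identical types.
ints = array("i", [1, 2, 3])
doubles = array("d", [0.0] * 3)
copy_and_cast(ints, doubles)

same = array("i", [0] * 3)
copy_and_cast(ints, same)
```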

ndgrigorian approved these changes Apr 11, 2023

oleksandr-pavlyk merged commit 02d1f94 into master Apr 11, 2023

oleksandr-pavlyk deleted the simplify-copy-and-cast-kernels branch April 11, 2023 11:33
