Overload `cholesky` method in order to prevent scalar indexing for `Diagonal` #384

simsurace · 2021-12-29T16:06:52Z

Makes this work without scalar indexing:

using CUDA, LinearAlgebra
CUDA.allowscalar(false)

n = 1024
d = CUDA.rand(n)
D = Diagonal(d)
cholesky(D)

Performance can probably be improved.

simsurace · 2021-12-29T17:08:08Z

Unfortunately I have no experience with oneAPI.jl. Will look into the failing tests later, if people think that this is a useful PR.

maleadt · 2021-12-30T08:23:00Z

Thanks for the PR!

> Unfortunately I have no experience with oneAPI.jl. Will look into the failing tests later, if people think that this is a useful PR.

The failure is a generic GPU array failure where Base (specifically BLAS) functionality is called where a specialization is needed:

  ArgumentError: cannot take the host address of a oneArray{Float32, 2}
  Stacktrace:
    [1] unsafe_convert(#unused#::Type{Ptr{Float32}}, x::oneArray{Float32, 2})
      @ oneAPI ~/.cache/julia-buildkite-plugin/depots/c9f52312-b528-44e4-9501-6d408762012b/packages/oneAPI/KbEvp/src/array.jl:176
    [2] potrf!(uplo::Char, A::oneArray{Float32, 2})
      @ LinearAlgebra.LAPACK /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/LinearAlgebra/src/lapack.jl:3088
    [3] _chol!
      @ /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/LinearAlgebra/src/cholesky.jl:172 [inlined]
    [4] cholesky!(A::Hermitian{Float32, oneArray{Float32, 2}}, ::Val{false}; check::Bool)
      @ LinearAlgebra /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/LinearAlgebra/src/cholesky.jl:252
    [5] cholesky!(A::oneArray{Float32, 2}, ::Val{false}; check::Bool)
      @ LinearAlgebra /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/LinearAlgebra/src/cholesky.jl:285
    [6] #cholesky#134
      @ /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/LinearAlgebra/src/cholesky.jl:378 [inlined]
    [7] cholesky (repeats 2 times)
      @ /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/LinearAlgebra/src/cholesky.jl:378 [inlined]
    [8] macro expansion
      @ /var/lib/buildkite-agent/builds/gpuci3/julialang/gpuarrays-dot-jl/test/testsuite/linalg.jl:107 [inlined]

No GPUArrays in that stack trace, so maybe this PR is missing an overload (or relying on CUDA.jl-specific functionality, which isn't allowed for generic GPUArrays.jl implementations)?

simsurace · 2021-12-30T11:27:01Z

Ah, looks like cholesky is generally broken for oneArrays.
EDIT: will add tests for cholesky without the Diagonal wrapper.

simsurace · 2021-12-30T13:15:38Z

I think the reason for the failure is that the generic LinearAlgebra.LAPACK.potrf! method that calls into BLAS does not work for oneArrays; a specific overload analogous to the one in CUDA/lib/cusolver/dense.jl is missing. Shall I open an issue in oneAPI.jl? I presume the overload should be in there, not here.

maleadt · 2021-12-30T14:48:43Z

Those kinds of overloads (replacing LinearAlgebra.BLAS and LinearAlgebra.LAPACK methods with GPU-specific ones) are something I'm trying to get rid of, actually, because CUBLAS really isn't a drop-in replacement for BLAS, so it's a bad place for putting overrides.

That said, unless GPUArrays.jl can provide a generic potrf, this is a bad test, so maybe it's better to move this implementation to CUDA.jl where the required functionality is available.

simsurace · 2021-12-30T20:52:39Z

So just to clarify: the generic cholesky overload introduced in this PR works on diagonal oneArrays and CuArrays. The problematic method was the one being called on dense oneArrays in the previous tests. I fixed the tests to not rely on these methods anymore. With this modification, everything passes.
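For readers following along, here is a minimal sketch of why a diagonal Cholesky factorization does not need scalar indexing: the factor of a positive-definite diagonal matrix is just the elementwise square root of its diagonal. The helper name `diagonal_cholesky_factor` is hypothetical and this is not the method added in this PR. It runs on a CPU vector here, and it is only an assumption that a GPU array type supporting `all` and broadcasting behaves the same way.

```julia
using LinearAlgebra

# Hypothetical helper for illustration only; not the overload from this PR.
function diagonal_cholesky_factor(D::Diagonal{<:Real})
    d = D.diag
    # `all` and broadcasting are whole-array operations, so no element of `d`
    # is read individually (assumed to hold for GPU array types as well).
    all(x -> x > 0, d) || throw(PosDefException(1))
    return Diagonal(sqrt.(d))
end

D = Diagonal([4.0, 9.0, 16.0])
U = diagonal_cholesky_factor(D)
@assert U' * U ≈ D   # U is the upper Cholesky factor of D
```

A real overload would also need to return a `Cholesky` object and report failure through its `info` field rather than throwing unconditionally; the sketch leaves that out.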

simsurace · 2021-12-30T21:01:02Z

test/testsuite/linalg.jl

@@ -116,7 +104,7 @@
            n = 128
            d = AT(rand(Float32, n))
            D = Diagonal(d)
-            F = AT(collect(D))
+            F = collect(D)


This modification avoids calling a dense cholesky method, which is not available for oneArrays.

Simone Carlo Surace added 4 commits December 29, 2021 14:03

Overload cholesky method

9d10288

Fix test

babc521

Improve performance

7102c8d

Return info

14a0f53

Simone Carlo Surace added 3 commits December 30, 2021 12:38

Print dispatched method for debugging purposes

d307cd1

Add test for cholesky

b3c3940

Fix test

0653949

Add dispatch info for debugging purposes

a4ec467

Remove bad test, fix diagonal test

9f7b8e2

Rearrange tests

828a536


maleadt merged commit 4cdb50b into JuliaGPU:master Dec 31, 2021

simsurace deleted the ss/cholesky branch January 21, 2022 09:18

This was referenced Jan 31, 2022

State of GPU support JuliaGaussianProcesses/KernelFunctions.jl#431

Open

Linear solvers #91

Open

maleadt mentioned this pull request Nov 10, 2022

Issues with cholesky(::Diagonal) #434

Closed
