Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Modes] DPC++ backend #494

Merged
merged 116 commits into from
Sep 9, 2021
Merged
Show file tree
Hide file tree
Changes from 101 commits
Commits
Show all changes
116 commits
Select commit Hold shift + click to select a range
e71cdd9
oneAPI headers changes phase 1
subarnar Aug 6, 2020
4ca9bdc
oneAPI mode registration
mpanoop Aug 18, 2020
937da89
changing include directory from oneapi to dpcpp
Sep 3, 2020
27ca53f
modifying oneqpi to dpcpp in src/modes
Sep 3, 2020
b25018f
Modified dpcpp/memory to match dpcpp implementation. There is a todo …
Sep 3, 2020
74cbb61
Translation of device.cpp
Sep 21, 2020
358c073
Some modifications on kernel.cpp
Sep 21, 2020
5c105ee
Other modifications on kernel
Sep 21, 2020
26e7398
Implementation of kernel.cpp. Still issue with the fact that SYCL ker…
Sep 23, 2020
ecfc203
It compiles now :) but won't work before we find a way to enqueue arg…
Sep 24, 2020
dffee51
almost done, examploe not compiling
Sep 25, 2020
0d4e35b
want to reverse
Sep 25, 2020
9e7146c
reverseRevert "almost done, examploe not compiling"
Sep 25, 2020
8e2aacd
Revert "It compiles now :) but won't work before we find a way to enq…
Sep 25, 2020
c727102
test
Sep 25, 2020
b1bc32a
test :wq
Sep 25, 2020
0afe7ea
Revert "Implementation of kernel.cpp. Still issue with the fact that …
Sep 25, 2020
50151b8
testRevert "Revert "It compiles now :) but won't work before we find …
Sep 25, 2020
a54e363
Revert "Revert "It compiles now :) but won't work before we find a wa…
Sep 25, 2020
2f95d38
Revert "want to reverse"
Sep 25, 2020
fdff55f
Fixing issues, adding a way to access parameter addresses, but the ex…
Sep 25, 2020
26c130d
Adding some headers
Sep 26, 2020
b652e62
changing the parallel architecture of dpcpp
Sep 28, 2020
8f3853d
changing example to show the problem
Sep 28, 2020
a2f9985
solving compilation issues Merge branch 'dpcpp' of github.com:mpanoop…
Sep 28, 2020
fa2b21d
Makefile fix for dpcpp mode registration
mpanoop Sep 28, 2020
599bf2c
Fix to expose all the DPCPP devices
mpanoop Sep 28, 2020
85eaaa0
able to compile addVectors example in DPCPP
Sep 28, 2020
68872c2
Merge branch 'dpcpp' of github.com:mpanoop/occa-dev into dpcpp
Sep 28, 2020
cd69112
correcting issue but still not running
Sep 28, 2020
1cd5be1
Few issues solved
Sep 28, 2020
92ad93d
testing stupid example
Sep 29, 2020
f4216f1
fixing issues and kernel is now running
Sep 29, 2020
43fa82a
Solving pointer free issue
Sep 29, 2020
ea2d3de
adding parser for dpcpp, not working but registered to the framework
Sep 29, 2020
18c2725
Updated runEnv on bin/occa.cpp to emit status of OCCA_DPCPP_ENABLED
mpanoop Sep 29, 2020
0985835
Adding modificaiton to the DPCPP parser
Oct 1, 2020
7690066
Merge branch 'dpcpp' of github.com:mpanoop/occa-dev into dpcpp
Oct 1, 2020
7d5f4c7
Modifying example to take one additional parameter
Oct 1, 2020
9806201
Commiting OKL quick and dirty
Oct 8, 2020
dad63fd
Fixing OKL, it is now working
Oct 9, 2020
7111703
Patching computation of local and global ndranges
Oct 9, 2020
e31c507
Introduced lambda_t header and updated exprNode header and cpp for la…
mpanoop Oct 9, 2020
f2b519c
Added capture mode for lambda in lambda type header
mpanoop Oct 9, 2020
ea867ec
OKL translator is working
Oct 10, 2020
61a4a29
Merge branch 'dpcpp' of github.com:mpanoop/occa-dev into dpcpp
Oct 10, 2020
deb7884
fixing indexing
Oct 11, 2020
414bee0
Added support for device functions
mpanoop Oct 11, 2020
3a85b86
Doing some cleanup in src/lang/mode
Oct 11, 2020
6ac2b59
Modifying iso3dfd OKL version to be compliant with serial mode
Oct 11, 2020
1eafc36
adding blocking to OKL
Oct 12, 2020
7e12526
Modified the device function example to reflect the latest change in …
mpanoop Oct 12, 2020
e420997
Fixed the DPC++ device selection logic to reflect the OCCA platform I…
mpanoop Oct 15, 2020
55ff2cb
Adding some correction to example 24. The stencil computation was not…
Oct 16, 2020
f8f1aea
Solving conflicts
Oct 16, 2020
846b6f0
Strating to add shared local variable support
Oct 28, 2020
468d45c
adding support for shared variables, iso is failing for so,e unknown …
Oct 28, 2020
57aaae8
Remove the static allocation statement from OKL when generating DPCPP…
mpanoop Oct 28, 2020
fe1cc14
Support for Shared variables
Oct 28, 2020
3220b1f
Merge branch 'dpcpp' of github.com:mpanoop/occa-dev into dpcpp
Oct 28, 2020
0071afa
- Add global and local ranges at top of kernel before queue submit
kris-rowe Nov 24, 2020
e57c435
Merge branch 'master' into sync-with-libocca
kris-rowe Jan 13, 2021
456b1f6
Move file for `occa::dpcpp` classes to src/occa/internal.
kris-rowe Jan 13, 2021
8982eb6
Remove build artifacts.
kris-rowe Jan 13, 2021
8625c3a
- Move classes for OKL translation to src/internal
kris-rowe Jan 14, 2021
b0ea495
Merge pull request #1 from mpanoop/kris-update
mpanoop Jan 20, 2021
84b0a64
Begin to transition dpcpp backend to use "launcher" approach.
kris-rowe Jan 20, 2021
efe74f1
Insert command queue and nd_range and beginning of argument list.
kris-rowe Jan 20, 2021
d2376d8
Native DPC++ kernels now work with "launcher" approach.
kris-rowe Jan 26, 2021
d7d502b
Initial translation logic for dpc++ parser.
kris-rowe Feb 16, 2021
d21c1e8
Completed OKL translation logic for dpc++.
kris-rowe Feb 22, 2021
a7d2721
Add exception handling to dpcpp backend.
kris-rowe Feb 23, 2021
15fa7b8
Polyfill and clean-up.
kris-rowe Feb 26, 2021
6bb47b0
Change command queue to store a/pass-by value: a pointer is not needed.
kris-rowe Mar 2, 2021
d92aa6c
Merge remote-tracking branch 'libocca/main' into kris-update
kris-rowe Mar 2, 2021
2bb405b
Remove dpcpp::memory::getPtr as this is now defined in the base class.
kris-rowe Mar 2, 2021
743a9a2
Clean-up:
kris-rowe Mar 3, 2021
ce78a35
Adds support for atomics to SYCL/DPC++ backend.
kris-rowe Mar 9, 2021
38302f0
Merge pull request #2 from mpanoop/kris-update
mpanoop Mar 10, 2021
7d1d310
Use FindCUDAToolkit CMake module to avoid include path conflicts with
kris-rowe Mar 11, 2021
b706574
- Remove the sycl::queue and nd_range from kernel metadata for inline…
kris-rowe Mar 11, 2021
c962f0e
Add interoperability example for DPC++.
kris-rowe Mar 12, 2021
567a481
Add example using hostMalloc and unifiedMalloc.
kris-rowe Mar 12, 2021
9828706
Merge pull request #3 from mpanoop/kris-update
mpanoop Mar 15, 2021
4a80a68
Check if more than just the SYCL host device is available when
kris-rowe Mar 18, 2021
dab171f
Remove outdated comments.
kris-rowe May 10, 2021
d0c1ee0
Merge remote-tracking branch 'libocca/main' into dpcpp
kris-rowe May 10, 2021
3d05701
Remove extra examples.
kris-rowe May 10, 2021
3da14b0
Clean-up unused files.
kris-rowe May 10, 2021
d7d55e9
Update pointer class to follow the _Rule of Zero_
kris-rowe May 21, 2021
c1264ce
Update the translation of `@shared` variables so they can be passed to
kris-rowe May 21, 2021
234c463
Update `@shared` translation to use `group_local_memory` extension
kris-rowe Aug 5, 2021
c275be3
Change `dpcpp::stream::memcpy` integer parameter to match other
kris-rowe Aug 6, 2021
0aa1413
[dpcpp] Check the device id is less than the total number of devices not
kris-rowe Aug 6, 2021
736c346
[DPCPP] Fix indexing issue in memory::copyTo/copyFrom.
kris-rowe Aug 25, 2021
4522b85
Return 0 from `dpcpp::streamTag` timing routines for now.
kris-rowe Aug 27, 2021
4daaee1
Update name of native DPC++ example kernel to match other cases.
kris-rowe Sep 7, 2021
3c467fa
Update device registration and counting to include the SYCL host device.
kris-rowe Sep 7, 2021
2c81b0e
Format argument list to be consistent with other examples.
kris-rowe Sep 7, 2021
ec404ee
Move DPC++ related primitives to the header for `dpcppParser`
kris-rowe Sep 7, 2021
85d7d1b
Add missing `device::is_host()` function to dpcpp/polyfill.
kris-rowe Sep 7, 2021
f6927fc
Resolve @dmed256 comments on PR for the DPC++ backend.
kris-rowe Sep 8, 2021
c654233
Update CMake build to handle case where DPC++ components aren't found.
kris-rowe Sep 8, 2021
35196fe
Merge branch 'main' into main
kris-rowe Sep 8, 2021
cffe1c3
Get Makefile build working for DPC++ backend.
kris-rowe Sep 8, 2021
238c469
Merge remote-tracking branch 'origin/main'
kris-rowe Sep 8, 2021
fd269c1
Update DPC++ backend to use lockless caching like libocca/main.
kris-rowe Sep 8, 2021
dd54605
Update standard flags to C++17 to allow for inline static members in
kris-rowe Sep 8, 2021
944c53e
Update Makefile for example 20_native_dpcpp_kernel.
kris-rowe Sep 8, 2021
c274892
Remove swp file.
kris-rowe Sep 8, 2021
eeaebaa
Update examples test script to match shuffled directories.
kris-rowe Sep 8, 2021
121e4c1
Remove opencl tests causing CI pipeline failures.
kris-rowe Sep 8, 2021
bc72d8d
Update src/occa/internal/bin/occa.cpp
kris-rowe Sep 8, 2021
7103b19
Update code coverage script.
kris-rowe Sep 8, 2021
0aa9183
Undo source auto formatting.
kris-rowe Sep 8, 2021
e9bdde7
[DPCPP] Increase test coverage for the DPC++ backend.
kris-rowe Sep 9, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
61 changes: 43 additions & 18 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@ if(NOT CMAKE_BUILD_TYPE)
set(CMAKE_BUILD_TYPE "Release" CACHE STRING "Build type" FORCE)
endif()

set(CMAKE_CXX_STANDARD 11)
set(CMAKE_CXX_STANDARD 17)
set(CMAKE_CXX_STANDARD_REQUIRED ON)
set(CMAKE_CXX_EXTENSIONS OFF)

Expand All @@ -28,9 +28,10 @@ set(CMAKE_POSITION_INDEPENDENT_CODE ON)
option(ENABLE_OPENMP "Build with OpenMP if available" ON)
option(ENABLE_CUDA "Build with CUDA if available" ON)
option(ENABLE_OPENCL "Build with OpenCL if available" ON)
option(ENABLE_HIP "Build with HIP if available" ON)
option(ENABLE_METAL "Build with Metal if available" ON)
option(ENABLE_MPI "Build with MPI if available" ON)
option(ENABLE_HIP "Build with HIP if available" ON)
option(ENABLE_METAL "Build with Metal if available" ON)
option(ENABLE_DPCPP "Build with SYCL/DPCPP if available" ON)
option(ENABLE_MPI "Build with MPI if available" ON)

option(ENABLE_TESTS "Build tests" OFF)
option(ENABLE_EXAMPLES "Build simple examples" OFF)
Expand Down Expand Up @@ -121,26 +122,19 @@ endif(ENABLE_OPENMP)

#---[ CUDA ]----------------------------
if(ENABLE_CUDA)
find_package(CUDA)
find_package(CUDAToolkit)

if(CUDA_FOUND)
if(CUDAToolkit_FOUND)
set(OCCA_CUDA_ENABLED 1)

# Find the shared library rather than the static that find_package returns
find_library(CUDART_LIB
NAMES cudart
PATHS ${CUDA_TOOLKIT_ROOT_DIR}
PATH_SUFFIXES lib64 lib
DOC "CUDA RT lib location")
message("-- CUDA include dirs: ${CUDAToolkit_INCLUDE_DIRS}")
message("-- CUDA driver library: ${CUDAToolkit_LIBRARY_DIR}")

message("-- CUDA include dirs: ${CUDA_INCLUDE_DIRS}")
message("-- CUDA libraries: ${CUDART_LIB};cuda}")
target_link_libraries(libocca PRIVATE CUDA::cuda_driver)

target_include_directories(libocca PRIVATE ${CUDA_INCLUDE_DIRS})
target_link_libraries(libocca PRIVATE ${CUDART_LIB} cuda)
else (CUDA_FOUND)
else (CUDAToolkit_FOUND)
set(OCCA_CUDA_ENABLED 0)
endif(CUDA_FOUND)
endif(CUDAToolkit_FOUND)
endif(ENABLE_CUDA)
#=======================================

Expand Down Expand Up @@ -181,6 +175,33 @@ if(ENABLE_OPENCL)
endif(ENABLE_OPENCL)
#=======================================

#---[ SYCL/DPCPP ]-----------------------
if(ENABLE_DPCPP)
find_path(
SYCL_INCLUDE_DIRS
NAMES
CL/sycl.hpp
)

find_library(
SYCL_LIBRARIES
NAMES
sycl libsycl
)
set(SYCL_CXX_FLAGS "-fsycl")

set(OCCA_DPCPP_ENABLED 1)

message("-- SYCL include dirs: ${SYCL_INCLUDE_DIRS}")
message("-- SYCL libraries: ${SYCL_LIBRARIES}")
message("-- SYCL CXX flags: ${SYCL_CXX_FLAGS}")

target_include_directories(libocca PRIVATE ${SYCL_INCLUDE_DIRS})
target_link_libraries(libocca PRIVATE ${SYCL_LIBRARIES})
target_compile_options(libocca BEFORE PRIVATE "${SYCL_CXX_FLAGS}")
endif(ENABLE_DPCPP)
#=======================================

#---[ HIP ]-----------------------------
if(ENABLE_HIP)
find_package(HIP)
Expand Down Expand Up @@ -257,6 +278,10 @@ if(OCCA_OPENMP_ENABLED)
COMPILE_FLAGS ${OpenMP_CXX_FLAGS})
endif()

if(OCCA_DPCPP_ENABLED)
set_source_files_properties(${OCCA_SRC_cpp} PROPERTIES COMPILE_FLAGS ${SYCL_CXX_FLAGS})
endif()

if(ENABLE_FORTRAN)
file(GLOB_RECURSE OCCA_SRC_f90
RELATIVE ${OCCA_SOURCE_DIR} "src/*.f90")
Expand Down
1 change: 1 addition & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -96,6 +96,7 @@ MAKE_COMPILED_DEFINES := $(shell cat "$(OCCA_DIR)/scripts/build/compiledDefinesT
s,@@OCCA_HIP_ENABLED@@,$(OCCA_HIP_ENABLED),g;\
s,@@OCCA_OPENCL_ENABLED@@,$(OCCA_OPENCL_ENABLED),g;\
s,@@OCCA_METAL_ENABLED@@,$(OCCA_METAL_ENABLED),g;\
s,@@OCCA_DPCPP_ENABLED@@,$(OCCA_DPCPP_ENABLED),g;\
s,@@OCCA_BUILD_DIR@@,$(OCCA_BUILD_DIR),g;"\
> "$(NEW_COMPILED_DEFINES)")

Expand Down
3 changes: 3 additions & 0 deletions examples/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,9 @@ macro(add_test_with_modes_and_nranks exe nranks)
if (OCCA_OPENCL_ENABLED)
add_test_with_mode_and_nranks(${exe} opencl "{mode: 'OpenCL', platform_id: 0, device_id: 0}" ${nranks})
endif()
if (OCCA_DPCPP_ENABLED)
add_test_with_mode_and_nranks(${exe} dpcpp "{mode: 'dpcpp', platform_id: 0, device_id: 0}" ${nranks})
endif()
if (OCCA_OPENMP_ENABLED)
add_test_with_mode_and_nranks(${exe} openmp "{mode: 'OpenMP'}" ${nranks})
endif()
Expand Down
2 changes: 1 addition & 1 deletion examples/cpp/01_add_vectors/addVectors.okl
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
const float *a,
const float *b,
float *ab) {
for (int i = 0; i < entries; ++i; @tile(16, @outer, @inner)) {
for (int i = 0; i < entries; ++i; @tile(4, @outer, @inner)) {
dmed256 marked this conversation as resolved.
Show resolved Hide resolved
ab[i] = a[i] + b[i];
}
}
8 changes: 3 additions & 5 deletions examples/cpp/01_add_vectors/main.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -14,20 +14,19 @@ occa::json parseArgs(int argc, const char **argv);
int main(int argc, const char **argv) {
occa::json args = parseArgs(argc, argv);

int entries = 5;
int entries = 12;

float *a = new float[entries];
float *b = new float[entries];
float *ab = new float[entries];

for (int i = 0; i < entries; ++i) {
a[i] = i;
b[i] = 1 - i;
b[i] = i;
ab[i] = 0;
}

occa::device device;
occa::kernel addVectors;
occa::memory o_a, o_b, o_ab;

//---[ Device Setup ]-------------------------------------
Expand Down Expand Up @@ -78,8 +77,7 @@ int main(int argc, const char **argv) {
o_ab = device.malloc(entries * sizeof(float));

// Compile the kernel at run-time
addVectors = device.buildKernel("addVectors.okl",
"addVectors");
occa::kernel addVectors = device.buildKernel("addVectors.okl","addVectors");

// Copy memory to the device
o_a.copyFrom(a);
Expand Down
4 changes: 1 addition & 3 deletions examples/cpp/06_shared_memory/reduction.okl
Original file line number Diff line number Diff line change
Expand Up @@ -2,17 +2,15 @@
const float *vec,
float *blockSum) {
// Partial reduction of vector using loop tiles of size block (power of 2)
for (int group = 0; group < ((entries + block - 1) / block); ++group; @outer) {
for (int group = 0; group < ((entries + block - 1) / block); ++group; @tile(16,@outer)) {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Any reason this file changed?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not 100% sure why this was changed. It can be removed.

@shared float s_vec[block];

for (int item = 0; item < block; ++item; @inner) {
if ((group * block + item) < entries) {
s_vec[item] = vec[group * block + item];
} else {
s_vec[item] = 0;
}
}

for (int alive = ((block + 1) / 2); 0 < alive; alive /= 2) {
for (int item = 0; item < block; ++item; @inner) {
if (item < alive) {
Expand Down
2 changes: 1 addition & 1 deletion examples/cpp/10_native_cpp_kernels/main.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ int main(int argc, const char **argv) {

for (int i = 0; i < entries; ++i) {
a[i] = i;
b[i] = 1 - i;
b[i] = i;
ab[i] = 0;
}

Expand Down
6 changes: 6 additions & 0 deletions examples/cpp/20_native_dpcpp_kernel/CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
if (OCCA_DPCPP_ENABLED)
compile_cpp_example(native_dpcpp_kernel main.cpp)

add_custom_target(cpp_example_native_dpcpp_kernel ALL COMMAND ${CMAKE_COMMAND} -E copy ${CMAKE_CURRENT_SOURCE_DIR}/addVectors.cpp addVectors.cpp)
add_dependencies(examples_cpp_native_dpcpp_kernel cpp_example_native_dpcpp_kernel)
endif()
23 changes: 23 additions & 0 deletions examples/cpp/20_native_dpcpp_kernel/Makefile
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
PROJ_DIR := $(dir $(abspath $(lastword $(MAKEFILE_LIST))))
ifndef OCCA_DIR
include $(PROJ_DIR)/../../../scripts/Makefile
else
include ${OCCA_DIR}/scripts/Makefile
endif

#---[ COMPILATION ]-------------------------------
headers = $(wildcard $(incPath)/*.hpp) $(wildcard $(incPath)/*.tpp)
sources = $(wildcard $(srcPath)/*.cpp)

objects = $(subst $(srcPath)/,$(objPath)/,$(sources:.cpp=.o))

${PROJ_DIR}/main: $(objects) $(headers) ${PROJ_DIR}/main.cpp
$(compiler) $(compilerFlags) -o ${PROJ_DIR}/main $(flags) $(objects) ${PROJ_DIR}/main.cpp $(paths) $(linkerFlags)

$(objPath)/%.o:$(srcPath)/%.cpp $(wildcard $(subst $(srcPath)/,$(incPath)/,$(<:.cpp=.hpp))) $(wildcard $(subst $(srcPath)/,$(incPath)/,$(<:.cpp=.tpp)))
$(compiler) $(compilerFlags) -o $@ $(flags) -c $(paths) $<

clean:
rm -f $(objPath)/*;
rm -f $(PROJ_DIR)/main
#=================================================
17 changes: 17 additions & 0 deletions examples/cpp/20_native_dpcpp_kernel/addVectors.cpp
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
#include <CL/sycl.hpp>

extern "C" void addVectors(::sycl::queue *q,
::sycl::nd_range<3> *ndrange,
int &entries,
int *a,
int *b,
int *c)
{
q->submit([&](::sycl::handler &h) {
h.parallel_for(*ndrange, [=](::sycl::nd_item<3> i) {
int ii = i.get_global_id(0) + i.get_global_id(1) * i.get_global_range(0) + i.get_global_id(2) * i.get_global_range(0) * i.get_global_range(1);
if (ii < entries)
c[ii] = a[ii] + b[ii];
});
});
}
109 changes: 109 additions & 0 deletions examples/cpp/20_native_dpcpp_kernel/main.cpp
Original file line number Diff line number Diff line change
@@ -0,0 +1,109 @@
#include <iostream>

#include <occa.hpp>

//---[ Internal Tools ]-----------------
// Note: These headers are not officially supported
// Please don't rely on it outside of the occa examples
#include <occa/internal/utils/cli.hpp>
#include <occa/internal/utils/testing.hpp>
//======================================

occa::json parseArgs(int argc, const char **argv);


int main(int argc, const char **argv) {
occa::json args = parseArgs(argc, argv);

int entries = 16;

int *a = new int[entries];
int *b = new int[entries];
int *ab = new int[entries];

for (int i = 0; i < entries; ++i) {
a[i] = i;
b[i] = i;
ab[i] = 0;
}

// Setup the platform and device IDs
occa::properties deviceProps;
deviceProps["mode"] = "dpcpp";
deviceProps["platform_id"] = (int) args["options/platform-id"];
deviceProps["device_id"] = (int) args["options/device-id"];

occa::device device(deviceProps);
// Allocate memory on the device
occa::memory o_a = device.malloc<int>(entries);
occa::memory o_b = device.malloc<int>(entries);
occa::memory o_ab = device.malloc<int>(entries);

// Compile a regular DPCPP kernel at run-time
occa::properties kernelProps;
kernelProps["okl/enabled"] = false;

occa::kernel addVectors = device.buildKernel("addVectors.cpp",
"addVectors",
kernelProps);

// Copy memory to the device
o_a.copyFrom(a);
o_b.copyFrom(b);
o_ab.copyFrom(ab);

addVectors.setRunDims(entries/4,4);
// Launch device kernel
addVectors(entries, o_a, o_b, o_ab);
// Copy result to the host
o_ab.copyTo(ab);

// Assert values
for (int i = 0; i < entries; ++i) {
std::cout << i << ": " << ab[i] << '\n';
}
for (int i = 0; i < entries; ++i) {
if (!occa::areBitwiseEqual(ab[i], a[i] + b[i])) {
throw 1;
}
}

// Free host memory
delete [] a;
delete [] b;
delete [] ab;

return 0;
}

occa::json parseArgs(int argc, const char **argv) {
// Note:
// occa::cli is not supported yet, please don't rely on it
// outside of the occa examples
occa::cli::parser parser;
parser
.withDescription(
"Example of using a regular SYCL/DPC++ kernel instead of an OCCA kernel"
)
.addOption(
occa::cli::option('p', "platform-id",
"DPC++ platform ID (default: 0)")
.withArg()
.withDefaultValue(0)
)
.addOption(
occa::cli::option('d', "device-id",
"DPC++ device ID (default: 0)")
.withArg()
.withDefaultValue(0)
)
.addOption(
occa::cli::option('v', "verbose",
"Compile kernels in verbose mode")
);

occa::json args = parser.parseArgs(argc, argv);
occa::settings()["kernel/verbose"] = args["options/verbose"];

return args;
}
2 changes: 2 additions & 0 deletions examples/cpp/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -17,3 +17,5 @@ add_subdirectory(15_cuda_interop)
# Don't force-compile OpenGL examples
# add_subdirectory(16_finite_difference)
# add_subdirectory(17_mandelbulb)

add_subdirectory(20_native_dpcpp_kernel)
Binary file added include/occa/core/.device.hpp.swp
Binary file not shown.
26 changes: 26 additions & 0 deletions include/occa/defines/errors.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -124,4 +124,30 @@
#define OCCA_METAL_WARNING(message, expr) OCCA_METAL_WARNING2(expr, __FILE__, __PRETTY_FUNCTION__, __LINE__, message)
//======================================


//---[ DPCPP ]-------------------------
//======================================

//---[ DPCPP ]---------------------------
#define OCCA_DPCPP_TEMPLATE_CHECK(checkFunction, expr, filename, function, line, message) \
try \
{ \
expr; \
} \
catch (const ::sycl::exception &e) \
{ \
std::stringstream _check_ss; \
_check_ss << message; \
checkFunction(e, filename, function, line, _check_ss.str()); \
}

#define OCCA_DPCPP_ERROR3(expr, filename, function, line, message) OCCA_DPCPP_TEMPLATE_CHECK(occa::dpcpp::error, expr, filename, function, line, message)
#define OCCA_DPCPP_ERROR2(expr, filename, function, line, message) OCCA_DPCPP_ERROR3(expr, filename, function, line, message)
#define OCCA_DPCPP_ERROR(message, expr) OCCA_DPCPP_ERROR2(expr, __FILE__, __PRETTY_FUNCTION__, __LINE__, message)

#define OCCA_DPCPP_WARNING3(expr, filename, function, line, message) OCCA_DPCPP_TEMPLATE_CHECK(occa::dpcpp::warn, expr, filename, function, line, message)
#define OCCA_DPCPP_WARNING2(expr, filename, function, line, message) OCCA_DPCPP_WARNING3(expr, filename, function, line, message)
#define OCCA_DPCPP_WARNING(message, expr) OCCA_DPCPP_WARNING2(expr, __FILE__, __PRETTY_FUNCTION__, __LINE__, message)
//======================================

#endif
Loading