[Driver][SYCL] Improve dependency file behaviors for -fintelfpga #1145

mdtoguchi · 2020-02-19T02:12:21Z

When compiling AOT for FPGA dependency files are generated that are used
by the aoc compilation. Single step compilation is seamless as the dependency
file is generated then immediately used. When compiling to object, keeping
track of the dependency file is not so intuitive.

To alleviate problems of not being able to find the dependency file, the
original dependency file is bundled up with the destination fat object and
when used is unbundled and passed to the aoc compilation.

Signed-off-by: Michael D Toguchi michael.d.toguchi@intel.com

clang/lib/Driver/Driver.cpp

clang/lib/Driver/ToolChains/Clang.cpp

clang/lib/Driver/ToolChains/SYCL.cpp

clang/lib/Driver/Driver.cpp

This patch improves the tool's diagnostic upon finding a SPIR kernel within an LLVM module. Despite that the tool's only current use is within the SYCL FPGA flow, it's important to make the message target-agnostic, so that the tool is not tied to a particular device BE. A related commit to the Clang driver has extended these diagnostics with SYCL FPGA specifics without affecting the tool itself. This patch also introduces testing for the return code value. For example, this should allow the Clang driver users/developers to differentiate between the two possible causes of llvm-no-spir-kernel failure. Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Signed-off-by: Alexey Bader <alexey.bader@intel.com>

intel#1141) Signed-off-by: Aleksander Fadeev <aleksander.fadeev@intel.com>

clang/lib/Driver/ToolChains/Clang.cpp

clang/lib/Driver/Driver.cpp

AGindinson

LGTM overall, just a couple more nits

clang/lib/Driver/Driver.cpp

AGindinson · 2020-02-20T23:19:08Z

clang/test/Driver/sycl-offload-intelfpga.cpp

+// RUN: %clangxx -### -fsycl -fintelfpga %t-1.o %t-2.o 2>&1 \
+// RUN:  | FileCheck -check-prefix=CHK-FPGA-DEP-FILES-OBJ -DINPUT1=%t-1.o -DINPUT2=%t-2.o %s


Would a clang-cl run be also applicable? It may be worth adding as per the usual "just in case"

Added a clang_cl test here.

Could you also add it for other tests? Sorry for missing this initially

clang/lib/Driver/ToolChains/Clang.cpp

AGindinson · 2020-02-21T06:59:41Z

clang/test/Driver/sycl-offload-intelfpga.cpp

+// RUN: %clangxx -### -fsycl -fintelfpga %t-1.o %t-2.o 2>&1 \
+// RUN:  | FileCheck -check-prefix=CHK-FPGA-DEP-FILES-OBJ -DINPUT1=%t-1.o -DINPUT2=%t-2.o %s


Could you also add it for other tests? Sorry for missing this initially

Move internal headers from include/CL/sycl to source directory to prevent implementation details leak to user application and enforce stable ABI. A few more changes were applied to make the movement possible: - addHostAccessorAndWait functions in accessor to avoid calls to RT internals from header file - Removed getImageInfo - Move buffer size acquisition from buffer constructor to SYCLMemObjT cpp to avoid calls to PI - getPluginFromContext function in context - Standard containers replaced with SYCL variants in sycl_mem_obj_i.hpp. Unique ptr replaced with shared - A few implementations moved from queue.hpp to queue.cpp - Some LIT tests temporarily include implementaion specific headers. They will be converted to unit tests later. Signed-off-by: Alexander Batashev <alexander.batashev@intel.com>

intel#1144) Since we really just want to be able to memcpy the type to the device, 'is-trivially-copyable' is not the correct trait. Since CWG1734, If we want to support trivially copyable types, we would be required to create 1 of 4 different mechanisms for having a type on the device (depending on the way the type is structured). Additionally, 2 of these ways require us to ALSO have the type be default constructible. This patch transitions to trivially-copy-constructible , so that we can simply memcpy from the existing one into new memory. Signed-off-by: Erich Keane <erich.keane@intel.com>

AGindinson

Thanks!

intel#1118) Signed-off-by: James Brodman <james.brodman@intel.com>

LowerWGScope pass performs required transformations to enable hierarchical parallelism semantics. This pass should not be skipped even if optimizations are disabled. Also some typos in the comments are fixed. Signed-off-by: Artur Gainullin <artur.gainullin@intel.com>

…el#1156) After intel#1068 has included the Demangle header, this fix to CMakeLists should guarantee successful builds in all configurations Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

SPIR-V OpGroupBroadcast accepts three forms of local ID: - scalar integer - vector integer with 2 components - vector integer with 3 components Signed-off-by: John Pennycook <john.pennycook@intel.com>

Also remove idle semicolon. Signed-off-by: Alexey Bader <alexey.bader@intel.com>

…#1162) Fix the cl_device_unified_shared_memory_capabilities_intel bitfield type name. Signed-off-by: Alexey Bader <alexey.bader@intel.com>

* [SYCL][LIBCLC] Additional libclc builtins to support SYCL work Adds builtins to libclc to support the CUDA backend for SYCL. Contributors Alexander Johnston <alexander@codeplay.com> David Wood <david.wood@codeplay.com> Victor Lomuller <victor@codeplay.com> Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] CMake and lit support for SYCL CUDA backend Adds defines CMake and lit variables used for SYCL CUDA backend development and test Contributors Alexander Johnston <alexander@codeplay.com> Bjoern Knafla <bjoern@codeplay.com> Ruyman Reyes <ruyman@codeplay.com> Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] Local Accessor Support for CUDA Provides the LocalAccessorToSharedMemory compiler pass required for supporting SYCL local accessors in CUDA. Contributors Alexander Johnston <alexander@codeplay.com> David Wood <david.wood@codeplay.com> Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Change __spirv_BuiltIn.. to functions Changes the following builtins to functions __spirv_BuiltInGlobalSize __spirv_BuiltInWorkgroupSize __spirv_BuiltInNumWorkgroups __spirv_BuiltInLocalInvocationId __spirv_BuiltInWorkgroupId __spirv_BuiltInGlobalOffset Contributors David Wood <david.wood@codeplay.com> Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Add SYCL CUDA support to clang driver Adds CUDA support for sycl compilation in the clang driver Contributors Alexander Johnston <alexander@codeplay.com> David Wood <david.wood@codeplay.com> Victor Lomuller <victor@codeplay.com> Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Initial Implementation of the CUDA backend Contributors Alan Forbes <alan.forbes@codeplay.com> Alexander Johnston <alexander@codeplay.com> Bjoern Knafla <bjoern@codeplay.com> Daniel Soutar <daniel.soutar@codeplay.com> David Wood <david.wood@codeplay.com> Kumudha Narasimhan <kumudha.narasimhan@codeplay.com> Mehdi Goli <mehdi.goli@codeplay.com> Przemek Malon <przemek.malon@codeplay.com> Ruyman Reyes <ruyman@codeplay.com> Stuart Adams <stuart.adams@codeplay.com> Svetlozar Georgiev <svetlozar.georgiev@codeplay.com> Steffen Larsen <steffen.larsen@codeplay.com> Victor Lomuller <victor@codeplay.com> Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] Update libclc install rules Have libclc install clc-* and libspirv-* to lib and share Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Inline cl namespace to simplify SYCL API usage Synchronise the CUDA backend with the general SYCL changes from intel#974. Signed-off-by: Andrea Bocci <andrea.bocci@cern.ch> * Added missing flags for device-side builtins Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Removing unnecessary tool from the tree Acked-by: Victor Lomuller <victor@codeplay.com> Signed-off-by: Ruyman <ruyman@codeplay.com> * [SYCL][PI] Fix kernel group info parameter conversion Signed-off-by: Steffen Larsen <steffen.larsen@codeplay.com> * [SYCL][CUDA] Refactor __SYCL_INLINE macro Synchronise the CUDA backend with the general SYCL changes from intel#1121. Signed-off-by: Andrea Bocci <andrea.bocci@cern.ch> * [SYCL] Have default_selector consider SYCL_BE Have the default_selector consider the env var SYCL_BE when rating device scores to make choosing a backend easier. Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] Select GlobalPlugin based on SYCL_BE Rather than choose the last found plugin as GlobalPlugin, select it depending on the SYCL_BE env var. Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] Improve default device selection checks Better checks for CUDA and OpenCL devices to match with SYCL_BE in the default device selection, based on the platform version info. Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] Formatting update for device_selector.cpp Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] Changed CUDA unit tests to call through plugin Signed-off-by: Steffen Larsen <steffen.larsen@codeplay.com> * [SYCL] Pass SYCL_BE=PI_OPENCL in check-sycl To ensure that the check-sycl targets test OpenCL devices, pass SYCL_BE=PI_OPENCL. This mirrors the check-sycl-cuda target which passes SYCL_BE=PI_CUDA. Without this it is nondeterministic which device is tested by check-sycl. Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Remove PI_CUDA specific details from clang Removes PI_CUDA specific code paths and tests from clang, opting to always enable them. Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Disable linear_id/opencl-interop.cpp for cuda Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Further fixes to CUDA device selection Fix platform string comparison for CUDA platform detection. Fix device info platform query so that it uses the device's plugin, rather than the GlobalPlugin. Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Code style and cleanup to CUDA support Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL] Enable asserts in all buildbot builds Signed-off-by: Alexander Johnston <alexander@codeplay.com> * [SYCL][CUDA] Minor test and build configuration Fix minor test and build configuration issues introduced in the development of the CUDA backend. Signed-off-by: Alexander Johnston <alexander@codeplay.com> Co-authored-by: Andrea Bocci <andrea.bocci@cern.ch> Co-authored-by: Ruyman <ruyman@codeplay.com> Co-authored-by: Steffen Larsen <56076654+steffenlarsen@users.noreply.github.com>

Signed-off-by: Alexey Bader alexey.bader@intel.com Co-Authored-By: Alexander Batashev <alexbatashev@outlook.com>

Error was reproducible in two cases: - using something like `numeric_limits<half>::min()` in within another `constexpr` - not treating SYCL headers as system ones with `-Winvalid-constexpr` treated as error Signed-off-by: Alexey Sachkov <alexey.sachkov@intel.com>

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Event type triggers are misspelled "open"->"opened", etc. Default event type triggers should work fine. Signed-off-by: Alexey Bader <alexey.bader@intel.com>

…1053) We had issue with wrong mangling of s_upsample. I fixed it a long time ago, so we can delete workaround now. Signed-off-by: Ilya Mashkov <ilya.mashkov@intel.com>

Signed-off-by: Igor Dubinov <igor.dubinov@intel.com>

When compiling AOT for FPGA dependency files are generated that are used by the aoc compilation. Single step compilation is seamless as the dependency file is generated then immediately used. When compiling to object, keeping track of the dependency file is not so intuitive. To alleviate problems of not being able to find the dependency file, the original dependency file is bundled up with the destination fat object and when used is unbundled and passed to the aoc compilation. Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

Use new FPGA dependency type which is used for unbundling of the FPGA dependency from an object. Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

Use TY_FPGA_Dependencies for generated file from unbundle Clean up comments and add clang_cl specific test for dep unbundle Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

bader · 2020-02-25T20:30:10Z

@mdtoguchi, it looks like something went wrong with your branch. It has irrelevant commits already present in the target branch.

mdtoguchi · 2020-02-25T21:30:05Z

@mdtoguchi, it looks like something went wrong with your branch. It has irrelevant commits already present in the target branch.

whoa. I don't know what happened. I rebased to fix the conflict. I'll start from scratch.

mdtoguchi · 2020-02-25T22:09:44Z

Created a new PR: #1186

mdtoguchi · 2020-03-05T17:34:20Z

redo has been merged, closing this one.

It is translated to a function with unmangled name __spirv_BuildNDRange_{1|2|3}D with struct return parameter and array arguments, since translator only translates it properly to SPIR-V with this signature. _ND postfix is requred because array arguments are mangled in the same way, so if there was no postfix, translator would produce functions with same name for different dimensions. Original commit: KhronosGroup/SPIRV-LLVM-Translator@a6ca745

mdtoguchi requested review from Fznamznon, AGindinson, domiyan and sndmitriev February 19, 2020 02:12

AGindinson self-assigned this Feb 19, 2020

AGindinson reviewed Feb 19, 2020

View reviewed changes

mdtoguchi requested a review from AGindinson February 20, 2020 00:50

bader and others added 2 commits February 20, 2020 11:18

[SYCL][NFC] Remove idle space (intel#1148)

59f39b2

Signed-off-by: Alexey Bader <alexey.bader@intel.com>

[SYCL] Forbid declaration of non-const static variables inside kernels (

7743e86

intel#1141) Signed-off-by: Aleksander Fadeev <aleksander.fadeev@intel.com>

AGindinson reviewed Feb 20, 2020

View reviewed changes

clang/lib/Driver/ToolChains/Clang.cpp Outdated Show resolved Hide resolved

clang/lib/Driver/Driver.cpp Outdated Show resolved Hide resolved

AGindinson reviewed Feb 20, 2020

View reviewed changes

AGindinson reviewed Feb 21, 2020

View reviewed changes

clang/lib/Driver/ToolChains/Clang.cpp Show resolved Hide resolved

AGindinson previously approved these changes Feb 21, 2020

View reviewed changes

Alexander Batashev and others added 2 commits February 21, 2020 14:03

mdtoguchi dismissed AGindinson’s stale review via b3752b8 February 21, 2020 16:45

AGindinson previously approved these changes Feb 21, 2020

View reviewed changes

jbrodman and others added 2 commits February 21, 2020 19:54

[SYCL][Doc][USM] Add refactored pointer and device queries to USM spec (

0438422

intel#1118) Signed-off-by: James Brodman <james.brodman@intel.com>

mdtoguchi dismissed AGindinson’s stale review via c5eadbf February 22, 2020 00:56

Artem Gindinson and others added 9 commits February 22, 2020 13:23

[SYCL] Add llvm/Demangle link dependency for llvm-no-spir-kernel (int…

1d8f577

…el#1156) After intel#1068 has included the Demangle header, this fix to CMakeLists should guarantee successful builds in all configurations Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

[SYCL] Fix __spirv_GroupBroadcast overloads (intel#1152)

5d73019

SPIR-V OpGroupBroadcast accepts three forms of local ID: - scalar integer - vector integer with 2 components - vector integer with 3 components Signed-off-by: John Pennycook <john.pennycook@intel.com>

[SYCL][NFC] Fix unreferenced variable warning (intel#1158)

c494112

Also remove idle semicolon. Signed-off-by: Alexey Bader <alexey.bader@intel.com>

[USM] Align OpenCL USM extension header with the specification (intel…

a0c0e33

…#1162) Fix the cl_device_unified_shared_memory_capabilities_intel bitfield type name. Signed-off-by: Alexey Bader <alexey.bader@intel.com>

[CI] Add clang-format checker to pre-commit checks (intel#1163)

80b0306

Signed-off-by: Alexey Bader alexey.bader@intel.com Co-Authored-By: Alexander Batashev <alexbatashev@outlook.com>

[SYCL][NFC] Remove idle flag (intel#1157)

e2130b1

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

[CI] Remove invalid type triggers (intel#1176)

12577b5

Event type triggers are misspelled "open"->"opened", etc. Default event type triggers should work fine. Signed-off-by: Alexey Bader <alexey.bader@intel.com>

imashkov and others added 11 commits February 25, 2020 19:22

[SYCL]Deletion of workaround for wrong mangling of s_upsample (intel#…

da0f66b

…1053) We had issue with wrong mangling of s_upsample. I fixed it a long time ago, so we can delete workaround now. Signed-off-by: Ilya Mashkov <ilya.mashkov@intel.com>

[SYCL] Refactor id<1> to size_t conversion (intel#1126)

3fe01fb

Signed-off-by: Igor Dubinov <igor.dubinov@intel.com>

[NFC] Address smaller review comments.

92684ed

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

[NFC] Add test to verify phases when extracting dependency from obj

c26fb0d

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

[Driver][SYCL][FPGA] Introduce FPGA dependency type.

c927bb4

Use new FPGA dependency type which is used for unbundling of the FPGA dependency from an object. Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

[Driver] Add fpga_dep arch type to use for bundle/unbundle

d05b2e5

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

[Driver] Update based on review comments

e345df7

Use TY_FPGA_Dependencies for generated file from unbundle Clean up comments and add clang_cl specific test for dep unbundle Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

[NFC] update missing triples functions for fpga_dep

0eb0bda

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

[NFC] Update test to include more clang-cl testing

7039a67

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

[NFC] Additional Triple switches that were missed for fpga_dep

385a8a4

Signed-off-by: Michael D Toguchi <michael.d.toguchi@intel.com>

mdtoguchi closed this Mar 5, 2020

		// RUN: %clangxx -### -fsycl -fintelfpga %t-1.o %t-2.o 2>&1 \
		// RUN: \| FileCheck -check-prefix=CHK-FPGA-DEP-FILES-OBJ -DINPUT1=%t-1.o -DINPUT2=%t-2.o %s

[Driver][SYCL] Improve dependency file behaviors for -fintelfpga #1145

[Driver][SYCL] Improve dependency file behaviors for -fintelfpga #1145

Uh oh!

Conversation

mdtoguchi commented Feb 19, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AGindinson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

AGindinson Feb 20, 2020

Choose a reason for hiding this comment

Uh oh!

mdtoguchi Feb 21, 2020

Choose a reason for hiding this comment

Uh oh!

AGindinson Feb 21, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AGindinson Feb 21, 2020

Choose a reason for hiding this comment

Uh oh!

AGindinson left a comment

Choose a reason for hiding this comment

Uh oh!

bader commented Feb 25, 2020

Uh oh!

mdtoguchi commented Feb 25, 2020

Uh oh!

mdtoguchi commented Feb 25, 2020

Uh oh!

mdtoguchi commented Mar 5, 2020

Uh oh!

Uh oh!