You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[Srikanth Erusalagandi](mailto:serusal@xilinx.com), designer of the ``XVDPU-PCIE TRD`` platform
46
-
-[Tony McDowell](mailto:tmcdowe@xilinx.com), PCIe functional test and debug
47
-
-[Florent Werbrouck](mailt@florentw@xilinx.com), supporting the SR cases related to this project
51
+
#### Last update
48
52
49
-
### Last update
53
+
18 November 2021
50
54
51
-
18 October 2021
52
55
53
-
---
54
56
55
57
# 1 Introduction
56
58
@@ -59,7 +61,7 @@ This repository contains the **Pre- and Post-processing** kernels to be used in
59
61
The two accelerators were tested using data coming from the Semantic Segmentation CNN of this tutorial:
60
62
[VAI-KERAS-FCN8-SEMSEG](https://github.com/Xilinx/Vitis-AI-Tutorials/tree/master/Design_Tutorials/05-Keras_FCN8_UNET_segmentation), where the CNN was retrained with larger image sizes as 1920x832, but the accelerators are general enough to be used or easily adapted with few changes also to other Deep Learning applications, such as Object Detection or Image Classification.
61
63
62
-
At the moment we are targeting the VCK190 Pre-Production (PP) board, with the so called ``XVDPU-PCIE TRD`` platform, which contains a DPU designed with 96 AI Engine cores (over the 400 available) besides other PL resources (BRAMs, URAMs, FFs, LUTs, DSPs).
64
+
At the moment we are targeting the VCK190 Pre-Production (PP) board, with the so called ``XVDPU TRD`` platform, which contains a DPU designed with 96 AI Engine cores (over the 400 available) besides other PL resources (BRAMs, URAMs, FFs, LUTs, DSPs).
63
65
64
66
The two accelerators do not use any core from the AI Engine array of the Versal ACAP, to be more portable later also on MPSoC devices. Their design is done with **Vitis High Level Synthesis** (shortly **HLS** in the following of this document) within the Vitis suite.
65
67
@@ -347,17 +349,17 @@ Note that this latency is the time to process the entire frame (860x416x28) of d
347
349
348
350
# 3 Vitis GUI-based Design Flow
349
351
350
-
This section explains how to build the embedded system project with the Vitis GUI, now that you have developed the two accelerator kernels as standalone HLS projects. You must have available the following ``platform`` and ``petalinux`` folders/files related to the ``XVDPU-PCIE TRD`` platform design:
352
+
This section explains how to build the embedded system project with the Vitis GUI, now that you have developed the two accelerator kernels as standalone HLS projects. You must have available the following ``platform`` and ``petalinux`` folders/files related to the ``XVDPU TRD`` platform design:
Since the DPU core is not yet in this design, the two PL accelerators work with pre-defined scaling factors. In the real application the information about such scaling factors should arrive by searching for the ``fix_point`` attributes of the input and output tensors of the CNN subgraph running in the DPU.
0 commit comments