Skip to content

Commit be6c04a

Browse files
committed
Updated VDPU-PRE-POST-PL-ACC
1 parent 031b890 commit be6c04a

File tree

1 file changed

+17
-15
lines changed
  • Design_Tutorials/13-vdpu-pre-post-pl-acc

1 file changed

+17
-15
lines changed

Design_Tutorials/13-vdpu-pre-post-pl-acc/README.md

+17-15
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,6 @@ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
1313
See the License for the specific language governing permissions and
1414
limitations under the License.
1515
16-
Author: Daniele Bagni, Xilinx Inc
1716
-->
1817

1918
<table class="sphinxhide">
@@ -25,32 +24,35 @@ Author: Daniele Bagni, Xilinx Inc
2524
</tr>
2625
</table>
2726

27+
2828
# Pre- and Post-processing PL Accelerators for ML with Versal DPU
2929

3030
## Current status
3131

32+
3233
1. Designed with Vitis 2020.2 environment
3334

3435
2. Tested in hardware ON VCK190PP with ``XVDPU-PCIE TRD`` platform.
3536

36-
### Authors
37+
38+
#### Authors
3739

3840
- [Daniele Bagni](mailto:danieleb@xilinx.com):
3941
- [Peter Schillinger](mailto:peters@xilin.com):
4042
- [Herve Ratigner](mailto:herver@xilinx.com):
4143
- [Kay Migge](mailto:kaym@xilinx.com)
4244

43-
### Acknowledgements
45+
#### Acknowledgements
46+
- [Srikanth Erusalagandi](mailto:serusal@xilinx.com)
47+
- [Tony McDowell](mailto:tmcdowe@xilinx.com)
48+
- [Florent Werbrouck](mailt@florentw@xilinx.com)
49+
4450

45-
- [Srikanth Erusalagandi](mailto:serusal@xilinx.com), designer of the ``XVDPU-PCIE TRD`` platform
46-
- [Tony McDowell](mailto:tmcdowe@xilinx.com), PCIe functional test and debug
47-
- [Florent Werbrouck](mailt@florentw@xilinx.com), supporting the SR cases related to this project
51+
#### Last update
4852

49-
### Last update
53+
18 November 2021
5054

51-
18 October 2021
5255

53-
---
5456

5557
# 1 Introduction
5658

@@ -59,7 +61,7 @@ This repository contains the **Pre- and Post-processing** kernels to be used in
5961
The two accelerators were tested using data coming from the Semantic Segmentation CNN of this tutorial:
6062
[VAI-KERAS-FCN8-SEMSEG](https://github.com/Xilinx/Vitis-AI-Tutorials/tree/master/Design_Tutorials/05-Keras_FCN8_UNET_segmentation), where the CNN was retrained with larger image sizes as 1920x832, but the accelerators are general enough to be used or easily adapted with few changes also to other Deep Learning applications, such as Object Detection or Image Classification.
6163

62-
At the moment we are targeting the VCK190 Pre-Production (PP) board, with the so called ``XVDPU-PCIE TRD`` platform, which contains a DPU designed with 96 AI Engine cores (over the 400 available) besides other PL resources (BRAMs, URAMs, FFs, LUTs, DSPs).
64+
At the moment we are targeting the VCK190 Pre-Production (PP) board, with the so called ``XVDPU TRD`` platform, which contains a DPU designed with 96 AI Engine cores (over the 400 available) besides other PL resources (BRAMs, URAMs, FFs, LUTs, DSPs).
6365

6466
The two accelerators do not use any core from the AI Engine array of the Versal ACAP, to be more portable later also on MPSoC devices. Their design is done with **Vitis High Level Synthesis** (shortly **HLS** in the following of this document) within the Vitis suite.
6567

@@ -347,17 +349,17 @@ Note that this latency is the time to process the entire frame (860x416x28) of d
347349

348350
# 3 Vitis GUI-based Design Flow
349351

350-
This section explains how to build the embedded system project with the Vitis GUI, now that you have developed the two accelerator kernels as standalone HLS projects. You must have available the following ``platform`` and ``petalinux`` folders/files related to the ``XVDPU-PCIE TRD`` platform design:
352+
This section explains how to build the embedded system project with the Vitis GUI, now that you have developed the two accelerator kernels as standalone HLS projects. You must have available the following ``platform`` and ``petalinux`` folders/files related to the ``XVDPU TRD`` platform design:
351353

352354
```text
353355
# TRD platform file
354-
ZF_VDPU_TRD/platform/vck190_dpu_pcie/vck190_dpu_pcie.xpfm
356+
VDPU_TRD/platform/vck190_dpu_pcie/vck190_dpu_pcie.xpfm
355357
# Sysroot path
356-
ZF_VDPU_TRD/petalinux/xilinx-vck190-base-trd/images/linux/sdk/sysroots/aarch64-xilinx-linux/
358+
VDPU_TRD/petalinux/xilinx-vck190-base-trd/images/linux/sdk/sysroots/aarch64-xilinx-linux/
357359
# Root FS file
358-
ZF_VDPU_TRD/petalinux/xilinx-vck190-base-trd/images/linux/rootfs.ext4
360+
VDPU_TRD/petalinux/xilinx-vck190-base-trd/images/linux/rootfs.ext4
359361
# Linux Kernel Image file
360-
ZF_VDPU_TRD/petalinux/xilinx-vck190-base-trd/images/linux/Image
362+
VDPU_TRD/petalinux/xilinx-vck190-base-trd/images/linux/Image
361363
```
362364

363365
Since the DPU core is not yet in this design, the two PL accelerators work with pre-defined scaling factors. In the real application the information about such scaling factors should arrive by searching for the ``fix_point`` attributes of the input and output tensors of the CNN subgraph running in the DPU.

0 commit comments

Comments
 (0)