Skip to content

FPGA: Add note to large designs READMEs to warn users about large FPGA parts requirements #1407

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Mar 7, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/crr/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,8 @@ You can also find more information about [troubleshooting build errors](/DirectP
>
> :warning: Make sure you add the device files associated with the FPGA that you are targeting to your Intel® Quartus® Prime installation.

> **Note**: You'll need a large FPGA part to be able to fit this design

### Performance

Performance results are based on testing as of July 20, 2020.
Expand Down
2 changes: 2 additions & 0 deletions DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/db/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -54,6 +54,8 @@ You can also find more information about [troubleshooting build errors](/DirectP
>
> :warning: Make sure you add the device files associated with the FPGA that you are targeting to your Intel® Quartus® Prime installation.

> **Note**: You'll need a large FPGA part to be able to fit the query 9 variant of this design

### Performance

In this design, we accelerate four database queries as **offload accelerators**. In an offload accelerator scheme, the queries are performed by transferring the relevant data from the CPU host to the FPGA, starting the query kernel on the FPGA, and copying the results back. This means that the relevant performance number is the processing time (the wall clock time) from when the query is requested to the time the output data is accessible by the host. This includes the time to transfer data between the CPU and FPGA over PCIe (with an approximate read and write bandwidth of 6877 and 6582 MB/s, respectively). Most of the total query time is spent transferring the data between the CPU and FPGA, and the query kernels themselves are a small portion of the total latency.
Expand Down