sw: Add optimized AXPY, Covariance, SYRK and Doitgen kernels #185

Merged on Aug 29, 2024 (19 commits)

@colluca (Collaborator) commented Aug 21, 2024

  • Allow $(APP)_INCDIRS to be extended on a per-app basis.
  • Extend axpy kernel
    • Support multiple tiles and alternative implementations
    • Optimize TCDM conflicts by appropriately placing data in TCDM
    • Support double-buffering
  • Extend covariance kernel
    • Support multiple tiles and alternative implementations
    • Remove divisions by statically generating reciprocals 1 / n and 1 / (n - 1)
    • Add optimized and baseline implementations
    • Support double-buffering
  • Add syrk and doitgen kernels (double-buffered, optimized).
  • Extend clean-traces target to remove all artifacts of make traces.
  • Remove .gitignore and Makefile files made obsolete by #171 (sw: Switch to non-recursive Make build system).
  • Do not exit on exception in gen_trace.py, so that performance metrics are still dumped.
  • Make all kernels generate data values in the range [-1.0, 1.0], since integer-valued data yields optimistic power results.
  • Increase the error threshold on the GEMM kernel, as operands of opposite sign can cause cancellation and larger errors.
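
To illustrate the covariance item above (replacing divisions with precomputed reciprocals 1 / n and 1 / (n - 1)), here is a minimal sketch in plain C. It is not the kernel from this PR: the sizes `N` and `M`, the constants `inv_n` and `inv_n_m1`, and the function name `covariance` are hypothetical, and tiling, double-buffering and TCDM placement are left out. It only shows the idea that, once the reciprocals are available as constants, the mean and the normalization each become a single multiplication.

```c
// Hypothetical sizes for illustration: N samples of M variables.
#define N 4
#define M 2

// Assumption: the reciprocals are known ahead of time (here as
// compile-time constants), so the kernel executes no FP division.
static const double inv_n = 1.0 / N;
static const double inv_n_m1 = 1.0 / (N - 1);

// data is N x M (one row per sample) and is centered in place.
// cov is the M x M sample covariance matrix.
void covariance(double data[N][M], double cov[M][M]) {
    // Subtract the per-column mean, computed as sum * (1 / n).
    for (int j = 0; j < M; j++) {
        double mean = 0.0;
        for (int i = 0; i < N; i++) mean += data[i][j];
        mean *= inv_n;
        for (int i = 0; i < N; i++) data[i][j] -= mean;
    }
    // Accumulate the upper triangle and mirror it, scaling each
    // entry by 1 / (n - 1) instead of dividing by (n - 1).
    for (int i = 0; i < M; i++) {
        for (int j = i; j < M; j++) {
            double acc = 0.0;
            for (int k = 0; k < N; k++) acc += data[k][i] * data[k][j];
            acc *= inv_n_m1;
            cov[i][j] = acc;
            cov[j][i] = acc;
        }
    }
}
```

Multiplying by a rounded reciprocal can differ from a true division in the last bits, so results should be compared against a reference with a tolerance rather than for exact equality.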

@colluca force-pushed the opt-axpy branch 7 times, most recently from 311fb26 to 5ac30f7 (August 23, 2024 13:09)
@colluca marked this pull request as ready for review (August 29, 2024 07:50)
@colluca requested a review from paulsc96 as a code owner (August 29, 2024 07:50)
@colluca merged commit 3ba276e into main (Aug 29, 2024)
25 of 27 checks passed
@colluca deleted the opt-axpy branch (August 29, 2024 07:51)