chacha20: Improve AVX2 performance #261

Merged
3 commits merged on Aug 9, 2021

Commits on Aug 9, 2021

  1. chacha20: Add a backend::avx2::StateWord helper union

    This removes a bunch of instructions for accessing the 128-bit lanes.
    str4d committed Aug 9, 2021
    905d776
  2. 7eac360
  3. chacha20: Optimise diagonalization in SSE2 and AVX2 backends

    The `b` state word is on the hot path, so we pivot the diagonalization
    to move the shuffles onto the other state words. See the code comment,
    or sneves/blake2-avx2#4 for additional details.
    str4d committed Aug 9, 2021
    7f11302
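
The pivot described in the third commit can be illustrated with a portable scalar sketch. This is not the code from the PR, which uses SSE2/AVX2 shuffle intrinsics; the names `Row`, `rotate_lanes`, `column_round`, `diagonal_round_classic` and `diagonal_round_pivoted` are hypothetical and exist only for this example. The sketch assumes the usual ChaCha layout of four rows `a`, `b`, `c`, `d` with four 32-bit lanes each. It shows that leaving `b` in place and rotating `a`, `c`, `d` instead reaches the same diagonals as the textbook approach of rotating `b`, `c`, `d`. The likely benefit, as the commit message suggests, is that `b` is the last word written by one quarter round and the first one read by the next, so keeping shuffles off it shortens the dependency chain.

```rust
type Row = [u32; 4];

/// Rotate the lanes of a row left by `n` positions: out[i] = row[(i + n) % 4].
fn rotate_lanes(row: Row, n: usize) -> Row {
    [row[n % 4], row[(n + 1) % 4], row[(n + 2) % 4], row[(n + 3) % 4]]
}

/// ChaCha quarter round applied to each of the four columns.
fn column_round(a: &mut Row, b: &mut Row, c: &mut Row, d: &mut Row) {
    for i in 0..4 {
        a[i] = a[i].wrapping_add(b[i]);
        d[i] = (d[i] ^ a[i]).rotate_left(16);
        c[i] = c[i].wrapping_add(d[i]);
        b[i] = (b[i] ^ c[i]).rotate_left(12);
        a[i] = a[i].wrapping_add(b[i]);
        d[i] = (d[i] ^ a[i]).rotate_left(8);
        c[i] = c[i].wrapping_add(d[i]);
        b[i] = (b[i] ^ c[i]).rotate_left(7);
    }
}

/// Textbook diagonal round: shuffle b, c, d into columns, round, shuffle back.
fn diagonal_round_classic(s: [Row; 4]) -> [Row; 4] {
    let [mut a, b, c, d] = s;
    let mut b = rotate_lanes(b, 1);
    let mut c = rotate_lanes(c, 2);
    let mut d = rotate_lanes(d, 3);
    column_round(&mut a, &mut b, &mut c, &mut d);
    [a, rotate_lanes(b, 3), rotate_lanes(c, 2), rotate_lanes(d, 1)]
}

/// Pivoted diagonal round: b stays untouched, a, c, d are shuffled instead.
fn diagonal_round_pivoted(s: [Row; 4]) -> [Row; 4] {
    let [a, mut b, c, d] = s;
    let mut a = rotate_lanes(a, 3);
    let mut c = rotate_lanes(c, 1);
    let mut d = rotate_lanes(d, 2);
    column_round(&mut a, &mut b, &mut c, &mut d);
    [rotate_lanes(a, 1), b, rotate_lanes(c, 3), rotate_lanes(d, 2)]
}
```

Both functions take and return the state in its normal row order, so calling them on the same input should give identical output. Both variants issue the same number of lane rotations per diagonal round (three in, three out); the pivoted form only changes which words carry them.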