Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add quantize-stats command for testing quantization #728

Merged
merged 6 commits into from
Apr 7, 2023

Commits on Apr 3, 2023

  1. quantize-stats command

    Command that calculates some statistics over the errors introduced by
    quantization, at the moment mean square error and max error for layer
    weights. Should be useful for testing quantization improvements.
    
    Needs some internal state from ggml and llama that should not be part of
    the public API.
    unbounded committed Apr 3, 2023
    Configuration menu
    Copy the full SHA
    ed667e9 View commit details
    Browse the repository at this point in the history
  2. quantize-stats: show percentiles

    Show some error percentiles, should be less noisy than just the max error.
    unbounded committed Apr 3, 2023
    Configuration menu
    Copy the full SHA
    5b1143e View commit details
    Browse the repository at this point in the history
  3. quantize-stats: use less scratch memory

    Test quantization in smaller chunks instead of layer-at-a-time.
    unbounded committed Apr 3, 2023
    Configuration menu
    Copy the full SHA
    a7d3c3f View commit details
    Browse the repository at this point in the history
  4. quantize-stats: misc improvements

    Show RMSE instead of MSE - keeps similar range to the other metrics.
    Regex match on layer pattern.
    unbounded committed Apr 3, 2023
    Configuration menu
    Copy the full SHA
    d491507 View commit details
    Browse the repository at this point in the history

Commits on Apr 5, 2023

  1. quantize-stats: add option to test against reference quantization

    Expose reference quantization implementation and add option to use it
    for tests.
    unbounded committed Apr 5, 2023
    Configuration menu
    Copy the full SHA
    63cfa43 View commit details
    Browse the repository at this point in the history
  2. Remove "internal" header files

    Move into main header with comment not to use, per PR feedback
    unbounded committed Apr 5, 2023
    Configuration menu
    Copy the full SHA
    41d4a86 View commit details
    Browse the repository at this point in the history