Revert LLAMA_NATIVE to OFF in flake.nix #5066
Merged
I've noticed that since PR #4605, performance (CPU-only) took a massive dive when using the Nix flake (I went from ~4 tokens/s to <0.5). It seems that the slowdown is caused by `LLAMA_NATIVE=ON`. Reverting it to `OFF` (as it was before that PR) restores the expected performance. This regression was observed on both an i7-1165G7 and a Ryzen 3800X running NixOS.
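For reference, here is a minimal sketch of what such a revert could look like in a Nix package definition. The file layout, attribute names, and helpers used by the actual llama.cpp flake may differ; this is an illustrative assumption, not the real diff:

```nix
# Hypothetical excerpt of a package definition imported by flake.nix;
# the real llama.cpp flake may be structured differently.
{ lib, stdenv, cmake }:

stdenv.mkDerivation {
  pname = "llama-cpp";
  version = "unstable";
  src = ./.;

  nativeBuildInputs = [ cmake ];

  cmakeFlags = [
    # Revert to OFF: LLAMA_NATIVE=ON compiles with -march=native, tuned to
    # the build host; OFF keeps the binary portable and the Nix build
    # reproducible across machines.
    (lib.cmakeBool "LLAMA_NATIVE" false)
  ];
}
```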
FWIW, the llama-cpp package in nixpkgs has `LLAMA_NATIVE=OFF`. I'm not sure what the implications of turning off `LLAMA_NATIVE` are; maybe @philiptaron and @SomeoneSerge want to chime in.