Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: ggml-org/llama.cpp
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: master@{1day}
Choose a base ref
...
head repository: ggml-org/llama.cpp
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: master
Choose a head ref
  • 16 commits
  • 46 files changed
  • 11 contributors

Commits on Aug 14, 2025

  1. Configuration menu
    Copy the full SHA
    8b24837 View commit details
    Browse the repository at this point in the history
  2. ggml: fix ggml_conv_1d_dw bug (ggml/1323)

    * ggml: fix ggml_conv_1d_dw bug
    
    * Fixed conv1d_dw weight tensor dimension.
    jason-ni authored and ggerganov committed Aug 14, 2025
    Configuration menu
    Copy the full SHA
    5ade300 View commit details
    Browse the repository at this point in the history
  3. sync : ggml

    ggml-ci
    ggerganov committed Aug 14, 2025
    Configuration menu
    Copy the full SHA
    3973163 View commit details
    Browse the repository at this point in the history
  4. server : add SWA checkpoints (#15293)

    * server : add SWA checkpoints
    
    ggml-ci
    
    * cont : server clean-up
    
    * server : handle state restore fails
    
    * llama : add extended llama_state_seq_ API
    
    * server : do not make checkpoints if --swa-full
    
    ggml-ci
    
    * llama : remove flags value for NONE
    
    * server : configure number of SWA checkpoints with CLI arg
    
    ggml-ci
    
    * args : fix scope of new argument
    ggerganov authored Aug 14, 2025
    Configuration menu
    Copy the full SHA
    d32e03f View commit details
    Browse the repository at this point in the history
  5. vulkan: perf_logger improvements (#15246)

    * vulkan: perf_logger improvements
    
    - Account for batch dimension in flops calculation.
    - Fix how "_VEC" is detected for mat_mul_id.
    - Fix "n" dimension for mat_mul_id (in case of broadcasting).
    - Include a->type in name.
    
    * use <=mul_mat_vec_max_cols rather than ==1
    jeffbolznv authored Aug 14, 2025
    Configuration menu
    Copy the full SHA
    863d341 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    1a01899 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    646944c View commit details
    Browse the repository at this point in the history
  8. gpt-oss: implement harmony parsing (#15181)

    * model : add harmony parser for gpt-oss
    
    * gpt-oss : fix grammar trigger from causing empty stack
    
    * gpt-oss: tweak the grammar trigger again
    
    * gpt-oss : add support for recipient in role header
    
    * gpt-oss : fix ungrouped tool calls in grammar
    
    * gpt-oss : loosen function name matching during parse
    
    * gpt-oss : clean up workarounds
    
    * gpt-oss : add template tests
    
    * gpt-oss : simulate thinking and tool call tags
    
    * gpt-oss : undo think tags when reasoning_format is none
    
    * gpt-oss : set special tokens back to user defined
    
    * gpt-oss : update openai-gpt-oss template
    
    * server : filter out harmony thought messages
    
    * gpt-oss : simplify parsing
    aldehir authored Aug 14, 2025
    Configuration menu
    Copy the full SHA
    b204a5a View commit details
    Browse the repository at this point in the history
  9. HIP: Cleanup hipification header (#15285)

    add expicit conversion operator to support older versions of rocm
    Switch over to hip_bf16 from legacy hip_bfloat16
    Simplify RDNA3 define
    Reduce swap over of new hipblas api to rocm 6.5 as this version is used for rocm 7.0 previews
    
    ---------
    
    Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
    IMbackK and JohannesGaessler authored Aug 14, 2025
    Configuration menu
    Copy the full SHA
    5ba36f6 View commit details
    Browse the repository at this point in the history
  10. devops : fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on…

    … Ubuntu 24.04 (#15005)
    
    fixes #15004
    
    Co-authored-by: Paolo Greppi <paolo.greppi@libpf.com>
    simevo and Paolo Greppi authored Aug 14, 2025
    Configuration menu
    Copy the full SHA
    e4e9159 View commit details
    Browse the repository at this point in the history
  11. llama : add 18-layer model type for Gemma 3-270m (#15319)

    This commit adds support for the 18-layer model type in the Gemma3
    series, which is the size of the Gemma3-270m model.
    
    The motivation for this commit is was the only change required for
    Gemma3-270m to be converted to GGUF format and used with llama.cpp.
    
    Once the model has been converted and uploaded to Huggingface it can be
    used like this:
    ```console
    $ ./build/bin/llama-cli -hf ggml-org/gemma-3-270m-GGUF:Q8_0
    ```
    danbev authored Aug 14, 2025
    Configuration menu
    Copy the full SHA
    7a0de96 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    f75b830 View commit details
    Browse the repository at this point in the history
  13. eval-callback : stop on first NaN (#15320)

    * eval-callback : stop on first NaN
    
    * cont : log error
    ggerganov authored Aug 14, 2025
    Configuration menu
    Copy the full SHA
    df36bce View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    4227c9b View commit details
    Browse the repository at this point in the history

Commits on Aug 15, 2025

  1. test-opt: fix backend support check (#15317)

    * test-opt: fix backend support check
    
    * Update tests/test-opt.cpp
    
    Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
    
    ---------
    
    Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
    JohannesGaessler and ggerganov authored Aug 15, 2025
    Configuration menu
    Copy the full SHA
    b07791a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7aeee88 View commit details
    Browse the repository at this point in the history
Loading