
Misc. bug: 2 GB at the start #18024

@tim-lebedkov

Description

Name and Version

$ /ucrt64/bin/llama-cli.exe --version
ggml_opencl: selected platform: 'AMD Accelerated Parallel Processing'
ggml_opencl: device: 'Caicos (OpenCL 1.2 AMD-APP (1800.11))'
Unsupported GPU: Caicos
ggml_opencl: drop unsupported device.
ggml_opencl: device: 'AMD Ryzen 7 1700X Eight-Core Processor (OpenCL 1.2 AMD-APP (1800.11))'
Unsupported GPU: AMD Ryzen 7 1700X Eight-Core Processor
ggml_opencl: drop unsupported device.
version: 52594 (c7e6113b84)
built with gcc.exe (Rev8, Built by MSYS2 project) 15.2.0 for x86_64-w64-mingw32

Operating systems

Windows

Which llama.cpp modules do you know to be affected?

libllama (core library)

Command line

Problem description & steps to reproduce

Merely entering main() in a program linked against "ggml", "ggml-base", "mtmd", and "llama" already consumes 2 GB of RAM.
VMMap shows it as 16 blocks of 128 MB each. Are these allocations necessary? Can they be postponed until an LLM is actually used? (A minimal reproduction sketch follows the screenshot below.)

[Screenshot: VMMap view of the process showing the 16 committed 128 MB blocks]
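
For reference, a minimal reproduction sketch in C++. Assumptions: the reservations come from load-time / static initialization of the linked DLLs, and the build line in the comment (MSYS2 UCRT64) is illustrative, not the reporter's exact command:

// Minimal reproduction sketch.
// Hypothetical build line (MSYS2 UCRT64):
//   g++ repro.cpp -o repro.exe -lllama -lmtmd -lggml -lggml-base
#include <cstdio>

#include "llama.h"

int main() {
    // Reference an exported symbol so the linker keeps the DLL
    // dependency, but call nothing: no backend init, no model, no context.
    auto keep = &llama_backend_init;
    (void) keep;

    // Attach VMMap here: per the report, the 16 x 128 MB blocks are
    // already committed although no llama.cpp API has been used.
    std::printf("inside main(), nothing loaded; press Enter to exit\n");
    std::getchar();
    return 0;
}

If the allocations do happen at DLL load time, one hedged workaround sketch is to load llama.dll manually at first use via the Win32 LoadLibrary/GetProcAddress API, deferring the cost until the program actually needs an LLM. The DLL file name is an assumption about this particular build; llama_print_system_info() is a real llama.cpp export, used only to prove the library works after manual loading:

#include <cstdio>

#include <windows.h>

typedef const char * (*llama_print_system_info_t)(void);

int main() {
    std::printf("before LoadLibrary: inspect the process in VMMap now\n");
    std::getchar();

    // Load the library only when an LLM is actually needed; whatever its
    // initializers allocate is paid here instead of at process start.
    HMODULE h = LoadLibraryA("llama.dll"); // DLL name is an assumption
    if (h == nullptr) {
        std::fprintf(stderr, "LoadLibraryA failed: %lu\n", GetLastError());
        return 1;
    }

    auto sys_info = (llama_print_system_info_t) GetProcAddress(h, "llama_print_system_info");
    if (sys_info != nullptr) {
        std::printf("%s\n", sys_info());
    }

    std::printf("after LoadLibrary: compare the VMMap snapshot\n");
    std::getchar();
    return 0;
}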

First Bad Commit

No response

Relevant log output

    Labels

    bug-unconfirmed, needs eval (Needs maintainer evaluation for inclusion viability)
