gguf-py: gguf_writer: Use bytearray to build metadata #4051

KerfuffleV2 · 2023-11-12T22:45:25Z

Repeatedly concatenating a bytes object to build the metadata in the GGUF writer is insanely slow. The part that mainly hurts is the vocab arrays, the non-array values are generally too small for the inefficiency to really be noticeable.

With this change, building the metadata for a model goes from around 5-6 sec to instant.

cebtenzzre

I don't think we need a file object here. Why not use bytearray? It has .append and .extend methods.

KerfuffleV2 · 2023-11-12T23:03:47Z

Why not use bytearray?

The stuff I found seemed to indicate BytesIO was faster for this type of use case (appending chunks). This makes sense, since a bytearray has to efficiently support random access to individual elements. I would guess it's not a big difference though and I didn't benchmark it myself.

Bump gguf-py package version

KerfuffleV2 · 2023-11-12T23:17:44Z

@cebtenzzre Okay, so I tested it and they're exactly the same speed. I hacked convert.py to add the vocab metadata 10 times for a model with 64,000 vocab entries. It was like 3.020 sec for one and 3.019 for the other. Since using bytearray requires less modifications I switched to that.

* gguf-py: gguf_writer: Use BytesIO to build metadata * Use bytearray instead Bump gguf-py package version

gguf-py: gguf_writer: Use BytesIO to build metadata

446ee3c

KerfuffleV2 added the script Script related label Nov 12, 2023

cebtenzzre reviewed Nov 12, 2023

View reviewed changes

Use bytearray instead

2393050

Bump gguf-py package version

KerfuffleV2 changed the title ~~gguf-py: gguf_writer: Use BytesIO to build metadata~~ gguf-py: gguf_writer: Use bytearray to build metadata Nov 12, 2023

cebtenzzre approved these changes Nov 12, 2023

View reviewed changes

KerfuffleV2 merged commit 21fd874 into ggml-org:master Nov 12, 2023

KerfuffleV2 deleted the feat-gguf-py-optimize-metadata branch November 17, 2023 03:11

olexiyb pushed a commit to Sanctum-AI/llama.cpp that referenced this pull request Nov 23, 2023

gguf-py: gguf_writer: Use bytearray to build metadata (ggml-org#4051)

73d2aaa

* gguf-py: gguf_writer: Use BytesIO to build metadata * Use bytearray instead Bump gguf-py package version

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gguf-py: gguf_writer: Use bytearray to build metadata #4051

gguf-py: gguf_writer: Use bytearray to build metadata #4051

Uh oh!

KerfuffleV2 commented Nov 12, 2023 •

edited

Loading

Uh oh!

cebtenzzre left a comment

Uh oh!

KerfuffleV2 commented Nov 12, 2023

Uh oh!

KerfuffleV2 commented Nov 12, 2023

Uh oh!

Uh oh!

gguf-py: gguf_writer: Use bytearray to build metadata #4051

gguf-py: gguf_writer: Use bytearray to build metadata #4051

Uh oh!

Conversation

KerfuffleV2 commented Nov 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cebtenzzre left a comment

Choose a reason for hiding this comment

Uh oh!

KerfuffleV2 commented Nov 12, 2023

Uh oh!

KerfuffleV2 commented Nov 12, 2023

Uh oh!

Uh oh!

KerfuffleV2 commented Nov 12, 2023 •

edited

Loading