DeadLetterQueueUtils#extractSegmentId improvement: replace split with index of and substring methods. by mashhurs · Pull Request #18874 · elastic/logstash

mashhurs · 2026-03-23T17:41:06Z

Release notes

What does this PR do?

DLQ flush operation is heavy, under the hood DeadLetterQueueUtils#extractSegmentId will be called. The String#split is also CPU intensive that internally utilizes Pattern#compile. Since the logic is simple, it can be replaceable with indexOf and substring.

Why is it important/What is the impact to the user?

A bit performance improvement when using DLQ.

Checklist

My code follows the style guidelines of this project
~~[ ] I have commented my code, particularly in hard-to-understand areas~~
~~[ ] I have made corresponding changes to the documentation~~
~~[ ] I have made corresponding change to the default configuration files (and/or docker env variables)~~
I have added tests that prove my fix is effective or that my feature works

Author's Checklist

[ ]

How to test this PR locally

Functionality test can be done by enabling DLQ and changing its settings, such as make size small to make DLQ full.
But performance measurement is a bit hard on powerful machines, so here is the benchmark - #18883

Related issues

Use cases

Screenshots

Logs

…ith simple index of and substring methods.

github-actions · 2026-03-23T17:41:18Z

🤖 GitHub comments

Just comment with:

run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)
run exhaustive tests : Run the exhaustive tests Buildkite pipeline.

mergify · 2026-03-23T17:41:45Z

This pull request does not have a backport label. Could you fix it @mashhurs? 🙏
To fixup this pull request, you need to add the backport labels for the needed
branches, such as:

backport-8./d is the label to automatically backport to the 8./d branch. /d is the digit.
If no backport is necessary, please add the backport-skip label

yaauie · 2026-03-24T07:11:48Z

logstash-core/src/main/java/org/logstash/common/io/DeadLetterQueueUtils.java

    static int extractSegmentId(Path p) {
-        return Integer.parseInt(p.getFileName().toString().split("\\.log")[0]);
+        final String fileName = p.getFileName().toString();
+        final int dotIndex = fileName.indexOf(".log");


What ensures that we're given a Path p whose filename ends with .log?

If we don't, the dotIndex will be -1, and then we'll get an obscure IndexOutOfBoundsException on the next line when invoking fileName.substring(0, -1).

I'd prefer a capturing regex (e.g., ^([0-9]+)[.]log$), and throwing a clear exception if we don't have a match.

Right! Initial intention is to just replace the split with indexOf & substring. There is a listFiles (filters files end with .log) safeguard for current DLQ full situation but I generally agree that the method itself isn't safe. I have added lines to intentionally throw exception for the undesired "file doesn't end with .log" case.

andsel · 2026-03-24T09:41:57Z

FYI I've created #18883 to benchmark the various solutions.

…st cases.

elasticmachine · 2026-03-25T19:30:39Z

💚 Build Succeeded

Buildkite Build
Commit: c1182d2

History

💚 Build #4500 succeeded e8b0e30

DeadLetterQueueUtils: avoid using heavy split operation, replace it w…

e8b0e30

…ith simple index of and substring methods.

mashhurs marked this pull request as draft March 23, 2026 18:10

yaauie reviewed Mar 24, 2026

View reviewed changes

Handle exception case if file name doesn't end with .log. Add unit te…

c1182d2

…st cases.

mashhurs marked this pull request as ready for review March 25, 2026 21:47

mashhurs self-assigned this Mar 25, 2026

mashhurs added the performance improvements label Mar 25, 2026

mashhurs requested a review from yaauie March 25, 2026 21:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DeadLetterQueueUtils#extractSegmentId improvement: replace split with index of and substring methods.#18874

DeadLetterQueueUtils#extractSegmentId improvement: replace split with index of and substring methods.#18874
mashhurs wants to merge 2 commits intoelastic:mainfrom
mashhurs:improve-dlq-segmentid-fetch

mashhurs commented Mar 23, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 23, 2026

Uh oh!

mergify bot commented Mar 23, 2026

Uh oh!

yaauie Mar 24, 2026

Uh oh!

mashhurs Mar 25, 2026

Uh oh!

andsel commented Mar 24, 2026

Uh oh!

elasticmachine commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mashhurs commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Release notes

What does this PR do?

Why is it important/What is the impact to the user?

Checklist

Author's Checklist

How to test this PR locally

Related issues

Use cases

Screenshots

Logs

Uh oh!

github-actions bot commented Mar 23, 2026

🤖 GitHub comments

Uh oh!

mergify bot commented Mar 23, 2026

Uh oh!

yaauie Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

mashhurs Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

andsel commented Mar 24, 2026

Uh oh!

elasticmachine commented Mar 25, 2026

💚 Build Succeeded

History

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mashhurs commented Mar 23, 2026 •

edited

Loading