Dial down detail of B-tree description #135761

hkBst · 2025-01-20T10:01:05Z

fixes #134088, though it is a shame to lose some of this wonderful detail.

EDIT: newest versions keep old detail, but move it down a bit.

rustbot · 2025-01-20T10:01:08Z

Could not assign reviewer from: workingjubilee.
User(s) workingjubilee are either the PR author, already assigned, or on vacation, and there are no other candidates.
Use r? to specify someone else to assign.

rustbot · 2025-01-20T10:01:14Z

r? @tgross35

rustbot has assigned @tgross35.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

hanna-kruppe · 2025-01-20T16:54:58Z

library/alloc/src/collections/btree/map.rs

+/// A B-tree resembles a [binary search tree], but each leaf (node) contains
+/// an entire array (of unspecified size) of elements, instead of just a single element.


I realize this talk about leaves is taken from #134088, but it doesn't reflect how BTreeMap works. All nodes (leaf and interior nodes) contain an array of elements, and a search may terminate in any of them, not just in leaves. That's also closer to how binary search trees usually work. B+ trees, a variant of the traditional B-tree, do the "only leaves store elements, interior nodes guide search to the right leaf" thing, but Rust's BTree* types are not like that.

Right, so IIUC each node either has the element in its array, or it is in the left child, or it is in the right child.

Now corrected to "node" instead of "leaf (node)".
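
To make the point about interior nodes concrete, here is a minimal sketch of a lookup in a toy B-tree. The `Node` type, `contains`, and `sample_tree` are hypothetical names invented for this illustration, and the layout is not the one `BTreeMap` uses internally. It assumes the usual B-tree shape in which a node with k keys has either no children (a leaf) or k + 1 children.

```rust
/// A toy B-tree node for illustration only; this is NOT the layout used by
/// the standard library's `BTreeMap`.
struct Node {
    /// Keys stored in this node, in ascending order.
    keys: Vec<u32>,
    /// Empty for a leaf; otherwise exactly `keys.len() + 1` children.
    children: Vec<Node>,
}

/// Returns `true` if `key` is stored anywhere in the subtree rooted at `node`.
fn contains(node: &Node, key: u32) -> bool {
    // Index of the first key in this node that is >= `key` (linear scan).
    let idx = node
        .keys
        .iter()
        .position(|&k| k >= key)
        .unwrap_or(node.keys.len());
    if idx < node.keys.len() && node.keys[idx] == key {
        // Found in this node, which may well be an interior node.
        return true;
    }
    match node.children.get(idx) {
        // Descend into the child that covers the gap where `key` would sit.
        Some(child) => contains(child, key),
        // Leaf node: there is nowhere left to look.
        None => false,
    }
}

/// Builds a small two-level example tree:
///
///            [10, 20]
///   [1, 5]   [12, 17]   [25, 30]
fn sample_tree() -> Node {
    let leaf = |keys: &[u32]| Node { keys: keys.to_vec(), children: Vec::new() };
    Node {
        keys: vec![10, 20],
        children: vec![leaf(&[1, 5]), leaf(&[12, 17]), leaf(&[25, 30])],
    }
}
```

With this tree, `contains(&sample_tree(), 20)` succeeds in the root without ever visiting a leaf, while a lookup of 17 descends into the middle child.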

hanna-kruppe · 2025-01-20T16:56:43Z

library/alloc/src/collections/btree/map.rs

+/// A search first traverses the tree structure to find, in logarithmic time, the correct leaf.
+/// This leaf is then searched linearly, which is very fast on modern hardware.


This may just be collateral damage from the leaves vs interior distinction, but it seems strange to only mention the linear search for leaves. The current implementation does it for every node, if it's mentioned for one it should be mentioned for both. Alternatively, it may be considered an implementation detail that distracts from the main point (logarithmic time complexity w.r.t. total number of elements, despite linear search in each node). But then it shouldn't be mentioned for leaves either. Certainly the way each node is searched is less central to the mechanical sympathy of B-trees than the "B" constant: fewer and larger nodes are obviously good (up to a point), while linear vs binary search is a more nuanced decision.

Yes, this is definitely collateral damage from my faulty guessing at the internal workings. I guess I was too hasty dismissing this possibility as not being compatible with logarithmic behavior, but I see now that I was wrong. Thanks for clarifying.
I don't agree about larger nodes being obviously good. I think it has more to do with the minimum number of bits that a real machine can look at, due to the size of its cache lines or some other detail. If your algorithm uses smaller chunks, then it is wasting bandwidth. So I would say smaller is obviously better (since it gets you from linear search to logarithmic search), but only up to a point.

I've removed this detail and included something about the natural granularity of data for modern machines.

In the context of comparison against BST, anything more than one key per node is a "larger node". You're right that the granularity of memory/storage access (cache lines in main memory, block/page size for disks) is a useful rule of thumb. Larger nodes also make insertion and deletion more expensive. But you're mistaken about search complexity: as the existing comments note, you can maintain optimal number of comparisons (up to a small constant factor) by doing binary search in each node. Sometimes that turns out a bit slower than a linear search, but it's competitive or superior in other cases.

In any case, I don't think it's useful to try and communicate these nuances in the BTreeMap docs.
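
For readers following the linear-versus-binary point, a rough sketch of the two ways to search the sorted key array of a single node. `find_linear` and `find_binary` are hypothetical helper names; only `slice::binary_search` is a real standard-library call. Both return `Ok(index)` on a hit and `Err(insertion_index)` on a miss, so they are interchangeable as far as the surrounding tree walk is concerned. This says nothing about which one is faster in practice.

```rust
/// Linear scan over a sorted slice: up to `keys.len()` comparisons, but the
/// memory access pattern is purely sequential.
fn find_linear(keys: &[u32], key: u32) -> Result<usize, usize> {
    for (i, &k) in keys.iter().enumerate() {
        if k == key {
            return Ok(i);
        }
        if k > key {
            // `key` would have to be inserted before position `i`.
            return Err(i);
        }
    }
    Err(keys.len())
}

/// Binary search over the same sorted slice: roughly log2(keys.len())
/// comparisons, using the standard library's `slice::binary_search`.
fn find_binary(keys: &[u32], key: u32) -> Result<usize, usize> {
    keys.binary_search(&key)
}
```

For a slice of distinct keys such as `[2, 4, 6, 8, 10]`, both functions agree on every probe, which is why the choice between them only affects constant factors within a node.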

hanna-kruppe · 2025-01-20T17:05:12Z

library/alloc/src/collections/btree/map.rs

-/// A B-Tree instead makes each node contain B-1 to 2B-1 elements in a contiguous array. By doing
-/// this, we reduce the number of allocations by a factor of B, and improve cache efficiency in


If you do want to salvage e.g. a brief mention of fewer nodes with more elements being better for reducing heap allocations and cache misses, consider this suggestion from #134088 (comment):

If we do wish to explain the theory behind B-tree performance, we can do it near the end, when we are not crushed for time and space.

I've restored the old comments with the suggested fixes from @danielrainer from #134088 in a new background section below everything.

hanna-kruppe · 2025-01-20T17:09:24Z

library/alloc/src/collections/btree/map.rs

-/// and possibly other factors. Using linear search, searching for a random element is expected
-/// to take B * log(n) comparisons, which is generally worse than a BST. In practice,
-/// however, performance is excellent.
+/// A B-tree resembles a [binary search tree], but each leaf (node) contains


I agree with @workingjubilee (#134088 (comment)) that

our first primary goal is not to explain "why not BinarySearchTreeMap?" We don't have a BinarySearchTreeMap, but we do have HashMap, so our first task is "why should you EVER look at this data structure and not HashMap?" This is true even if we never directly compare, in the type's docs, against HashMap.

So while you're already condensing the existing description, do you think you could also include this aspect? Essentially, say more about this being an "ordered map" before diving into how it maintains order internally.

I've included a short introduction that focuses on the ordered aspect and lifted the existing text about the order produced by iterators up. Also the section about not mucking about with the order is now much closer to the top and less unexpected (hopefully).

tgross35 · 2025-01-20T19:44:28Z

For any of the details that are still correct but just shouldn't be user-facing, leaving them as a non-doc comment seems preferable to removing them.

Cc @workingjubilee if you want to yoink this review since you were involved in #134088

hkBst · 2025-01-21T11:10:43Z

@tgross35 I've restored the old comments (with small corrections from the original bug report) to a new section at the end.

hanna-kruppe · 2025-01-21T17:22:32Z

library/alloc/src/collections/btree/map.rs

-/// triggers a heap-allocation, and every single comparison should be a cache-miss. Since these
-/// are both notably expensive things to do in practice, we are forced to, at the very least,
-/// reconsider the BST strategy.
+/// An ordered map is a map in which the keys are totally ordered.


I've occasionally seen people be confused about ordered maps in this sense vs. "map that maintains some consistent order" (e.g., insertion order) as exemplified by Python's OrderedDict or http://crates.io/crates/ordermap in Rust. I'd throw in something about the entries being stored in sorted order before detailing what determines the sort order.

This is a valuable insight and I can see how the first sentence can be mistaken for talking about a map with insertion order. It now says: "Given a key type with a [total order], an ordered map stores its entries in key order." which I feel is much harder to misunderstand. Let me know if you have more valuable suggestions.
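
The distinction between key order and insertion order is easy to check with a small example. This is only an illustrative sketch; `keys_in_iteration_order` is a hypothetical helper name, while `BTreeMap::insert` and `BTreeMap::keys` are the standard-library methods.

```rust
use std::collections::BTreeMap;

/// Inserts entries in a deliberately unsorted order and returns the keys in
/// the order in which `BTreeMap` iterates over them.
fn keys_in_iteration_order() -> Vec<&'static str> {
    let mut map = BTreeMap::new();
    map.insert("pear", 3);
    map.insert("apple", 1);
    map.insert("mango", 2);
    // Iteration follows the `Ord` implementation of the key type,
    // not the order of the `insert` calls above.
    map.keys().copied().collect()
}
```

The returned keys are `["apple", "mango", "pear"]`: sorted by key, regardless of insertion order. An insertion-ordered map would instead yield pear, apple, mango.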

hanna-kruppe

I do not have any further suggestions but I gave up my bors permissions in 2020 (wow, that long ago?) when I stopped contributing regularly, so I can't move this any further. It's up to @tgross35 or another reviewer if they're too busy.

hanna-kruppe · 2025-02-05T16:32:50Z

library/alloc/src/collections/btree/map.rs

+///
+/// # Background
+///
+/// A B-tree is (like) a [binary search tree], but adapted to the natural granularity that modern machines like to consume data at.


Isn't this line a little too long? But I don't see tidy complaining so idk, just looks out of place 🤷

Yeah, I think this should be wrapped.

hkBst · 2025-02-19T14:36:12Z

@tgross35 or @workingjubilee, would be great if you could take a look.

tgross35 · 2025-03-04T20:28:28Z

I didn't realize this was still assigned to me, I'm not entirely sure what the vision from the original issue was

r? @workingjubilee

rustbot · 2025-03-04T20:28:30Z

Could not assign reviewer from: workingjubilee.
User(s) workingjubilee are either the PR author, already assigned, or on vacation. Please use r? to specify someone else to assign.

tgross35 · 2025-03-04T20:29:02Z

Ah sorry, I'll take that back then and try to have a look

hkBst · 2025-07-14T08:13:12Z

@tgross35 friendly ping

tgross35 · 2025-07-29T03:22:24Z

Argh sorry, I thought this got rerolled a long time ago (probably should have). I still have no strong thoughts here so I'm just going to do that now

r? libs

ibraheemdev · 2025-08-19T03:47:43Z

library/alloc/src/collections/btree/map.rs

+/// triggers a heap-allocation, and for every comparison a node needs to be loaded,
+/// which could result in a cache miss. Since both heap-allocations and cache-misses


Suggested change

-/// triggers a heap-allocation, and for every comparison a node needs to be loaded,
-/// which could result in a cache miss. Since both heap-allocations and cache-misses
+/// triggers a heap-allocation, and comparison is a potential cache-miss due to the indirection.
+/// Since both heap-allocations and cache-misses

ibraheemdev · 2025-08-19T03:49:18Z

r=me after the nits.

rustbot · 2025-08-24T10:50:30Z

This PR was rebased onto a different master commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

hkBst · 2025-08-24T12:20:04Z

@rustbot ready

ibraheemdev · 2025-08-24T19:18:45Z

@bors r+ rollup

bors · 2025-08-24T19:18:48Z

📌 Commit 1b77387 has been approved by ibraheemdev

It is now in the queue for this repository.

Dial down detail of B-tree description fixes rust-lang#134088, though it is a shame to lose some of this wonderful detail. r? `@workingjubilee` EDIT: newest versions keep old detail, but move it down a bit.


Rollup of 6 pull requests Successful merges: - #135761 (Dial down detail of B-tree description) - #144373 (remove deprecated Error::description in impls) - #145620 (Account for impossible bounds making seemingly unsatisfyable dyn-to-dyn casts) - #145783 (add span to struct pattern rest (..)) - #145817 (cg_llvm: Replace the `llvm::Bool` typedef with a proper newtype) - #145820 (raw-dylib-elf: set correct `DT_VERDEFNUM`) r? `@ghost` `@rustbot` modify labels: rollup


Rollup of 5 pull requests Successful merges: - #135761 (Dial down detail of B-tree description) - #144373 (remove deprecated Error::description in impls) - #145620 (Account for impossible bounds making seemingly unsatisfyable dyn-to-dyn casts) - #145817 (cg_llvm: Replace the `llvm::Bool` typedef with a proper newtype) - #145820 (raw-dylib-elf: set correct `DT_VERDEFNUM`) r? `@ghost` `@rustbot` modify labels: rollup


Zalathar · 2025-08-25T05:41:42Z

Tracking down a failure in rollup #145833.

@bors try jobs=dist-various-2

Dial down detail of B-tree description try-job: dist-various-2

rust-bors · 2025-08-25T06:51:20Z

☀️ Try build successful (CI)
Build commit: 776e4d7 (776e4d77395f0403e79a600d5b16c3ca0e6249af, parent: a1dbb443527bd126452875eb5d5860c1d001d761)

Rollup of 10 pull requests Successful merges: - #135761 (Dial down detail of B-tree description) - #145620 (Account for impossible bounds making seemingly unsatisfyable dyn-to-dyn casts) - #145788 (Fix attribute target checking for macro calls) - #145794 (bootstrap.py: Improve CPU detection on NetBSD) - #145817 (cg_llvm: Replace the `llvm::Bool` typedef with a proper newtype) - #145820 (raw-dylib-elf: set correct `DT_VERDEFNUM`) - #145828 (Update `bitflags` to 2.9.3.) - #145830 (Remove the lifetime from `ExpTokenPair`/`SeqSep`.) - #145836 (Remove outdated bug comments) - #145842 (rustc-dev-guide subtree update) r? `@ghost` `@rustbot` modify labels: rollup

Rollup merge of #135761 - hkBst:patch-9, r=ibraheemdev Dial down detail of B-tree description fixes #134088, though it is a shame to lose some of this wonderful detail. r? `@workingjubilee` EDIT: newest versions keep old detail, but move it down a bit.


Rollup of 10 pull requests Successful merges: - rust-lang/rust#135761 (Dial down detail of B-tree description) - rust-lang/rust#145620 (Account for impossible bounds making seemingly unsatisfyable dyn-to-dyn casts) - rust-lang/rust#145788 (Fix attribute target checking for macro calls) - rust-lang/rust#145794 (bootstrap.py: Improve CPU detection on NetBSD) - rust-lang/rust#145817 (cg_llvm: Replace the `llvm::Bool` typedef with a proper newtype) - rust-lang/rust#145820 (raw-dylib-elf: set correct `DT_VERDEFNUM`) - rust-lang/rust#145828 (Update `bitflags` to 2.9.3.) - rust-lang/rust#145830 (Remove the lifetime from `ExpTokenPair`/`SeqSep`.) - rust-lang/rust#145836 (Remove outdated bug comments) - rust-lang/rust#145842 (rustc-dev-guide subtree update) r? `@ghost` `@rustbot` modify labels: rollup

rustbot assigned tgross35 Jan 20, 2025

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Jan 20, 2025

hanna-kruppe reviewed Jan 20, 2025


hkBst requested a review from hanna-kruppe January 21, 2025 11:11

hanna-kruppe reviewed Jan 21, 2025


hkBst requested a review from hanna-kruppe February 5, 2025 13:21

hanna-kruppe approved these changes Feb 5, 2025


tgross35 assigned workingjubilee and unassigned tgross35 Mar 4, 2025

tgross35 assigned tgross35 and unassigned workingjubilee Mar 4, 2025

rustbot assigned ibraheemdev and unassigned tgross35 Jul 29, 2025

ibraheemdev reviewed Aug 19, 2025


ibraheemdev added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Aug 19, 2025

rust-cloud-vms bot force-pushed the patch-9 branch from a9447d1 to f0e44d6 Compare August 24, 2025 10:30


Prevent confusion with insertion-ordered maps.

1b77387

rust-cloud-vms bot force-pushed the patch-9 branch from f0e44d6 to 1b77387 Compare August 24, 2025 10:50

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Aug 24, 2025

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Aug 24, 2025

jhpratt mentioned this pull request Aug 24, 2025

Rollup of 6 pull requests #145831

Closed

jhpratt mentioned this pull request Aug 24, 2025

Rollup of 5 pull requests #145832

Closed

Zalathar mentioned this pull request Aug 25, 2025

Rollup of 5 pull requests #145833

Closed


rust-bors bot added a commit that referenced this pull request Aug 25, 2025

Auto merge of #135761 - hkBst:patch-9, r=<try>

776e4d7

Dial down detail of B-tree description try-job: dist-various-2

Zalathar mentioned this pull request Aug 25, 2025

Rollup of 10 pull requests #145844

Merged

bors merged commit 9b46273 into rust-lang:master Aug 25, 2025
11 checks passed

rustbot added this to the 1.91.0 milestone Aug 25, 2025
