Improve speed by moving function collections to stack arrays #12

TheDan64 · 2017-09-09T15:55:54Z

A common pattern being used looks something like this:

fn inkwell_does_stuff(input: &[&InkwellValue]) {
    let mut input = Vec<LLVMValueRef> = input.iter().map(|val| val.as_value_ref()).collect(); 

    // call LLVM function with raw ptr to input
}

What we're basically doing is taking a bunch of our own types and mapping them to a sequence of raw llvm pointers. The problem is that Vec will allocate on the heap, but this sequence is only needed for the scope of the function and never gets directly returned. So, I think we can improve this pattern by using something like arrayvec which stores the contents on the stack. This should also work really well because we never modify the size of these vectors, just use them as an intermediate data location for LLVM to read from.

Alternatively, if there's a way to just collect into a a stack slice, that would work too.

This might make for a good first PR if anyone's interested since it doesn't require technical knowledge of LLVM, just Rust.

TheDan64 · 2017-09-11T00:46:26Z

This might not actually be possible with modern rust. Apparently it might require VLA (variable length arrays), which do not exist today because they're easy to blow through the stack with. Will remove the milestone but keep the issue open for future review.

Arrayvec could work today if we specify a compile time upper bound, but I'm not sure we want to do that because it seems like an artificial limitation to our users.

TheDan64 · 2017-09-11T16:09:55Z

Related RFC issue: rust-lang/rfcs#618

TheDan64 · 2018-05-09T21:05:03Z

If we don't have rust support for this (and this does indeed turn out to be a significant speed issue - which it may not) we could add some sort of scoped helper method which takes a fn and calls it on a C alloca/malloca ptr (possibly transmuting into a reference since we know the lifetime is valid for the scope - though the data might be uninitialized so the fn should be unsafe). Bonus points if the helper method can make os calls to check that there's enough remaining stack space to make the stack allocation, returning a Result/Err if not. Though it's worth noting there are possible issues when inlining functions which use alloca since the effective lifetime is extended to a larger scope. Maybe this is worth exploring in a separate crate: alloca/scoped_alloca both seem like available names

TheDan64 · 2019-04-29T23:50:19Z

rust-lang/rust#48055 might be a better approach to this

TheDan64 · 2019-08-08T00:49:55Z

Or https://github.com/HeroicKatora/static-alloc

TheDan64 · 2019-08-08T03:27:25Z

Added experimental feature flag for testing this: 673a3d5

TheDan64 added enhancement help wanted labels Sep 9, 2017

TheDan64 added this to the 0.2.0 milestone Sep 9, 2017

TheDan64 removed this from the 0.2.0 milestone Sep 11, 2017

TheDan64 added the blocked label Sep 13, 2017

TheDan64 added wishlist and removed help wanted labels Nov 10, 2017

seanyoung mentioned this issue Dec 4, 2020

test_module::test_garbage_ir_fails_create_module_from_ir triggers assertion in llvm 10 #198

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve speed by moving function collections to stack arrays #12

Improve speed by moving function collections to stack arrays #12

TheDan64 commented Sep 9, 2017

TheDan64 commented Sep 11, 2017 •

edited

Loading

TheDan64 commented Sep 11, 2017 •

edited

Loading

TheDan64 commented May 9, 2018

TheDan64 commented Apr 29, 2019

TheDan64 commented Aug 8, 2019

TheDan64 commented Aug 8, 2019

Improve speed by moving function collections to stack arrays #12

Improve speed by moving function collections to stack arrays #12

Comments

TheDan64 commented Sep 9, 2017

TheDan64 commented Sep 11, 2017 • edited Loading

TheDan64 commented Sep 11, 2017 • edited Loading

TheDan64 commented May 9, 2018

TheDan64 commented Apr 29, 2019

TheDan64 commented Aug 8, 2019

TheDan64 commented Aug 8, 2019

TheDan64 commented Sep 11, 2017 •

edited

Loading

TheDan64 commented Sep 11, 2017 •

edited

Loading