forward type alignments to allocators #12430

krux02 · 2019-10-15T07:50:17Z

Introduce NIM_ALIGNOF in nimbase.h in the style of NIM_ALIGN.
Add alignment to RTTI.
Forward all alignment constraints of allocations to the allocator (allocator does not support custom alignment yet).
Fixes a problem with SIMD types in lambda lifting.
This is some foundation work to solve Memory corruption on newSeq[MyAlignedType](1) #7865.
Introduce define nimAlignPragma for bootstrapping (original alignas PR didn't need it).

This PR was previeously knows as Fix for newSeq on Aligned Type, since that part isn't solved, I renamed it.

When the tests are passing, this PR should be ready for review (and possibly merge).

cooldome · 2019-11-13T08:25:03Z

I have tested this one and I find it very useful. It makes type alignment information to flow though all the levels way down to allocator. Allocator is not yet alignment aware but it can be fixed in next PR.

One more issue needs to be addressed. Closure environments needs to be aligned too when they capture a variable of aligned type

cooldome · 2019-11-13T08:28:26Z

Test case for closures. Compile in release mode.

type
  m128d {.importc: "__m128d", header: "immintrin.h".} = object

proc add(a: m128d; b: m128d): m128d {.importc: "_mm_add_pd", header: "immintrin.h".}
proc set1*(a: float): m128d {.importc: "_mm_set1_pd", header: "immintrin.h".}
func `+`(a,b: m128d): m128d = add(a, b)


proc lambdaGen(a, b: float) : auto =
  let x1 = set1(2.0 + a)
  let x2 = set1(-23.0 - b)
  let capturingLambda = proc(x: m128d): m128d =
    let cc = x1 + x1
    let bb = x2 + set1(12.5)
    result = cc + bb + x
  return capturingLambda

let f1 = lambdaGen(2.0 , 2.221)
let f2 = lambdaGen(-1.226 , 3.5)

echo f1(set1(2.0))
echo f2(set1(-23.0

lib/system.nim

Araq · 2019-11-27T15:37:24Z

lib/system/alloc.nim

+    when defined(nimAlignPragma):
+      data {.align: MemAlign.}: UncheckedArray[byte]      # start of usable memory
+    else:
+      data: UncheckedArray[byte]


Dangerous for bootstrapping purposes. Why not keep the old, more correct variant for it?

Yea, I thought that you might comment on this part. Well, I changed the value for MemAlign from 8 to 16, this is also alignment guarantee for malloc in C. This will also work for many SIMD data types if native optimizations are not turned on.

Why not keep the old, more correct variant? Well the old variant is with the new value of MemAlign not correct anymore. It is true that alignment constraints aren't met during bootstrapping. But the question is: "Are the alignment constraints necessary during compilation?", and I would say: "No". There are no SIMD types used in the compiler yet. And if there are ever SIMD types used in the compiler, the branch without the align pragme should be dead by then.
I also wouldn't call it "dangerous", everything that might happen is, during Bootstrapping on 32 bit systems, float64 operations might be not as fast as they could be.

lib/system/mmdisp.nim

Araq · 2019-11-27T15:41:51Z

tests/seq/thardalignmentconstraint.nim

+disabled: true
+"""
+
+# does not yet work


Well the tests are green now, at least.

It is disabled. Maybe I should remove it since you don't like disabled tests.

krux02 · 2019-11-29T16:33:46Z

@cooldome you example is incomplete, but it got the point. I could verify it on my end that the example worked with this PR applied. But I can't explain why, I didn't do anything in lambda lifting. All I do is forwarding the alignment information to the allocators. When I added your example to the tests, it failed on 32 bit systems. So I wouldn't say I fixed anything in that direction. But I think this PR will help to make it work in the future.

cooldome · 2019-12-03T14:42:47Z

Somehow I missed that a big progress in this PR. Starting active testing.

@krux02: in 32bit systems the problem that stack it self is not 16 byte aligned hence fixing allocator doesn't safe a day. I gave up SSE vectorization in 32bit mode even in C++. I do the following builds these days: 32 bit unvectorised, 64bit SSE, 64bit AVX, 64bit AVX2. Fixing 32bit would require your own calling convention which out of scope of this PR. This calling convention should be used for proc and closures.

I don't think you need to do anything special in lambda lifting. LL defines the environment object and as long as new(MyEnvObject) is aligned then everything should work

cooldome · 2019-12-03T17:23:03Z

I have tried it, looks like SSE with 16 bytes alignment now works, but not the AVX that needs 32 byte alignment. I guess it boils down to allocator that still have one fixed alignment,

krux02 · 2020-04-17T21:19:14Z

This was very painful to merge with the development branch. I won't do it again. Either merge it or reject it.

Araq · 2020-04-19T05:51:26Z

Irrelevant CI failure, merging.

krux02 added 6 commits October 15, 2019 09:47

first initial version

799763e

add test

7a326e4

keyword selection logic. Disable test.

0d510fb

Merge branch 'devel' into fix-newseq-alignedtype

6a50871

some fixes

9498099

use alignas

c22c3db

krux02 mentioned this pull request Nov 11, 2019

implemented alignas pragma #12643

Merged

krux02 added 3 commits November 22, 2019 13:37

Merge branch 'devel' into fix-newseq-alignedtype

bb8127d

NIM_ALIGNOF

4395f05

use align pragma

7bcbba5

krux02 force-pushed the fix-newseq-alignedtype branch from 8c0ea49 to 7bcbba5 Compare November 22, 2019 13:43

add test, minor cleanup

698676e

krux02 force-pushed the fix-newseq-alignedtype branch from 87bdd83 to 698676e Compare November 22, 2019 17:18

krux02 changed the title ~~WIP: Fix for newSeq on Aligned Type~~ forward type alignments to allocators Nov 22, 2019

krux02 added 3 commits November 23, 2019 00:51

Merge branch 'devel' into fix-newseq-alignedtype

8ce4576

remove test again

fcd78b0

Merge branch 'devel' into fix-newseq-alignedtype

fc6965e

Araq reviewed Nov 27, 2019

View reviewed changes

lib/system.nim Show resolved Hide resolved

Araq reviewed Nov 27, 2019

View reviewed changes

lib/system/mmdisp.nim Show resolved Hide resolved

Araq reviewed Nov 27, 2019

View reviewed changes

cooldome mentioned this pull request Dec 18, 2019

[WIP] Aligned allocation #12926

Closed

krux02 added 3 commits April 17, 2020 04:18

Merge branch 'devel' into fix-newseq-alignedtype

c680cb1

cleanup

a9b7583

more cleanup

4b47a5f

more cleanup

2b3c569

Araq merged commit 4005f0d into nim-lang:devel Apr 19, 2020

shirleyquirk mentioned this pull request Oct 2, 2020

seq[T] does not respect alignof(T) #14642

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

forward type alignments to allocators #12430

forward type alignments to allocators #12430

krux02 commented Oct 15, 2019 •

edited

Loading

cooldome commented Nov 13, 2019 •

edited

Loading

cooldome commented Nov 13, 2019

Araq Nov 27, 2019

krux02 Nov 27, 2019

Araq Nov 27, 2019

krux02 Nov 27, 2019

krux02 commented Nov 29, 2019

cooldome commented Dec 3, 2019

cooldome commented Dec 3, 2019

krux02 commented Apr 17, 2020

Araq commented Apr 19, 2020

forward type alignments to allocators #12430

forward type alignments to allocators #12430

Conversation

krux02 commented Oct 15, 2019 • edited Loading

cooldome commented Nov 13, 2019 • edited Loading

cooldome commented Nov 13, 2019

Araq Nov 27, 2019

Choose a reason for hiding this comment

krux02 Nov 27, 2019

Choose a reason for hiding this comment

Araq Nov 27, 2019

Choose a reason for hiding this comment

krux02 Nov 27, 2019

Choose a reason for hiding this comment

krux02 commented Nov 29, 2019

cooldome commented Dec 3, 2019

cooldome commented Dec 3, 2019

krux02 commented Apr 17, 2020

Araq commented Apr 19, 2020

krux02 commented Oct 15, 2019 •

edited

Loading

cooldome commented Nov 13, 2019 •

edited

Loading