[LangRef] Relax semantics of writeonly / memory(write) #95238

nikic · 2024-06-12T12:40:28Z

Instead of making reads immediate undefined behavior, consider these attributes in terms of their externally observable effects. We don't care if a location is read within the function, as long as the read has no impact on observed behavior. In particular, allow:

* Reading a location after writing it.
* Reading a location before writing it (within the function), in which case the read returns a poison value.

The latter could be further relaxed to also allow things like "reading the value and then writing it back", but I'm not sure how one would specify that operationally (so that proof checkers can verify it).
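The two allowed cases can be illustrated with a toy model. This is only a sketch and not how LLVM or any verifier represents memory: `WriteOnlyView`, `POISON`, and `callee` are hypothetical names introduced here, and a Python dict stands in for caller-visible memory.

```python
class Poison:
    """Stand-in for an LLVM poison value in this toy model (hypothetical)."""

    def __repr__(self):
        return "poison"


POISON = Poison()


class WriteOnlyView:
    """Toy view of caller memory as seen by a memory(write) callee."""

    def __init__(self, memory):
        self._memory = memory   # caller-visible memory, mutated by stores
        self._written = set()   # locations this call has already stored to

    def store(self, addr, value):
        self._memory[addr] = value
        self._written.add(addr)

    def load(self, addr):
        # A load after a store in the same call sees the stored value.
        # A load before any store yields poison, never the caller's value.
        if addr in self._written:
            return self._memory[addr]
        return POISON


def callee(mem):
    early = mem.load("p")   # read before write: poison
    mem.store("p", 42)
    late = mem.load("p")    # read after write: the value just stored
    return early, late


caller_memory = {"p": 7}
early, late = callee(WriteOnlyView(caller_memory))
```

In this model the caller's original value (7) never reaches the callee, which is the sense in which the reads have no externally observable effect.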

While here, also explicitly mention that reads and writes to allocas, and reads from constant globals, are compatible with `memory(none)`.

Fixes #95152.

llvmbot · 2024-06-12T12:41:01Z

@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-llvm-ir

Author: Nikita Popov (nikic)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/95238.diff

1 Files Affected:

(modified) llvm/docs/LangRef.rst (+19-2)

diff --git a/llvm/docs/LangRef.rst b/llvm/docs/LangRef.rst
index f39b8dc6c90d4..315baad5c6e81 100644
--- a/llvm/docs/LangRef.rst
+++ b/llvm/docs/LangRef.rst
@@ -1598,8 +1598,10 @@ Currently, only the following parameter attributes are defined:
     through this pointer argument (even though it may read from the memory that
     the pointer points to).
 
-    If a function reads from a writeonly pointer argument, the behavior is
-    undefined.
+    This attribute is understood in the same way as the ``memory(write)``
+    attribute. That is, the pointer may still be read as long as the read is
+    not observable outside the function. See the ``memory`` documentation for
+    precise semantics.
 
 ``writable``
     This attribute is only meaningful in conjunction with ``dereferenceable(N)``
@@ -1973,6 +1975,21 @@ example:
     - ``memory(readwrite, argmem: none)``: May access any memory apart from
       argument memory.
 
+    The supported access kinds are:
+
+    - ``readwrite``: Any kind of access to the location is allowed.
+    - ``read``: The location is only read. Writing the location is immediate
+      undefined behavior. This includes the case where the location is read and
+      then the same value is written back.
+    - ``write``: Only writes to the location are observable outside the function
+      call. However, the function may still internally read the location after
+      writing it, as this is not observable. Reading the location prior to
+      writing it results in a poison value.
+    - ``none``: No reads or writes to the location are observed outside the
+      function. It is always valid to read and write allocas, and to read global
+      constants, even if ``memory(none)`` is used, as these effects are not
+      externally observable.
+
     The supported memory location kinds are:
 
     - ``argmem``: This refers to accesses that are based on pointer arguments

nunoplopes · 2024-06-12T12:57:45Z

I understand the motivation, and the text looks good.

But this is going to be painful to implement in Alive2 😅 The semantics amount to quantifying over the input memory to a function and ensuring that it returns the same value/memory for every input memory. It's UB if not. Well, I guess the read(none) is similar.
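The quantification described above can be sketched as a brute-force check over a small, finite set of input memories. This is an illustrative toy and not Alive2's encoding: `RecordingMemory`, `independent_of_input_memory`, and the three sample functions are hypothetical names, and memory is modeled as a Python dict.

```python
from itertools import product


class RecordingMemory(dict):
    """Dict-backed toy memory that records every store made through it."""

    def __init__(self, initial):
        super().__init__(initial)   # initial contents are not counted as stores
        self.writes = {}

    def __setitem__(self, addr, value):
        super().__setitem__(addr, value)
        self.writes[addr] = value


def independent_of_input_memory(func, addresses, values):
    """Check that return value and stores agree for every input memory."""
    observed = []
    for contents in product(values, repeat=len(addresses)):
        memory = RecordingMemory(zip(addresses, contents))
        returned = func(memory)
        observed.append((returned, dict(memory.writes)))
    return all(result == observed[0] for result in observed)


def overwrite(mem):
    mem["p"] = 1
    return mem["p"]          # read after write: same result for any input


def increment(mem):
    mem["p"] = mem["p"] + 1  # stored value depends on the input memory


def write_back(mem):
    mem["p"] = mem["p"]      # "read the value and write it back"


overwrite_ok = independent_of_input_memory(overwrite, ["p"], [0, 1, 2])
increment_ok = independent_of_input_memory(increment, ["p"], [0, 1, 2])
write_back_ok = independent_of_input_memory(write_back, ["p"], [0, 1, 2])
```

Under this check `write_back` is rejected, because the stored value varies with the input memory. That matches the PR description's note that the "read and write back" pattern is not covered by the relaxed wording.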

aeubanks

thanks, this makes sense

nikic requested review from nunoplopes, aeubanks and efriedma-quic June 12, 2024 12:40

llvmbot added the llvm:ir label Jun 12, 2024

nikic mentioned this pull request Jun 12, 2024

[MemCpyOpt] Call slot optimization doesn't respect writeonly #95152

Closed

aeubanks approved these changes Jun 12, 2024

llvmbot added the llvm:transforms label Jun 17, 2024

dtcxzyw reviewed Jun 17, 2024

wording

484ab8b

nikic force-pushed the langref-writeonly branch from 10c7548 to 484ab8b Compare June 17, 2024 09:15

fix typo

704831e

nikic merged commit 9cbedd9 into llvm:main Jun 19, 2024
8 checks passed

nikic deleted the langref-writeonly branch June 19, 2024 07:45
