[LLVM-Reduce] - Distinct Metadata Reduction #96072
Thank you for submitting a Pull Request (PR) to the LLVM Project! This PR will be automatically labeled and the relevant teams will be notified. If you wish to, you can add reviewers by using the "Reviewers" section on this page. If this is not working for you, it is probably because you do not have write permissions for the repository. If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR in a comment. If you have further questions, they may be answered by the LLVM GitHub User Guide. You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.
You can test this locally with the following command:

```shell
darker --check --diff -r 4d6b9921b3801709dca9245b5b4d7c17944a036f...211101936383973e0895f8ca1570c4b52acab335 llvm/test/tools/llvm-reduce/Inputs/reduce-distinct-metadata.py llvm/test/tools/llvm-reduce/Inputs/remove-metadata.py
```

View the diff from darker here:

```diff
--- reduce-distinct-metadata.py  2024-06-19 13:48:15.000000 +0000
+++ reduce-distinct-metadata.py  2024-06-20 10:12:10.486785 +0000
@@ -3,17 +3,17 @@
 import sys
 import re
 input = open(sys.argv[1], "r").read().splitlines()
-depth_map = {"0": 1, "1": 3, "2": 3, "3": 2, "4" : 1}
+depth_map = {"0": 1, "1": 3, "2": 3, "3": 2, "4": 1}
 for i in range(len(depth_map)):
-    counter = 0
-    for line in input:
-        if re.match(rf".*interesting_{i}.*", line) != None:
-            counter += 1
-    if counter != depth_map[str(i)]:
-        sys.exit(1)
+    counter = 0
+    for line in input:
+        if re.match(rf".*interesting_{i}.*", line) != None:
+            counter += 1
+    if counter != depth_map[str(i)]:
+        sys.exit(1)
 sys.exit(0)
```
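For context, the script in the diff above is an llvm-reduce interestingness test: it exits 0 when the candidate file still contains the expected number of `interesting_N` markers at each depth, and 1 otherwise. A minimal self-contained sketch of the same counting logic follows; the sample input is inlined here for illustration instead of being read from `sys.argv[1]`, and `re.search` replaces the equivalent `re.match(rf".*...*")` pattern:

```python
import re

# Sample candidate output, standing in for the file passed as sys.argv[1].
sample = """
!0 = distinct !{!"interesting_0"}
!1 = distinct !{!"interesting_1", !2}
!2 = !{!"interesting_1"}
""".splitlines()

# Expected occurrence count per depth, mirroring depth_map in the test.
depth_map = {"0": 1, "1": 2}

def is_interesting(lines, depth_map):
    # The candidate is interesting only if every marker appears
    # exactly the expected number of times.
    for i in range(len(depth_map)):
        counter = sum(
            1 for line in lines
            if re.search(rf"interesting_{i}", line) is not None
        )
        if counter != depth_map[str(i)]:
            return False
    return True

print(is_interesting(sample, depth_map))  # True for the sample above
```

In the real test, `return False` corresponds to `sys.exit(1)` and `return True` to `sys.exit(0)`, since llvm-reduce only inspects the script's exit code.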
Hi @rbintel! Thank you for the pull request. I added some initial comments on the style below.
Thank you! I think the change looks good to me. One last small ask: please update the files that are missing newlines at the end. For .ll files adding a trailing newline is good practice, but for Python files a missing final newline is a PEP 8 error.
We should only ever be adding uses of it, not removing it. We should always be trying to avoid invalid IR.
// Named metadata with simple list-like behavior, so that it's valid to remove
// operands individually.
We should preserve this. If we want a mode where we blindly rip operands out of anything we don't understand, it should probably be behind a flag.
…m#102637) This prevents some unnecessary conversions to/from int64_t and IntegerAttr.
Zero-initializing all of them accidentally left the last member active. Only initialize the first one.
…02749) Including unions, where this is more important.
…102753) Pointer::activate() propagates up anyway, so that is handled. But we need to call activate() in any case: the parent might not be a union, yet the activate() is still needed. Always call it and hope that the InUnion flag takes care of the potential performance problems.
- Fix include guards for headers under utils/TableGen to match their paths.
Update the list of opcodes handled by the constant_fold_binop combine to match the ones that are folded in CSEMIRBuilder::buildInstr.
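The combine described above fires only when both operands are compile-time constants and the opcode is in the handled set. A generic sketch of that guard, with an illustrative opcode table (the real combine's list must match what CSEMIRBuilder::buildInstr folds, which is not reproduced here):

```python
import operator

# Illustrative opcode table; not the actual list from the combine.
FOLDABLE = {
    "add": operator.add, "sub": operator.sub, "mul": operator.mul,
    "and": operator.and_, "or": operator.or_, "xor": operator.xor,
}

def constant_fold_binop(opcode, lhs, rhs):
    """Return the folded constant, or None when the opcode isn't handled
    or either operand is not a known integer constant."""
    fold = FOLDABLE.get(opcode)
    if fold is None or not (isinstance(lhs, int) and isinstance(rhs, int)):
        return None
    return fold(lhs, rhs)

print(constant_fold_binop("add", 2, 3))  # 5
print(constant_fold_binop("div", 2, 3))  # None: opcode not in the table
```

The point of the patch is exactly this table-to-builder agreement: an opcode listed by the combine but not folded by the builder (or vice versa) silently loses folding opportunities.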
A dominance query of a block that is in a different function is ill-defined, so assert that getNode() is only called for blocks that are in the same function. There are two cases where this behavior did occur. LoopFuse didn't do this explicitly, but it didn't invalidate the SCEV block dispositions either, leaving dangling pointers to freed basic blocks behind and causing a use-after-free. We do, however, want to be able to dereference basic blocks inside the dominator tree, so that we can refer to them by a number stored inside the basic block.
This patch fixes:
clang/lib/Serialization/ASTReader.cpp:11484:13: error: unused variable '_' [-Werror,-Wunused-variable]
The use of `_` requires either:
- `(void)_` and curly braces, or
- `[[maybe_unused]]`.
For simple repetitions like these, we can use traditional for loops for readable, warning-free code.
They don't have a body and we need to implement them ourselves. Use the Memcpy op to do that.
-fsized-deallocation was recently made the default for C++17 onwards (llvm#90373). While here, remove unneeded -faligned-allocation.
This script helps the release managers merge backport PRs. It does the following things:
* Validates the PR: checks approval, target branch and many other things.
* Rebases the PR.
* Checks out the PR locally.
* Pushes the PR to the release branch.
* Deletes the local branch.
I have found the script very helpful for merging PRs.
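The workflow those steps describe could be sketched as a sequence of git invocations. Everything below (branch names, the remote, the `run` helper, the dry-run mechanism) is a hypothetical illustration of the listed steps, not the actual release-manager script, and the PR-validation step is elided:

```python
import subprocess

def run(cmd, dry_run=True):
    # Run a git command, or just return it unexecuted when dry_run is set.
    if not dry_run:
        subprocess.run(cmd, check=True)
    return cmd

def merge_backport(pr_branch, release_branch, dry_run=True):
    # Steps mirroring the description: checkout, rebase, push, clean up.
    # (Approval/target-branch validation is elided here.)
    steps = [
        run(["git", "fetch", "origin", pr_branch], dry_run),
        run(["git", "checkout", "-b", pr_branch, f"origin/{pr_branch}"], dry_run),
        run(["git", "rebase", release_branch], dry_run),
        run(["git", "push", "origin", f"HEAD:{release_branch}"], dry_run),
        run(["git", "checkout", release_branch], dry_run),
        run(["git", "branch", "-D", pr_branch], dry_run),
    ]
    return steps

for step in merge_backport("backport-example", "release/19.x"):
    print(" ".join(step))
```

Keeping the command list in one place like this is what makes such a script auditable: the dry-run output can be reviewed before anything touches the release branch.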
I tried to add a limit to the number of blocks visited in the paths() function, but even with a very high limit the transformation coverage was reduced. After looking at the code, it seemed that the function was trying to create paths of the form `SwitchBB...DeterminatorBB...SwitchPredecessor`. This is inefficient because a lot of nodes in those paths (the nodes before DeterminatorBB) are irrelevant to the optimization. We only care about paths of the form `DeterminatorBB_Pred DeterminatorBB...SwitchBB`. This weeds out a lot of visited nodes.

In this patch I have added a hard limit to the number of nodes visited and changed the algorithm for path calculation. Primarily, I traverse the use-def chain for the PHI nodes that define the state. If there is a hole in the use-def chain (no immediate predecessors), I call the paths() function. I also had to change the select instruction unfolding code to insert redundant one-input PHIs, to allow the use of the use-def chain in calculating the paths.

The test suite coverage with this patch (including a limit on nodes visited) is as follows:
Geomean diff:
dfa-jump-threading.NumTransforms: +13.4%
dfa-jump-threading.NumCloned: +34.1%
dfa-jump-threading.NumPaths: -80.7%

Compile time effect vs baseline (pass enabled by default) is mostly positive: https://llvm-compile-time-tracker.com/compare.php?from=ad8705fda25f64dcfeb6264ac4d6bac36bee91ab&to=5a3af6ce7e852f0736f706b4a8663efad5bce6ea&stat=instructions:u

Change-Id: I0fba9e0f8aa079706f633089a8ccd4ecf57547ed
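The bounded backward path enumeration that commit message describes can be sketched abstractly. The CFG below, the block names, and the visit limit are illustrative assumptions, not the pass's actual data structures:

```python
# Hypothetical CFG predecessor map: block -> list of predecessor blocks.
preds = {
    "switch_bb": ["determinator"],
    "determinator": ["pred_a", "pred_b"],
    "pred_a": ["entry"],
    "pred_b": ["entry"],
    "entry": [],
}

def paths_to(block, limit=16):
    """Enumerate entry...block paths by walking predecessors backwards,
    giving up once `limit` nodes have been visited -- a hard cap of the
    kind the patch adds to keep compile time bounded."""
    results, visited = [], 0
    stack = [[block]]
    while stack:
        path = stack.pop()
        visited += 1
        if visited > limit:
            break  # bail out instead of exploring an exponential path set
        ps = preds[path[0]]
        if not ps:
            results.append(path)
            continue
        for p in ps:
            if p not in path:  # avoid revisiting blocks on the same path
                stack.append([p] + path)
    return results

for p in paths_to("switch_bb"):
    print(" -> ".join(p))
```

Starting the walk from the state-defining block rather than the switch's distant predecessors is what prunes the irrelevant prefix nodes the message complains about.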
Copying a triple is cheap, but not free, so let's not do that if there's no reason to do so. Trivial cleanup.
This enables the use of the more efficient dominator tree node access.
We were using tryGetRealPathName in certain places, which resolves symlinks (sometimes). This was resulting in discrepancies in behavior, depending on how a file was first reached. This patch migrates all usages of tryGetRealPathName to regular getName instead. This implies one backward-incompatible change for header filtering. Our ignore-header option used to filter against suffixes of absolute paths, whereas now the filter can receive working-directory-relative paths in some cases, possibly breaking existing filters. Chances of really breaking users are pretty low:
- We'll still filter against absolute paths when the header is outside the working directory (e.g. /usr/bin/include/some/linux/header.h).
- Most projects run builds in a working directory that's nested inside the repository, hence relative paths still contain all the segments relative to the repository root, and anything else is unlikely to be meaningful. E.g. if a header is in `$HOME/work/llvm-project/clang-tools-extra/header.h` with builds being run in `$HOME/work/llvm-project/build`, we'll still filter against `../clang-tools-extra/header.h`, which has all the useful segments as a suffix.
- This is also a change in how we handle symlinks, but this is aligned with what we do in the rest of our tools (clangd, tidy checks etc.). We tend to not resolve any symlinks for the file.
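The compatibility argument above can be illustrated with plain suffix matching. The paths and the filter pattern below are hypothetical examples, not clang-tidy's actual matching code:

```python
def matches_filter(path, suffix):
    # The ignore-header option effectively filters against path suffixes.
    return path.endswith(suffix)

# Before: filtering always saw absolute paths.
print(matches_filter("/home/user/work/llvm-project/clang-tools-extra/header.h",
                     "clang-tools-extra/header.h"))  # True

# After: headers inside the working directory may be reported relative to it,
# but the repository-root-relative segments still survive as a suffix.
print(matches_filter("../clang-tools-extra/header.h",
                     "clang-tools-extra/header.h"))  # True

# Headers outside the working directory still come through absolute.
print(matches_filter("/usr/include/some/linux/header.h",
                     "some/linux/header.h"))         # True
```

A filter would only break if it anchored on segments above the repository root (such as `/home/user/work/`), which the relative form no longer contains.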
We are using `has_initialize` to check whether the class has an `initialize` function, instead of checking for the `getOperationName` function.
* Add a distinct metadata reduction pass, which traverses the whole unnamed metadata tree and applies reduction where possible. The previous version could do this only partially, removing either named metadata, metadata attached to instructions, or debug information.
* Modify the current named node operand reduction, making it more aggressive by generalizing the algorithm instead of reducing hard-coded instructions. I see no issue in trying a more aggressive reduction and rolling it back in case it doesn't go through.
* Refactor some of the tests to suit the new functionality:
  - Remove the --abort-on-invalid-reduction flag from the remove-dp-values.ll tests: if it is included, the new named metadata reduction algorithm will fail at some point; if not, the test passes, valid IR is generated, and the module is reduced, at the cost of a few more iterations.
  - Refactor remove-metadata.ll: the new functionality will not only erase the top-level nodes but also their child nodes, so one can't expect them to be present after the run.
  - Refactor remove-named-metadata.ll: the expected behaviour now is to also remove the operands of !some.unknown.named.
  - Add a test for the new functionality; the expected behaviour is to leave no boring nodes and all interesting nodes.
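The "try a more aggressive reduction and roll it back in case it doesn't go through" strategy can be sketched generically. This is an illustrative Python analogue, not the llvm-reduce C++ implementation; `verify` stands in for IR verification and `still_interesting` for the interestingness test:

```python
def reduce_operands(operands, verify, still_interesting):
    """Greedily try dropping each operand; keep the removal only if the
    result both verifies and remains interesting, otherwise roll back."""
    current = list(operands)
    for op in list(current):
        candidate = [o for o in current if o != op]
        if verify(candidate) and still_interesting(candidate):
            current = candidate  # commit the more reduced variant
        # else: roll back by simply not committing `candidate`
    return current

# Toy example: "interesting" means the marker operand survives;
# "verify" means the list is non-empty (a stand-in for valid IR).
ops = ["boring_a", "interesting_0", "boring_b"]
reduced = reduce_operands(
    ops,
    verify=lambda c: len(c) > 0,
    still_interesting=lambda c: "interesting_0" in c,
)
print(reduced)  # ['interesting_0']
```

Because every removal is checked and reverted on failure, trying an operand that cannot legally be dropped costs only an extra iteration, which is why the generalized algorithm can afford to be aggressive.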
--aggressive-md flag. Addressed the issues pointed out by arsenm.
…llvm-project into barinov-reduce-distinct
under the --aggressive-md flag.
What's going on with this? The last batch of comments wasn't addressed and it was closed without comment?
@arsenm Closed this pull request because of a bad rebase. Will reopen another one with code review fixes and a clean commit history.