Feature request: Filtering execution log #12075

GabrielGhe · 2020-09-10T00:55:31Z

Description of the problem / feature request:

When using `--execution_log_json_file`, Bazel generates a very large JSON file, and creating it is very slow. We're only interested in a few data points (output files, remote cache hit, size, duration). I know that the duration can be found in the profile log, as can when we download from the remote cache (created with --profile), but that file doesn't specify exactly what it's trying to download from the remote cache. A single build creates an execution log of over 1 GB, so we have to turn it off.

Feature requests: what underlying problem are you trying to solve with this feature?

Add the ability to filter what gets written to the execution log, so that users can keep only the fields they need.

What operating system are you running Bazel on?

Ubuntu 18.04

What's the output of `bazel info release`?

release 3.4.1

Have you found anything relevant by searching the web?

No

Any other information, logs, or outputs that you want to share?

No


meisterT · 2021-09-21T10:50:16Z

> when we download from the remote cache (created with --profile), but that file doesn't specify exactly what it's trying to download from the remote cache

What would you like to see there?

altonchuzhan · 2022-03-15T13:06:19Z

bazel/src/main/java/com/google/devtools/build/lib/exec/SpawnLogContext.java

Lines 91 to 106 in 09cd1fc

    
    try {
      for (Map.Entry<PathFragment, ActionInput> e : inputMap.entrySet()) {
        ActionInput input = e.getValue();
        if (input instanceof VirtualActionInput.EmptyActionInput) {
          continue;
        }
        Path inputPath = execRoot.getRelative(input.getExecPathString());
        if (inputPath.isDirectory()) {
          listDirectoryContents(inputPath, builder::addInputs, metadataProvider);
        } else {
          Digest digest = computeDigest(input, null, metadataProvider);
          builder.addInputsBuilder().setPath(input.getExecPathString()).setDigest(digest);
        }
      }
    } catch (IOException e) {
      logger.atWarning().withCause(e).log("Error computing spawn inputs");

I think this could be done by filtering out some of the large sections, for example the inputs section populated by the code above.
In my case, it is quite annoying to have to dig a cache hit flag out of 8.1 GB of JSON data.

Filtering by target or action is another option. It might look like:

bazel aquery 'mnemonic("TestRunner", //...)'
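Until something like this exists in Bazel itself, one possible stopgap is to slim the log down after the fact. The sketch below is an illustration, not an existing Bazel feature: it assumes the JSON log is a sequence of concatenated JSON objects (one per spawn) and that the field names `targetLabel`, `mnemonic`, `actualOutputs`, `remoteCacheHit`, and `runner` match what your Bazel version writes, so check them against a real log first. The helper names `iter_spawns`, `slim`, and `filter_log` are hypothetical. It does not address the cost of producing the full log in the first place.

```python
import json
import sys

# Assumed field names (JSON rendering of the spawn log entries); verify
# against your own log before relying on them.
KEEP = ("targetLabel", "mnemonic", "actualOutputs", "remoteCacheHit", "runner")


def iter_spawns(stream, chunk_size=1 << 20):
    """Yield each top-level JSON object from a stream of concatenated objects."""
    decoder = json.JSONDecoder()
    buf = ""
    while True:
        chunk = stream.read(chunk_size)
        at_eof = not chunk
        buf += chunk
        pos = 0
        while True:
            # raw_decode does not skip leading whitespace, so do it here.
            while pos < len(buf) and buf[pos].isspace():
                pos += 1
            if pos == len(buf):
                break
            try:
                obj, pos = decoder.raw_decode(buf, pos)
            except json.JSONDecodeError:
                if at_eof:
                    raise  # trailing data that is not a complete object
                break  # object is incomplete; read another chunk
            yield obj
        buf = buf[pos:]
        if at_eof:
            return


def slim(spawn, keep=KEEP):
    """Keep only the fields of interest, dropping e.g. the large inputs list."""
    return {k: spawn[k] for k in keep if k in spawn}


def filter_log(src, dst, mnemonic=None):
    """Write one slimmed JSON object per line, optionally for one mnemonic only."""
    for spawn in iter_spawns(src):
        if mnemonic is not None and spawn.get("mnemonic") != mnemonic:
            continue
        dst.write(json.dumps(slim(spawn)) + "\n")


if __name__ == "__main__" and len(sys.argv) == 3:
    with open(sys.argv[1]) as src, open(sys.argv[2], "w") as dst:
        filter_log(src, dst)
```

Passing `mnemonic="TestRunner"` to `filter_log` would mimic the `aquery`-style filter above. Because the file is read in chunks, memory use stays bounded by the largest single entry rather than by the size of the whole log.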

altonchuzhan · 2022-03-15T13:07:52Z

> when we download from the remote cache (created with --profile), but that file doesn't specify exactly what it's trying to download from the remote cache

> what would you like to see there?

Is there any other way to retrieve a cache hit flag for a specific target? I had no luck finding it in the profile data.

GabrielGhe · 2022-11-18T04:41:09Z

@meisterT

GabrielGhe · 2022-11-18T04:45:20Z

> when we download from the remote cache (created with --profile), but that file doesn't specify exactly what it's trying to download from the remote cache

> what would you like to see there?

The profile might not be the best place to add extra information about action size, sha256, etc. Ideally we would want some way to filter the execution log or reduce its size. Our execution log can grow to over 5 GB on a clean build.

Open to other ideas. The use case is that we want to be able to find the first actions that were cache misses that invalidated many other actions.

Thanks!
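To make the use case concrete, here is a rough sketch of the "first cache misses" search over already-parsed log entries. It is a heuristic under stated assumptions, not something Bazel provides: it assumes each entry is a dict with `inputs` and `actualOutputs` lists whose items carry a `path`, and that `remoteCacheHit` is simply absent when false. The function name `root_cache_misses` is hypothetical. A miss is treated as a "root" when none of its inputs were produced by another miss in the same build.

```python
def root_cache_misses(spawns):
    """Return the misses whose inputs were not produced by any other miss."""
    # Treat a missing flag as a miss (assumption: false values are omitted).
    misses = [s for s in spawns if not s.get("remoteCacheHit", False)]

    # Map each output path to the index of the miss that produced it.
    produced_by_miss = {}
    for i, spawn in enumerate(misses):
        for out in spawn.get("actualOutputs", []):
            path = out.get("path")
            if path is not None:
                produced_by_miss[path] = i

    roots = []
    for i, spawn in enumerate(misses):
        upstream = {
            produced_by_miss[inp["path"]]
            for inp in spawn.get("inputs", [])
            if inp.get("path") in produced_by_miss
        }
        upstream.discard(i)  # ignore self-references
        if not upstream:
            roots.append(spawn)
    return roots
```

Two caveats: this needs the input paths, which are part of the large section one might otherwise want to drop (the digests are not needed), and actions that are never remotely cacheable will also show up as misses, so the result is a starting point for investigation rather than a definitive answer.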

jin added the team-Performance, type: feature request, and untriaged labels on Sep 15, 2020

sventiffe added the P3 label (We're not considering working on this, but happy to review a PR. No assignee) on Nov 9, 2020

meisterT added the help wanted label (Someone outside the Bazel team could own this) and removed the untriaged label on Nov 9, 2020

meisterT mentioned this issue Jun 20, 2023

Implement a way to be able to debug cache misses that can be always on without too much overhead #18643

Closed
