viewer: use treemap for bundle visualization #10312

connorjclark · 2020-02-09T05:59:18Z

One of a few features I'm extracting from the bundle-analysis (#10064) branch.

Design doc: https://docs.google.com/document/d/10FQJqkRPFXhR1PDny_c0kvm7qQtokR5o7Oyw0n2_N44/edit?usp=sharing

In draft b/c UX hasn't been considered yet.

try it locally

yarn && yarn build-viewer && yarn viewer

# other terminal
node lighthouse-cli https://www.coursehero.com/sg/ --preset experimental --view

To save a distributable copy, Ctrl+S, "Complete Webpage", and distribute the HTML + the _files. Optionally trim the HTML by replacing the <main>...</main> with what is in treemap/index.html.

commit ecb6260 Author: Connor Clark <cjamcl@google.com> Date: Wed Dec 11 15:17:52 2019 -0800 no index commit 52796db Author: Connor Clark <cjamcl@google.com> Date: Wed Dec 11 15:17:02 2019 -0800 no unshift commit 943e19c Author: Connor Clark <cjamcl@google.com> Date: Wed Dec 11 15:14:21 2019 -0800 deletionBraceCount commit caa269b Author: Connor Clark <cjamcl@google.com> Date: Wed Dec 11 15:13:25 2019 -0800 comments commit e6fde71 Author: Connor Clark <cjamcl@google.com> Date: Wed Dec 11 15:02:50 2019 -0800 semver commit 6480198 Author: Connor Clark <cjamcl@google.com> Date: Tue Dec 10 21:18:36 2019 -0800 update frontend commit 5d17268 Author: Connor Clark <cjamcl@google.com> Date: Tue Dec 10 21:07:44 2019 -0800 rm some ts ignores commit 7129ef9 Author: Connor Clark <cjamcl@google.com> Date: Tue Dec 10 21:04:09 2019 -0800 trim unneeded code commit 0b655b6 Author: Connor Clark <cjamcl@google.com> Date: Tue Dec 10 20:46:11 2019 -0800 better cdt lib gen commit 3333c4f Author: Connor Clark <cjamcl@google.com> Date: Tue Dec 10 20:19:20 2019 -0800 add sections to source map type commit 73eca7b Author: Connor Clark <cjamcl@google.com> Date: Tue Dec 10 20:00:02 2019 -0800 move SDK up b/c global pollution is gone commit 47e8a41 Author: Connor Clark <cjamcl@google.com> Date: Tue Dec 10 19:44:59 2019 -0800 comment commit 9f47880 Author: Connor Clark <cjamcl@google.com> Date: Mon Dec 9 15:02:57 2019 -0800 remove unused js commit 7d50227 Author: Connor Clark <cjamcl@google.com> Date: Sat Dec 7 00:36:07 2019 -0800 nul commit ed4bb21 Author: Connor Clark <cjamcl@google.com> Date: Sat Dec 7 00:33:39 2019 -0800 remove ParsedURL commit 43da869 Author: Connor Clark <cjamcl@google.com> Date: Sat Dec 7 00:30:08 2019 -0800 tmp commit 0d73a8b Author: Connor Clark <cjamcl@google.com> Date: Sat Dec 7 00:22:10 2019 -0800 reduce code commit 4a9127d Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 23:52:44 2019 -0800 delete some stuff not needed commit dff8358 Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 23:24:48 2019 -0800 errors commit 808736d Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 22:56:20 2019 -0800 Common.console commit 122f2a2 Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 22:54:42 2019 -0800 ts commit 2f80d5a Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 22:50:18 2019 -0800 remove hacky globalThis.cdt commit 7ad9389 Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 21:24:00 2019 -0800 do not pollute global array commit fa07a80 Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 21:17:56 2019 -0800 mapping test commit f9408f3 Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 20:49:23 2019 -0800 count lines commit 6d27caa Author: Connor Clark <cjamcl@google.com> Date: Fri Dec 6 20:41:09 2019 -0800 initial

joshribakoff · 2020-05-04T04:37:31Z

cc @joshribakoff, fyi we saw your PR to source-map-explorer (danvk/source-map-explorer#145). Thought you might be interested in this effort. happy to hear any suggestions you may have :)

What would be really cool is if the tool could visualize how the code executes over time. I've also studied your definitions of TTI, and done much research into the topic & I think there is a use case to visualize CPU usage, over time, as a heatmap gradient mapped onto the treemap. Often TTI fails to capture main thread blocking events that happen after the metric, rather than giving a TTI metric, a full blown visualization of how work unfolds over time would be more useful.

There's also a use case for a tool that ingests user access patterns, and does knapsack algorithms + a heuristic to determine optimal bundle splitting. For example, if 40% of users land on the home page, and 40% of users land on the "channel" page, it is reasonable to infer those 2 entries should be their own bundle, by looking at the correlations in the user access patterns. There is a use case for a tool to visualize this.

I requested access to the design doc, thanks for tagging me!

The key take away I would stress is that there are 2 dimensions to split bundles - you ideally want to be able to visualize & split bundles by how frequently the code changes (to minimize cache invalidation), and the other dimension is user access patterns - you want to visualize the "minimal code" for all app boots, and you want to categorize app boots into entry points to inform your bundle splitting, generally speaking, or at least this is my take away.

I would actually also tag @TheLarkInn who has some opinions about how vendor bundles are an anti-pattern, it his musings which prompted me to make the original heatmap demo!

connorjclark · 2020-05-04T19:00:51Z

What would be really cool is if the tool could visualize how the code executes over time. I've also studied your definitions of TTI, and done much research into the topic & I think there is a use case to visualize CPU usage, over time, as a heatmap gradient mapped onto the treemap. Often TTI fails to capture main thread blocking events that happen after the metric, rather than giving a TTI metric, a full blown visualization of how work unfolds over time would be more useful.

We've found that using color gradients to denote information is very confusing. For one, it's not very color-blind accessible. Even as a color-normal person, I find it difficult to get any relative sense by comparing colors. That's why we cut "color by" from the first demo.

In a similar vein, I was thinking: a slider that starts at no execution time for anything to bucketing execution time to the correct node as it happens. On the slider, we could put critical moments like FCP, LCP, etc...

Unclear to me if this would be more cool than useful :)

There's also a use case for a tool that ingests user access patterns, and does knapsack algorithms + a heuristic to determine optimal bundle splitting. For example, if 40% of users land on the home page, and 40% of users land on the "channel" page, it is reasonable to infer those 2 entries should be their own bundle, by looking at the correlations in the user access patterns. There is a use case for a tool to visualize this.

At the moment Lighthouse consuming outside data (from GA, for example) is out of scope.

Besides that, normally when we talk about code splitting the goal is to only provide the code needed for the page, not try to make subsequent page loads on different parts of the site more likely to be in a warm cache. What you're suggesting almost seems like the opposite of what we'd want to do in Lighthouse–if the landing page doesn't use any of the code that the channel page uses, we'd want to suggest splitting it out. (FYI–I'm not sure how we'd do this yet).

BTW I have seen tools that consume user access patterns to create predictive preloads/prerenders. https://www.youtube.com/watch?v=UwjzFGCAuLw (s/o @shortdiv)

I requested access to the design doc, thanks for tagging me!

Ah it's super old anyhow, not useful anymore. things drifted and it wasn't updated.

The key take away I would stress is that there are 2 dimensions to split bundles - you ideally want to be able to visualize & split bundles by how frequently the code changes (to minimize cache invalidation), and the other dimension is user access patterns - you want to visualize the "minimal code" for all app boots, and you want to categorize app boots into entry points to inform your bundle splitting, generally speaking, or at least this is my take away.

Even with the most integrated toolset, I think we can only hope to know about user access patterns.

Code splitting advice is a hard problem, fortunately it's not part of this tool's initial release.

I would actually also tag @TheLarkInn who has some opinions about how vendor bundles are an anti-pattern, it his musings which prompted me to make the original heatmap demo!

Advising against bundles with vendor in the name might be something we could do. I recall hearing this advice, but would need to brush up on it to know if it's universally applicable.

joshribakoff · 2020-05-07T02:59:49Z

@connorjclark

We've found that using color gradients to denote information is very confusing. For one, it's not very color-blind accessible. Even as a color-normal person, I find it difficult to get any relative sense by comparing colors. That's why we cut "color by" from the first demo.

Code cov can be mapped to grayscale. Some of the color choices shown in the screenshots in the PR are quite unintuitive. I see your point, and I did not fully read through the PR to learn about the "color by" discussion ;)

Unclear to me if this would be more cool than useful :)

I also see your point :)

At the moment Lighthouse consuming outside data (from GA, for example) is out of scope.

Besides that, normally when we talk about code splitting the goal is to only provide the code needed for the page, not try to make subsequent page loads on different parts of the site more likely to be in a warm cache.

But if users pre-fetch, that can mess with your heuristics right? Maybe this is out of scope for this PR & I'm not super familiar with your tool.

What you're suggesting almost seems like the opposite of what we'd want to do in Lighthouse–if the landing page doesn't use any of the code that the channel page uses, we'd want to suggest splitting it out. (FYI–I'm not sure how we'd do this yet).

I guess what I'm saying is that if 45% of users hit the "home" page & 45% of users hit the "channel page", and 80% of the code for those 2 pages is shared, and 80% of users who hit one page visit the other page, this information is ideally leveraged, if we're talking about an ideal bundle splitting strategy. Maybe its out of scope here for this PR.

Advising against bundles with vendor in the name might be something we could do. I recall hearing this advice, but would need to brush up on it to know if it's universally applicable.

Its not the naming, its the anti-pattern of having "common" chunks, which has every piece of code used on 3 or more pages (a common webpack default). This "common" chunk is then often served on every page. This often results in lots of "common" code being present for pages that don't use all of the "common" code, even if its not executed the parse/eval time has a large cost, especially on mobile devices.

Sean's advice was to use the Chrome code coverage in the devtools & shoot for bundle splitting that gives you 80% coverage on FPL. I found Chrome's dev code coverage feature to be unusable because its just raw data, hence why I built the heatmap tool that mapped the raw code coverage data from chrome devtools onto a color coded treemap.

I'm not familiar with what you're trying to solve here, but I hope this helps. Thanks for making this awesome tool & making the web/world better!

connorjclark and others added 30 commits December 9, 2019 15:17

initial

4a62a54

do not need _ yet

7886f5e

test for unused bundle

65cc13c

lint

2ed3b7b

fix null mapping bug

652059e

fix counting

b48a400

fix smoke test

7bdf8b4

comment

b6a4a08

comment

b840b10

lint

c3fd09e

better bundle size filter

36c7bee

todo

912f9f1

update bundle.js

ea03907

count new lines

a862683

dont double count

4ff3804

fix test

fa429e0

fix test

74b642d

destructure

1c9d984

lastColumnOfMapping

0d440b2

Merge remote-tracking branch 'origin/master' into unused-js-bundles

60ac62f

Merge remote-tracking branch 'origin/master' into unused-js-bundles

c27aa4a

Merge remote-tracking branch 'origin/master' into unused-js-bundles

e2d1035

fix

e317115

fix one test

d68a555

fix default config test

3d9c5f4

fix

1bd219d

update

4b8fe87

Merge remote-tracking branch 'origin/master' into unused-js-bundles

39a4938

rm old code

d338faf

comment

54192da

vercel bot deployed to Preview April 30, 2020 22:00 View deployment

connorjclark mentioned this pull request May 1, 2020

JS analysis blockers ☂️ #10369

Closed

16 tasks

resource summary

c26c223

vercel bot deployed to Preview May 4, 2020 03:59 View deployment

organize treemap/index.html

2b9ca3e

vercel bot deployed to Preview May 4, 2020 04:04 View deployment

connorjclark added 2 commits May 3, 2020 21:11

execution time

f104231

fix html

41c906f

vercel bot deployed to Preview May 4, 2020 04:11 View deployment

connorjclark mentioned this pull request May 4, 2020

core(resource-summary): rename size to transferSize #10700

Merged

use settings multiplier for execution time

454b3b6

vercel bot deployed to Preview May 6, 2020 06:53 View deployment

aggregate inline

92f3adf

vercel bot deployed to Preview May 6, 2020 07:12 View deployment

Merge remote-tracking branch 'origin/master' into viewer-treemap

f4eb084

vercel bot deployed to Preview May 10, 2020 22:42 View deployment

connorjclark added 2 commits May 10, 2020 23:09

wip

5944fd5

Merge remote-tracking branch 'origin/subrow-layout' into viewer-treemap

c03271c

vercel bot deployed to Preview May 11, 2020 06:11 View deployment

fix transfer size

dfb15f5

vercel bot deployed to Preview May 11, 2020 06:18 View deployment

connorjclark removed the needs-discussion label Jun 17, 2020

update

7fbba12

vercel bot deployed to Preview August 11, 2020 20:29 View deployment

connorjclark closed this Aug 11, 2020

connorjclark mentioned this pull request Aug 12, 2020

Lighthouse Treemap ☂️ #11254

Closed

24 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

viewer: use treemap for bundle visualization #10312

viewer: use treemap for bundle visualization #10312

connorjclark commented Feb 9, 2020 •

edited

Loading

joshribakoff commented May 4, 2020

connorjclark commented May 4, 2020

joshribakoff commented May 7, 2020 •

edited

Loading

viewer: use treemap for bundle visualization #10312

viewer: use treemap for bundle visualization #10312

Conversation

connorjclark commented Feb 9, 2020 • edited Loading

try it locally

joshribakoff commented May 4, 2020

connorjclark commented May 4, 2020

joshribakoff commented May 7, 2020 • edited Loading

connorjclark commented Feb 9, 2020 •

edited

Loading

joshribakoff commented May 7, 2020 •

edited

Loading