
Discussion PR: bundle analysis #10064

Closed
wants to merge 68 commits into from

Conversation

@connorjclark (Collaborator) commented Dec 3, 2019

I've been working on a few new audits re: bundle analysis. There are a few good breakpoints for dividing this up into smaller PRs, but I wanted to throw this up here to start discussion on some implementation details.

This branch has:

  1. core(source-maps): workaround CORS for fetching maps #9459 merged in
  2. bundle-duplication audit
  3. valid-source-maps audit
  4. new detail type for rendering multiple values
  5. a build step that compiles the CDT source map implementation to a CommonJS library


run:

```sh
SIZE_MODE=cdt node lighthouse-cli https://web.dev/ --only-audits=bundle-duplication,valid-source-maps --view # passes

# or use https://www.coursehero.com to see a failing audit

# SIZE_MODE=moz (or don't set at all) will use a different counting impl.
```

source map implementation

There is the popular mozilla/source-map, and there is Chrome DevTools' version. The bundle-duplication audit uses a source map implementation to count how many bytes are mapped to each input source file.

mozilla/source-map uses WASM to parse the mappings. CDT is all JS. They run in equivalent times, as far as bundle-duplication's usage is concerned.

To efficiently attribute the byte costs, a lastGeneratedColumn is required, which is missing in CDT but exists in mozilla/source-map. It was a simple patch, and is something we need to upstream to CDT for @paulirish's CDT bundle visualizer anyhow.
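As a rough sketch of why the end column matters: with it, per-file byte attribution reduces to summing each mapping's span. The field names below (`sourceURL`, `columnNumber`, `lastColumnNumber`) are illustrative, not the exact CDT or mozilla/source-map API, and the sketch assumes single-line mappings.

```javascript
// Illustrative sketch: attribute generated bytes to source files by
// summing each mapping's column span. Field names are made up.
function computeFileSizes(mappings) {
  const sizes = new Map();
  for (const mapping of mappings) {
    // Without an end column we'd have to infer each mapping's extent from
    // the start of the next one; with it, the cost is a simple span.
    const bytes = mapping.lastColumnNumber - mapping.columnNumber;
    sizes.set(mapping.sourceURL, (sizes.get(mapping.sourceURL) || 0) + bytes);
  }
  return sizes;
}

const sizes = computeFileSizes([
  {sourceURL: 'lib/a.js', columnNumber: 0, lastColumnNumber: 10},
  {sourceURL: 'lib/b.js', columnNumber: 10, lastColumnNumber: 25},
  {sourceURL: 'lib/a.js', columnNumber: 25, lastColumnNumber: 30},
]);
console.log(sizes.get('lib/a.js')); // 15
```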

CDT adds 20KB, mozilla/source-map adds 49KB.

CDT is written in ES modules, so I've made a build script that transpiles chrome-devtools-frontend to CommonJS. A few hacks were also required - there is some global prototype pollution, but I've contained all global-scope pollution to a top-level globalThis.cdt.
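A minimal sketch of what that containment could look like - wrap each transpiled module so its "global" writes land on one namespace object instead of the real global (the wrapper and `SDK.parseSourceMap` names are hypothetical, not the actual build output):

```javascript
// Hedged sketch of containing global writes: the generated code receives
// globalThis.cdt as its global stand-in, so nothing leaks elsewhere.
globalThis.cdt = globalThis.cdt || {};

function wrapModule(moduleBody) {
  // Hypothetical wrapper emitted by the build script.
  return moduleBody(globalThis.cdt);
}

wrapModule(self => {
  // A transpiled module that would otherwise write to the real global.
  self.SDK = self.SDK || {};
  self.SDK.parseSourceMap = json => JSON.parse(json);
});

console.log(typeof globalThis.cdt.SDK.parseSourceMap); // 'function'
```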

Question: Currently, there is no speed benefit to using CDT, but there is a size benefit. Which should we use?

Question: Is there a way to turn off tsc for lighthouse-core/lib/cdt/generated?

bundle-duplication

Question: I investigated a threshold for byte duplication, since there can be many "0KB" entries and that's kinda noisy. I landed on a 100-byte threshold with 0.05 granularity. We could go higher, but there can be a long tail, and the sum of all of those can be significant. We could throw them all into a "the rest" item (like font-size does), but that's not actionable. Thoughts?
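To make the threshold idea concrete, a minimal sketch of the filtering step (the 100-byte value is the one discussed above; the item shape and helper name are hypothetical):

```javascript
// Hedged sketch: drop duplicate-module entries whose wasted bytes fall
// under the threshold, so the table isn't dominated by "0KB" noise.
const DUPLICATION_THRESHOLD_BYTES = 100; // value from the discussion above

function filterDuplicates(items) {
  return items.filter(item => item.wastedBytes >= DUPLICATION_THRESHOLD_BYTES);
}

const filtered = filterDuplicates([
  {source: 'node_modules/lodash', wastedBytes: 5200},
  {source: 'node_modules/left-pad', wastedBytes: 40}, // below threshold, dropped
]);
console.log(filtered.length); // 1
```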

multi

bundle-duplication reports files (from source maps) that are duplicated, and for each module (normalized to ignore nested node_modules) there can be n duplicates. For this audit to be actionable, it must be clear which source file each duplicate comes from. Additionally, each item needs to aggregate the information for a module, such that they can be sorted in the table.

That means we need a column that can contain multiple entries. In the interest of backwards compatibility and not needing to change tons of internal code, I suggest the following format:


If the header is marked as multi: true, then the details renderer will look up values in item.multi[key] and render each. To maintain the interface all the other byte-efficiency audits use, the top-level item.wastedBytes is the sum of all multi.wastedBytes.

The other top-level properties are unnecessary for bundle-duplication, but I didn't attempt to tweak the TS interface to drop them entirely. Maybe ByteEfficiencyItem could be a union of a multi type and a normal one?
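To make the proposal concrete, a hedged sketch of what a multi item could look like - field names follow the description above, and the values are made up:

```javascript
// Hypothetical `multi` item: the top-level wastedBytes is the sum of the
// per-duplicate values, so existing byte-efficiency sorting keeps working.
const item = {
  source: 'node_modules/lodash',
  wastedBytes: 7000,
  multi: {
    url: ['https://example.com/a.js', 'https://example.com/b.js'],
    wastedBytes: [4000, 3000],
  },
};

// A renderer seeing a heading with {key: 'url', multi: true} would look up
// item.multi.url and render each entry in that cell.
const sum = item.multi.wastedBytes.reduce((a, b) => a + b, 0);
console.log(item.wastedBytes === sum); // true
```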

Thoughts?

valid-source-maps

Surfaces a few things.

  • Load errors (a map was declared but fetching it failed)
  • Missing sourcesContent (we can punt on this; not used yet)
  • Script looks like a bundle but does not declare a map (not implemented yet)
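A minimal sketch of those three checks - the script/map shapes and the bundle heuristic here are hypothetical, not Lighthouse's actual artifact format:

```javascript
// Hedged sketch of the valid-source-maps checks described above.
function auditSourceMap(script) {
  const issues = [];
  if (script.sourceMapURL && !script.map) {
    issues.push('Load error: map was declared but fetching/parsing failed');
  }
  if (script.map && !(script.map.sourcesContent || []).some(Boolean)) {
    issues.push('Missing sourcesContent');
  }
  if (!script.sourceMapURL && looksLikeBundle(script)) {
    issues.push('Looks like a bundle, but no source map declared');
  }
  return issues;
}

// Purely illustrative heuristic, not the real implementation.
function looksLikeBundle(script) {
  return /webpackJsonp|__webpack_require__/.test(script.content || '');
}

// A map was declared but never loaded: surfaces one load-error issue.
console.log(auditSourceMap({sourceMapURL: 'a.js.map', map: null}));
```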

I'm also interested in pulling in source-map-validator to do some validations, but that isn't critical imo.

Question: is a best-practices audit the best way to surface this data? It's kinda meta to report, in that lack of this data means other audits can't function. What if those other audits hotlink to this one, if it's missing required data?

@connorjclark (Collaborator, Author) commented:

Closing, this has served its purpose.

@paulirish (Member) commented:

FWIW I'm super excited about this whole effort! 🎉
