fix(html): regexp match script tag ignore comment #20341

btea · 2025-07-03T08:30:46Z

Description

fix #20340
refs #18386

sapphi-red

I don't think stripLiteral works with HTML files.
Maybe we can use parse5.Tokenizer to do something similar to stripLiteral but I'm not sure if that is performant than parsing + traversing the HTML directly.

btea · 2025-07-03T09:18:27Z

You are right. The local test result was correct by accident. The type value in the script tag was removed, resulting in the regular expression not matching.

Can we add another regular expression to match the comment content in HTML? 🤔

sapphi-red · 2025-07-03T10:41:43Z

I guess we need to at least tokenize it to properly handle comments. In other words, I don't think regex is sufficient.

sapphi-red · 2025-07-07T08:51:38Z

Would you check the performance difference? I'd like to know how much additional overhead this would have. Also we can check the HTML with regex before parsing so that we can skip the parse if that's faster.

btea · 2025-07-07T10:04:23Z

I will check it.

btea · 2025-07-08T04:44:12Z

I created a template project containing only js for testing through pnpm create vite.

I tested it locally and the results of 10 runs are as follows. It seems that when importmap does not exist, regular matching can be used to avoid a huge waste of traversal performance.

	preImportMapHook(ms)	postImportMapHook(ms)
no-importmap	14.69 ± 1.65	1.05 ± 0.10
no-importmap-with-regexp	0.48 ± 0.17	0.28 ± 0.17
have-importmap	15.63 ± 3.27	1.37 ± 0.18
have-importmap-with-regexp	15.48 ± 3.65	1.61 ± 0.26

I retested on a different device, and the updated results look reasonable.

The following table shows the original logical performance of Vite through regular matching.

	preImportMapHook(ms)	postImportMapHook(ms)
no-importmap	0.42 ± 0.11	0.28 ± 0.08
have-importmap	0.60 ± 0.09	0.30 ± 0.04

sapphi-red · 2025-07-09T03:33:27Z

Thanks for checking the perf 💚

packages/vite/src/node/plugins/html.ts

sapphi-red

Thanks!

bluwy · 2025-07-16T02:24:10Z

Trying to understand this a bit better, based on the perf result, isn't the regex one better? In most apps so far I think "no importmap" setups are more common.

To fix this with regex, couldn't we do html.replace(//gs, (s) => ' '.repeat(s.length)) since html comment stripping should be simpler? Maybe this would incur additional work but maybe not as much as doing a parse.

sapphi-red · 2025-07-16T02:30:55Z

The original regex implementation is faster than the parsing approach. But with the "regex filter early return" (if (!importMapRE.test(html)) return), we can keep the perf for most apps (the setups without <script type="importmap"> as a string), as the parsing won't happen.
While we can strip html comments with regex to speed up the non-common apps, I think we can parse it as that would be more robost and this is not the common case.

bluwy · 2025-07-16T03:39:24Z

Ah I missed that the PR still has an early regex check, so in practice this would only slow down setups using importmaps.

Still, I find 15ms a bit much for a step during html processing that only re-arranges some tags, it'll compound with other html processing we do. I would lean on keeping regex unless people are hitting problems often with it. I think thus far there weren't many issues.

sapphi-red · 2025-07-16T04:22:57Z

@btea I wonder if the 15ms is coming from the actual parsing or this lazy load of parse5. Because I don't think preImportMapHook would take 15x time than postImportMapHook.

vite/packages/vite/src/node/plugins/html.ts

Line 193 in 6bc8bf6

const { parse } = await import('parse5')

Also is this the total of 10 runs? or is it the average of it?

btea · 2025-07-16T04:44:13Z

Yes, I added const startTime = performance.now() and const endTime = performance.now() at the beginning and end of this function respectively, and calculated the results. The above values are the average and variance.

https://github.com/vitejs/vite/pull/20341/files#diff-89bae1df62862bb7f4a03d82a1e9cbf4ac6d0c042f21fbbacb0a2238bd050042L1158

fix: regexp match script tag ignore comment

614fc41

sapphi-red requested changes Jul 3, 2025

View reviewed changes

btea added 2 commits July 3, 2025 20:59

fix: update

b59ed97

fix: update

e841321

btea requested a review from sapphi-red July 6, 2025 01:30

sapphi-red added feat: html p2-edge-case Bug, but has workaround or limited in scope (priority) feat: build labels Jul 7, 2025

fix: update

a001ce2

sapphi-red requested changes Jul 9, 2025

View reviewed changes

packages/vite/src/node/plugins/html.ts Outdated Show resolved Hide resolved

packages/vite/src/node/plugins/html.ts Outdated Show resolved Hide resolved

sapphi-red changed the title ~~fix: regexp match script tag ignore comment~~ fix(html): regexp match script tag ignore comment Jul 9, 2025

fix: update

22db9b2

sapphi-red requested changes Jul 10, 2025

View reviewed changes

packages/vite/src/node/plugins/html.ts Outdated Show resolved Hide resolved

packages/vite/src/node/plugins/html.ts Show resolved Hide resolved

fix: update

3bc1096

sapphi-red approved these changes Jul 11, 2025

View reviewed changes

btea requested a review from bluwy July 11, 2025 09:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix(html): regexp match script tag ignore comment #20341

fix(html): regexp match script tag ignore comment #20341

btea commented Jul 3, 2025 •

edited by sapphi-red

Loading

Uh oh!

sapphi-red left a comment

Uh oh!

btea commented Jul 3, 2025

Uh oh!

sapphi-red commented Jul 3, 2025

Uh oh!

sapphi-red commented Jul 7, 2025

Uh oh!

btea commented Jul 7, 2025

Uh oh!

btea commented Jul 8, 2025 •

edited

Loading

Uh oh!

sapphi-red commented Jul 9, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sapphi-red left a comment

Uh oh!

bluwy commented Jul 16, 2025

Uh oh!

sapphi-red commented Jul 16, 2025

Uh oh!

bluwy commented Jul 16, 2025

Uh oh!

sapphi-red commented Jul 16, 2025

Uh oh!

btea commented Jul 16, 2025

Uh oh!

Uh oh!

Uh oh!

fix(html): regexp match script tag ignore comment #20341

Are you sure you want to change the base?

fix(html): regexp match script tag ignore comment #20341

Conversation

btea commented Jul 3, 2025 • edited by sapphi-red Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

sapphi-red left a comment

Choose a reason for hiding this comment

Uh oh!

btea commented Jul 3, 2025

Uh oh!

sapphi-red commented Jul 3, 2025

Uh oh!

sapphi-red commented Jul 7, 2025

Uh oh!

btea commented Jul 7, 2025

Uh oh!

btea commented Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sapphi-red commented Jul 9, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sapphi-red left a comment

Choose a reason for hiding this comment

Uh oh!

bluwy commented Jul 16, 2025

Uh oh!

sapphi-red commented Jul 16, 2025

Uh oh!

bluwy commented Jul 16, 2025

Uh oh!

sapphi-red commented Jul 16, 2025

Uh oh!

btea commented Jul 16, 2025

Uh oh!

Uh oh!

btea commented Jul 3, 2025 •

edited by sapphi-red

Loading

btea commented Jul 8, 2025 •

edited

Loading