perf(lexer): only check for hashbang at start of file #12521

overlookmotel · 2025-07-25T18:12:01Z

Small optimization to lexer. A hashbang can only appear at very start of file, so only check for hashbang when getting first token. This streamlines the byte handler for #, because a # anywhere else can only be a private identifier.

Note: self.token.set_is_on_new_line(true); in read_hashbang_comment is not required, because it's always true already.

overlookmotel · 2025-07-25T18:12:19Z

perf(lexer): only check for hashbang at start of file #12521 👈 (View in Graphite)
main

How to use the Graphite Merge Queue

Add either label to this PR to merge it via the merge queue:

0-merge - adds this PR to the back of the merge queue
hotfix - for urgent hot fixes, skip the queue and merge this PR next

You must have a Graphite account in order to use the merge queue. Sign up using this link.

_{An organization admin has enabled the Graphite Merge Queue in this repository.} _{Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.}

This stack of pull requests is managed by Graphite. Learn more about stacking.

codspeed-hq · 2025-07-25T18:17:48Z

CodSpeed Instrumentation Performance Report

Merging #12521 will improve performances by 8.47%

_{Comparing 07-25-perf_lexer_only_check_for_hashbang_at_start_of_file (47a565f) with main (c72f49e)}

Summary

⚡ 4 improvements
✅ 30 untouched benchmarks

Benchmarks breakdown

	Benchmark	`BASE`	`HEAD`	Change
⚡	`lexer[RadixUIAdoptionSection.jsx]`	20.8 µs	20 µs	+3.91%
⚡	`lexer[binder.ts]`	930.2 µs	870.5 µs	+6.85%
⚡	`lexer[cal.com.tsx]`	5.8 ms	5.3 ms	+8.47%
⚡	`lexer[react.development.js]`	384 µs	357.1 µs	+7.54%

camc314

nice 💪

overlookmotel · 2025-07-25T18:52:13Z

Hmm. I'm not sure that benchmark is right. First version made no difference at all. I may have made a mistake.

Copilot

Pull Request Overview

This PR optimizes the lexer by restricting hashbang comment detection to only the first token of a file. Since hashbang comments can only appear at the very start of a file, this eliminates unnecessary checks for every # character encountered during lexing.

Adds a dedicated first_token() method that specifically checks for hashbang comments
Simplifies the # byte handler to only handle private identifiers
Updates parser and benchmark code to use the new first_token() method

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
tasks/benchmark/benches/lexer.rs	Updates benchmark to use `first_token()` method and adds EOF check
crates/oxc_parser/src/lib.rs	Replaces `bump_any()` with `first_token()` call in parser initialization
crates/oxc_parser/src/lexer/mod.rs	Adds new `first_token()` method and inlines `read_next_token()`
crates/oxc_parser/src/lexer/comment.rs	Makes `read_hashbang_comment()` unsafe and removes unnecessary line setting
crates/oxc_parser/src/lexer/byte_handlers.rs	Simplifies `#` handler to only process private identifiers

crates/oxc_parser/src/lexer/comment.rs

tasks/benchmark/benches/lexer.rs

Boshen · 2025-08-10T07:32:34Z

Merge activity

Aug 10, 7:32 AM UTC: The merge label '0-merge' was detected. This PR will be added to the Graphite merge queue once it meets the requirements.
Aug 10, 7:35 AM UTC: Boshen added this pull request to the Graphite merge queue.
Aug 10, 7:42 AM UTC: Merged by the Graphite merge queue.

Small optimization to lexer. A hashbang can only appear at very start of file, so only check for hashbang when getting first token. This streamlines the byte handler for `#`, because a `#` anywhere else can only be a private identifier. Note: `self.token.set_is_on_new_line(true);` in `read_hashbang_comment` is not required, because it's always `true` already.

) Small optimization to lexer. A hashbang can only appear at very start of file, so only check for hashbang when getting first token. This streamlines the byte handler for `#`, because a `#` anywhere else can only be a private identifier. Note: `self.token.set_is_on_new_line(true);` in `read_hashbang_comment` is not required, because it's always `true` already.

github-actions bot added A-parser Area - Parser C-performance Category - Solution not expected to change functional behavior, only performance labels Jul 25, 2025

overlookmotel marked this pull request as ready for review July 25, 2025 18:14

overlookmotel force-pushed the 07-25-perf_lexer_only_check_for_hashbang_at_start_of_file branch from 875bccc to 41cb852 Compare July 25, 2025 18:30

camc314 reviewed Jul 25, 2025

View reviewed changes

overlookmotel marked this pull request as draft July 25, 2025 18:52

Boshen force-pushed the 07-25-perf_lexer_only_check_for_hashbang_at_start_of_file branch from 41cb852 to c181f5f Compare August 10, 2025 07:29

Boshen marked this pull request as ready for review August 10, 2025 07:29

Copilot AI review requested due to automatic review settings August 10, 2025 07:29

Copilot AI reviewed Aug 10, 2025

View reviewed changes

crates/oxc_parser/src/lexer/comment.rs Show resolved Hide resolved

tasks/benchmark/benches/lexer.rs Show resolved Hide resolved

Boshen added the 0-merge Merge with Graphite Merge Queue label Aug 10, 2025

graphite-app bot force-pushed the 07-25-perf_lexer_only_check_for_hashbang_at_start_of_file branch from c181f5f to 47a565f Compare August 10, 2025 07:36

graphite-app bot merged commit 47a565f into main Aug 10, 2025
31 checks passed

graphite-app bot deleted the 07-25-perf_lexer_only_check_for_hashbang_at_start_of_file branch August 10, 2025 07:42

graphite-app bot removed the 0-merge Merge with Graphite Merge Queue label Aug 10, 2025

oxc-bot mentioned this pull request Aug 12, 2025

release(crates): v0.82.0 #13014

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

perf(lexer): only check for hashbang at start of file #12521

perf(lexer): only check for hashbang at start of file #12521

Uh oh!

overlookmotel commented Jul 25, 2025 •

edited

Loading

Uh oh!

overlookmotel commented Jul 25, 2025

Uh oh!

codspeed-hq bot commented Jul 25, 2025 •

edited

Loading

Uh oh!

camc314 left a comment

Uh oh!

overlookmotel commented Jul 25, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Boshen commented Aug 10, 2025 •

edited by graphite-app bot

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

perf(lexer): only check for hashbang at start of file #12521

perf(lexer): only check for hashbang at start of file #12521

Uh oh!

Conversation

overlookmotel commented Jul 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

overlookmotel commented Jul 25, 2025

How to use the Graphite Merge Queue

Uh oh!

codspeed-hq bot commented Jul 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed Instrumentation Performance Report

Merging #12521 will improve performances by 8.47%

Summary

Benchmarks breakdown

Uh oh!

camc314 left a comment

Choose a reason for hiding this comment

Uh oh!

overlookmotel commented Jul 25, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Boshen commented Aug 10, 2025 • edited by graphite-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge activity

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

overlookmotel commented Jul 25, 2025 •

edited

Loading

codspeed-hq bot commented Jul 25, 2025 •

edited

Loading

Boshen commented Aug 10, 2025 •

edited by graphite-app bot

Loading