Supported Modifier Flags

In the October 2021 plenary, @michaelficarra asked that we outline each flag we are considering as a supported modifier and provide motivating examples for it.

The flags currently under consideration are:

- `i` &mdash; ignore-case
  - **Rationale** &mdash; Toggling ignore-case is useful when parts of a pattern differ in case sensitivity, or when the pattern is supplied as a string (for example, via JSON configuration) and flags cannot be passed separately. It also helps with complex Unicode character ranges, where listing both cases by hand is error-prone.
  - **Example** &mdash; Match an uppercase ASCII letter followed by one or more uppercase or lowercase ASCII letters or `'`
    ```js
    const re = /^[A-Z](?i)[a-z']+$/;
    re.test("O'Neill"); // true
    re.test("o'neill"); // false
    
    // alternatively (defaulting to ignore-case):
    const re2 = /^(?-i:[A-Z])[a-z']+$/i;
    ```
  - **Example** &mdash; Match word starting with `D` followed by word starting with `D` or `d` (from .NET documentation, see [^1])
    ```js
    const re = /\b(D\w+)(?ix)\s(d\w+)\b/g;
    const input = "double dare double Double a Drooling dog The Dreaded Deep";
    re.exec(input); // ["Drooling dog", "Drooling", "dog"]
    re.exec(input); // ["Dreaded Deep", "Dreaded", "Deep"]
    ```
- `m` &mdash; multiline
  - **Rationale** &mdash; Flexibility in matching beginning-of-buffer vs. beginning-of-line or end-of-buffer vs. end-of-line in a complex pattern.
  - **Example** &mdash; Match a frontmatter block at the start of a file
    ```js
    // `$` and `^` only assert positions, so `\n` is needed to consume each line break
    const re = /^---(?m)$\n((?:^(?!---$).*$\n)*)^---$/;
    re.test("---a"); // false
    re.test("---\n---"); // true
    re.test("---\na: b\n---"); // true
    ```
- `s` &mdash; dot-all (i.e., "single line")
  - **Rationale** &mdash; Control over `.` matching semantics within a pattern.
  - **Example** &mdash; Let `.` match line terminators only in the middle of a pattern
    ```js
    const re = /a.c(?s:.)*x.z/;
    re.test("a\ncx\nz"); // false
    re.test("abcdxyz"); // true
    re.test("aBc\nxYz"); // true
    ```
- `x` &mdash; Extended Mode. This flag is proposed by https://github.com/tc39/proposal-regexp-x-mode
  - **Rationale** &mdash; Would allow control over significant whitespace handling in a pattern.
  - **Example** &mdash; Disabling `x` mode when composing a complex pattern:
    ```js
    const idPattern = String.raw`[a-z]{2} \d{4}`; // space required; String.raw keeps `\d` intact
    const re = new RegExp(String.raw`
      # match the id
      (?<id>(?-x:${idPattern}))
      
      # match a separator
      :\s
      
      # match the value
      (?<value>\w+)
    `, "x");
    
    re.exec("aa0123: foo")?.groups; // undefined
    re.exec("aa 0123: foo")?.groups; // { id: "aa 0123", value: "foo" }
    ```
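
For comparison, here is a rough sketch of how the ignore-case and dot-all examples above might be written today without modifiers. The names `nameRe` and `spanRe` are illustrative only, and the equivalence is assumed to hold for ASCII input in non-Unicode mode:

```js
// `(?i)` after the first letter: spell out both cases in the character class.
const nameRe = /^[A-Z][A-Za-z']+$/;
nameRe.test("O'Neill"); // true
nameRe.test("o'neill"); // false

// `(?s:.)`: use a class that matches any character, including line terminators.
const spanRe = /a.c[\s\S]*x.z/;
spanRe.test("a\ncx\nz"); // false
spanRe.test("abcdxyz"); // true
spanRe.test("aBc\nxYz"); // true
```

Both workarounds duplicate information inside the pattern, and the duplication grows quickly once the affected subpattern is larger than a single character class.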

Flags likely too complex to support:
- `u` &mdash; Unicode. This flag affects how a pattern is parsed, not how it is matched. Supporting it would likely require a cover grammar and additional static semantics.
- `v` &mdash; Extended Unicode. This flag is proposed by https://github.com/tc39/proposal-regexp-set-notation as an extension of the `u` flag and would have the same difficulties.
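
As a small illustration of why `u` is a parse-time concern, the same source text can parse to two different patterns depending on the flag. This sketch uses only existing syntax; the variable names are illustrative:

```js
// Without `u`, `\u{61}` parses as the letter "u" followed by the quantifier {61}.
const legacy = /^\u{61}$/;
legacy.test("a"); // false
legacy.test("u".repeat(61)); // true

// With `u`, the same source text parses as a code point escape for "a".
const unicode = /^\u{61}$/u;
unicode.test("a"); // true
unicode.test("u".repeat(61)); // false
```

A mid-pattern `u` modifier would therefore change how the text after it has to be parsed, not just how it is matched.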

Flags that will never be supported:
- `g` &mdash; Global. This flag affects the index at which matching starts, not the matching behavior itself. Changing it mid-pattern would have no effect.
- `y` &mdash; Sticky. This flag affects the index at which matching starts, not the matching behavior itself. Changing it mid-pattern would have no effect.
- `d` &mdash; Indices. This flag affects only the shape of the match result. Changing it mid-pattern would have no effect.
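
The following sketch shows, with existing flags only, how `g`, `y`, and `d` act on the start index or on the result object rather than on how the pattern itself matches. The variable names are illustrative:

```js
const input = "ab ab";

// `g`: exec resumes from lastIndex, so successive calls advance through the input.
const globalRe = /ab/g;
const first = globalRe.exec(input); // first.index is 0
const second = globalRe.exec(input); // second.index is 3

// `y`: a match must start exactly at lastIndex.
const stickyRe = /ab/y;
stickyRe.lastIndex = 1;
const stickyAt1 = stickyRe.test(input); // false, nothing matches starting at index 1
stickyRe.lastIndex = 3;
const stickyAt3 = stickyRe.test(input); // true

// `d`: adds an `indices` array to the result; what is matched does not change.
const indices = /a(b)/d.exec(input).indices;
// indices[0] is [0, 2] and indices[1] is [1, 2]
```

In each case the flag is consulted before matching begins or after it ends, which is why a mid-pattern toggle would have nothing to act on.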

[^1]: https://docs.microsoft.com/en-us/dotnet/standard/base-types/miscellaneous-constructs-in-regular-expressions#inline-options

