Flatten i18n message keys in message JSON

## Problem


Our current vue/i18n messages files utilise nested JSON objects to discriminate keys. A sample from our `en.json5` looks like this:

```json5
{
  "404": {
    title: "The content you’re looking for seems to have disappeared.",
    main: "Go to {link} or search for something similar from the field below.",
  },
  hero: {
    subtitle: "Explore more than 800 million creative works",
    description: "An extensive library of free stock photos, images, and audio, available for free use.",
    search: {
      placeholder: "Search for content",
    },
    disclaimer: {
      content: "All {openverse} content is under a {license} or is in the public domain.",
      /**
      Interpolated into hero.disclaimer.content:
       _{license}_ part of "All Openverse content is under a {license} or is in the public domain."
       */
      license: "Creative Commons license",
    },
  },
}
```

Code that references these keys does so in a dot-delimited path format. For example, `hero.disclaimer.content` references the key at that path in the nested object.

While nested objects may provide a slight advantage to authorship of the messages file, it presents a severe disadvantage when trying to find the message from a key in the code referencing it. Using the example above, if you wanted to see the message associated with the key `hero.disclaimer.content`, and tried to search the codebase for that literal string, you would not find it. Instead, you would have to know to navigate to the `en.json5`, and then find the `hero` key, find its `disclaimer` key, and then finally the `content` key. Sometimes you can shortcut this by searching for just the final part of the key, in this case `content`. However, we have 31 keys that end with the segment `.content`, so searching `content:` in the file would still require looking through individual instances to find it. The example above also uses relatively small and shallow nesting and does not illustrate the additional difficulty of navigating deeply nested keys in large collections of messages, like `sensitive.designations.userReported.title.description.a`.

## Description



Instead, I propose we "flatten" the messages objects where the keys are the full path to the message. In other words, remove all nesting.

For example, the messages excerpt above would turn into this, instead:

```json5
{
  "404.title": "The content you’re looking for seems to have disappeared.",
  "404.main": "Go to {link} or search for something similar from the field below.",

  "hero.subtitle": "Explore more than 800 million creative works",
  "hero.description": "An extensive library of free stock photos, images, and audio, available for free use.",
  "hero.search.placeholder": "Search for content",
  "hero.disclaimer.content": "All {openverse} content is under a {license} or is in the public domain.",
  /**
  Interpolated into hero.disclaimer.content:
    _{license}_ part of "All Openverse content is under a {license} or is in the public domain."
    */
  "hero.disclaimer.license": "Creative Commons license",
}
```

This format is backwards compatible with our existing messages objects. If you replace the 404 and hero objects with the flattened version above and run your local frontend, there is zero issue.

**The benefit of this approach is that it is easily searchable in both directions**. From the messages file, it is easier to find uses of the keys. From runtime code, it is easier to find the content of the translation string. Additionally, our `json-to-pot` script could be simplified, as right now it has to collapse keys when converting to POT, because `POT` is a flat-format.

Our POT-to-JSON conversion re-explodes the keys into the nested object. We should retain this behaviour in the final output messages files. A meaningful downside to the flat format is an increase in the total character size of the keys, due to the repeated strings.

<details>
<summary>The nested example minifies to 509 characters (expand for minified version).</summary>

```json
{"404":{"title":"The content you’re looking for seems to have disappeared.","main":"Go to {link} or search for something similar from the field below."},"hero":{"subtitle":"Explore more than 800 million creative works","description":"An extensive library of free stock photos, images, and audio, available for free use.","search":{"placeholder":"Search for content",},"disclaimer":{"content":"All {openverse} content is under a {license} or is in the public domain.","license":"Creative Commons license"}}}
```

</details>

<details>
<summary>The flattened example minifies to 527 characters (expand for minified version).</summary>

```json
"404.title":"The content you’re looking for seems to have disappeared.","404.main":"Go to {link} or search for something similar from the field below.","hero.subtitle":"Explore more than 800 million creative works","hero.description":"An extensive library of free stock photos, images, and audio, available for free use.","hero.search.placeholder":"Search for content","hero.disclaimer.content":"All {openverse} content is under a {license} or is in the public domain.","hero.disclaimer.license":"Creative Commons license"}
```

</details>

Because this change is proposed to improve authorship (not transport or anything else relevant to the final generated files), and because it is backwards compatible with the nested format, we should retain the nested format for the produced JSON files.

To implement this change, we will need to flatten the `en.json5`'s keys. This can be done by hand, or using something like [this online JSON flattening tool](https://www.coderstool.com/flatten-json), except that tool and others like it strip comments and make other unwanted transformations, so would still require manual changes to address those... Because `json-to-pot` already has to flatten the keys for use in the POT files, it shouldn't be too hard to adapt our existing `json-to-pot` script into a `json-to-flattened-json` script that preserves comments, single-quoted strings (which we use to avoid needing to escape double quotes in some strings), etc. I would recommend that approach, it seems like the least tedious option to me!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flatten i18n message keys in message JSON #4979

sarayourfriend
openedon Sep 23, 2024

Problem

Description

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Flatten i18n message keys in message JSON #4979

Description

sarayourfriendopenedon Sep 23, 2024

Problem

Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

sarayourfriend
openedon Sep 23, 2024