Skip to content

Check spelling, links of website HTML #56

Open

Description

Now that #53 is more formally publishing a website, additional measures would be a nice addition to achieve (and keep) high quality text, and reduce review time.

I don't know if these would be things that would need their own JEP, or are just nuts-and-bolts that fall out of (and would need documenting anwyay) in #29.

Notionally these would be run on every PR push.

spelling

tool: hunspell

There are a bunch of other ones, but hunspell, can be installed via most system package manager or conda. windows is less rosy.

I've found it best set up to test against generated HTML, where the semantics are more clear than markdown.... though would be happy to hear if anyone has something better.

While somewhat odius, maintaining a single dictionary.txt (just a sorted list of words, or a cleverer, but less portable stem-aware confection) across a corpus is good for keeping jargon and names consistent, e.g. JupyterHub vs Jupyterhub.

This also interestingly adds some more punch to the "new terms" part of the template.

links

tool: https://github.com/jupyterlab/pytest-check-links

Used on other jupyter projects, though other options exist. We recently added some caching stuff to try to reduce the amount of rate limiting that was occurring for some public sites. The in-document anchor checking doesn't quite work for sphinx output, but a fix has been merged, and could probably be released.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions