A Github Action that spell checks Python, Markdown, and Text files.
This action uses PySpelling to check spelling in source files in the designated repository.
- Customizable configuration and spell checking using PySpelling
- Support for the following formats: Python, Markdown and plain text
- Support for
aspell
, Do see the section on Language Support for details - Support for the following languages:
- English
- German
- Spanish
- Do see the section on Language Support for details
- Per repository and format custom word list to avoid errors based on words not known to default dictionary, see: PySpelling for more options
- Flexible repository layout integration via file name matching using Wildcard Match
- Support for Python's Markdown extensions, namely the
pymdown-extensions
via PySpelling
- First you have to add a configuration for the spelling checker
- Create a file named:
.spellcheck.yml
or.spellcheck.yaml
, do note if both files exist the prior will have precedence. Do note the recommendation is hidden files since these configuration files are not first rate citizens of your repository. You can also provide your own configuration file. Check out spellcheck configuration section down below. - Paste the contents of the outlined example, which is a configuration for Markdown, useful for your README file
Do note that this action requires the contents of the repository, so it is recommended used with the Checkout action.
You have to define this part in your workflow, since it not a part of the action itself.
Example:
name: Spellcheck Action
on: push
jobs:
build:
name: Spellcheck
runs-on: ubuntu-latest
steps:
# The checkout step
- uses: actions/checkout@master
- uses: rojopolis/spellcheck-github-actions@0.22.1
name: Spellcheck
This configuration file must be created in a the .github/workflows/
directory.
For example, it could be named .github/workflows/spelling_action.yml
for easy identification, if other actions are present.
By default, this action will use the sources:
list under each task in your config file to identify which files to scan. You can override this behaviour by setting source_files
to the list of files or file patterns you want scanning.
When this option is used, you must also specify the task_name
to override the sources:
list for.
By default, all tasks in your config file will be run. By setting task_name
you can override this and run only the task you require.
A configuration for designated source files could look as follows:
Example:
name: Spellcheck Action
on: push
jobs:
build:
name: Spellcheck
runs-on: ubuntu-latest
steps:
# The checkout step
- uses: actions/checkout@master
- uses: rojopolis/spellcheck-github-actions@0.22.1
name: Spellcheck
with:
source_files: README.md CHANGELOG.md notes/Notes.md
task_name: Markdown
Do check the PySpelling documentation for elaborate details on configuration of.
PySpelling uses the Python Markdown project. PySpelling allows for configuration of the Markdown handling using the [pymdown-extensions]
authored by the author of PySpelling.
If for example wanted to use the superfences
extension, you could configure it as follows:
- pyspelling.filters.markdown:
markdown_extensions:
- pymdownx.superfences:
Current Spellcheck Action support the following extensions (in alphabetical order):
- Arithmatex
- B64
- BetterEm
- Caret
- Critic
- Details
- Emoji
- EscapeAll
- Extra
- Highlight
- InlineHilite
- Keys
- MagicLink
- Mark
- PathConverter
- ProgressBar
- SaneHeaders
- SmartSymbols
- Snippets
- StripHTML
- SuperFences
- Tabbed
- Tasklist
- Tilde
Please consult the documentation for the extensions for more details.
Currently only the case of use of superfences
has been requested as outlined in the above example.
Do also see the Diagnostics sections below, demonstrating diagnostics emitted from Python Markdown, which might require the use of an extension.
You can either provide a path to the configuration file or save a file in the root of your repository with a predefined name (list below). If config_path
is provided then it will be used and the other configuration options will be ignored. If config_path
is not provided then the repository is searched after a first match
Example:
name: Spellcheck Action
on: push
jobs:
build:
name: Spellcheck
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@master
- uses: rojopolis/spellcheck-github-actions@0.22.1
name: Spellcheck
with:
config_path: config/.spellcheck.yml # put path to configuration file here
source_files: source/scanning.md source/triggers.md
task_name: Markdown
.spellcheck.yml
.spellcheck.yaml
spellcheck.yml
spellcheck.yaml
(the old default)
And is attempted read in that order, meaning first match is used, This means that you can use files prefixed with the .
to have a less intrusive Spellcheck configuration in your repository.
matrix:
- name: Markdown
aspell:
lang: en
dictionary:
encoding: utf-8
pipeline:
- pyspelling.filters.markdown:
- pyspelling.filters.html:
comments: false
ignores:
- code
- pre
sources:
- '**/*.md'
default_encoding: utf-8
The above configuration will check the spelling of your repository's README.md
and other Markdown files against an English dictionary. If your Markdown is named differently, correct or add additional patterns under sources
, Markdown is sometimes named .mkdn
.
When and if the run locates spelling errors, you have two options:
- Correct the spelling errors in the relevant files
- Add the relevant words to a custom word list, to be ignored
If you do the latter, you have to add the following to the Spellcheck configuration, under dictionary
:
wordlists:
- .wordlist.txt
This supplies a custom list of words to supply the default dictionary for the specified language, in this case set to English en
under aspell
.
The complete configuration should resemble this:
matrix:
- name: Markdown
aspell:
lang: en
dictionary:
wordlists:
- .wordlist.txt
encoding: utf-8
pipeline:
- pyspelling.filters.markdown:
- pyspelling.filters.html:
comments: false
ignores:
- code
- pre
sources:
- '**/*.md'
default_encoding: utf-8
Change the configuration to suit your repository and needs, please see the examples/
directory for more example configurations.
This action currently only support aspell
, please see the section on Language Support below.
In the section for aspell
you can specify the main language, for example en
, via the lang
parameter.
You can further specify dialect, using the d
parameter.
See the documentation for PySpelling for more details.
The GitHub Action helps you make sure most spelling errors do not make it into your repository. You can however check your spelling prior to committing and pushing to your repository.
This simply uses the contents of our spelling toolchain:
$ pyspelling -c .spellcheck.yml
Misspelled words:
...
!!!Spelling check failed!!!
We can correct the error(s) pointed out by PySpelling as we go by adding new words to our local file: .wordlist.txt
And at some point we get:
$ pyspelling -c .spellcheck.yml
Spelling check passed :)
Now we should be good to go.
Do note you could also use the entrypoint.sh
, which is the script used in the Docker image.
± sh entrypoint.sh
Using pyspelling on repository files outlined in .spellcheck.yml
----------------------------------------------------------------
Spelling check passed :)
Currently only the following languages are supported via GNU Aspell:
- English via the
aspell-en
Debian package, supporting:- American (
en_US
), - British (
en_GB
), - Canadian (
en_CA
) - and Australian (
en_AU
)
- American (
- German via the
aspell-de
Debian package, supporting:- German (
de_DE
), - Swiss (
de_CH
) - Austrian (
de_AT
)
- German (
- Spanish via the
aspell-es
Debian package
Additional languages can be added by request, please open an issue.
Hunspell is supported by PySpelling, but is not currently supported by this action
Please open an issue or PR, if Hunspell should be evaluated for possible inclusion.
The awesome tool dependabot lets you scan your used GitHub Marketplace Actions and lets you know if they are in need of an update.
The update is proposed via a pull request, which can be accepted or declined, it will itself take care of deleting pull requests if these become irrelevant.
You specify the configuration in the file: .github/depedabot.yml
in your repository using this action - actually it scans all your actions.
# Basic dependabot.yml file
# REF: https://docs.github.com/en/code-security/supply-chain-security/keeping-your-actions-up-to-date-with-dependabot
version: 2
updates:
# Enable version updates for Actions
- package-ecosystem: "github-actions"
# Look for `.github/workflows` in the `root` directory
directory: "/"
# Check for updates once a week
schedule:
interval: "weekly"
This tip works for aspell
.
You can slim down your .wordlist.txt
file if you have case variations of entries of words.
aspell:
ignore-case: true
To convert you existing .wordlist.txt
you could do something along the lines of this using Bash version 4.
$ tr '[:upper:]' '[:lower:]' < .wordlist.txt > temp-wordlist.txt
$ cat temp-wordlist.txt | sort -u > .wordlist.txt
$ rm temp-wordlist.txt
And you should be good to go.
This is a list of common diagnostics, which can be emitted by the action and it's tools.
This indicates that a spelling check has been completed, but spelling errors were located and should be corrected.
- Either correct pinpointed spelling errors
- Or add pinpointed words to custom dictionary
Please see the section: "Checking For Bad Spelling" above.
This diagnostic indicates that files outlines by the source
wildcard pattern match did not match any files.
- Either adjust the pattern
- Or remove the configuration part since it does not match the repository contents
- Or set
expect_match
tofalse
matrix:
- name: markdown
pipeline:
- pyspelling.filters.text
sources:
- '**/*.md'
expect_match: false
default_encoding: utf-8
Please see the documentation for Wildcard Match (1 and 2) or [Expect Match][expectmatch].
This diagnostic indicates that a custom word list has been specified in the used configuration, .spellcheck.yml
, but the file does not exist.
- Create the empty file
$ touch .wordlist.txt
Please see the section: "Configuration" above.
This diagnostic indicates that the configuration file pointed to with the --config
(-c
) parameter cannot be located.
- Check that a file with the indicated name exists.
ValueError: Unable to find or load pyspelling configuration from spellcheck.yaml
Indicates: spellcheck.yaml
so this file should exist in the repository.
If the file is available in the repository, please check that your workflow is configured correctly, with the following line, which enables the action: checkout.
uses: actions/checkout@master
In full context:
name: Spellcheck Action
on: push
jobs:
build:
name: Spellcheck
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@master
- uses: rojopolis/spellcheck-github-actions@0.22.1
name: Spellcheck
This step adds an action, which checkout out the repository for inspection by linters and other actions like this one.
This indicates issues with the Markdown and is reported by Markdown
(See: PyPi site).
PySpelling does however support extension of the standard Markdown parser and you can specify the use of extensions of these are support.
This action support the extensions included in: pymdown-extensions
(See: PyPi site)
And you can then put these to use in your configuration. The example below outlines the superfences
extension.
- pyspelling.filters.markdown:
markdown_extensions:
- pymdownx.superfences:
Please see the repository's requirements.txt
for a list of all included Python modules and their exact versions.
This error emitted from pyspelling
indicates issues with the configuration file.
Please see the section: "Configuration" above.
With the update of pyspelling from 2.6.1 to 2.7.3 with release 0.16.0 (0.15.0, do see the change log).
This error would be emitted from the GitHub Action, if the configuration was not properly formatted. The Markdown example's earlier revisions in this repository would demonstrate the error.
To address the error, the options below the filter should be properly indented.
- pyspelling.filters.html:
comments: false
ignores:
- code
- pre
Should be indented to:
- pyspelling.filters.html:
comments: false
ignores:
- code
- pre
The complete error emitted could would look something along the lines of:
Traceback (most recent call last):
File "/usr/local/bin/pyspelling", line 8, in <module>
sys.exit(main())
File "/usr/local/lib/python3.9/site-packages/pyspelling/__main__.py", line 30, in main
return run(
File "/usr/local/lib/python3.9/site-packages/pyspelling/__main__.py", line 55, in run
for results in spellcheck(
File "/usr/local/lib/python3.9/site-packages/pyspelling/__init__.py", line 673, in spellcheck
for result in spellchecker.run_task(task, source_patterns=sources):
File "/usr/local/lib/python3.9/site-packages/pyspelling/__init__.py", line 311, in run_task
self._build_pipeline(task)
File "/usr/local/lib/python3.9/site-packages/pyspelling/__init__.py", line 255, in _build_pipeline
raise ValueError(STEP_ERROR.format(str(step)))
ValueError: Pipline step in unexpected format: {'pyspelling.filters.html': None, 'comments': False, 'ignores': ['code', 'pre']}
Example lifted from issue #60
This action is based on a Docker image available on DockerHub.
This mean that if you developing your own spell checking action you can use this image.
Alternatively you can build your own Docker image based on the Dockerfile
in this repository.
The images are build from the GitHub repository master branch.
The recommended use is to use the latest release with a version tag. See the release history for details.
Whereas the tag latest
just reflect the latest build based on the master branch.
The master branch might contain changes not tagged as released yet and can be regarded as unstable or experimental. Changes such as corrections to documentation etc. will not be tagged until separately as a general rule, unless the changes are significant, but the aim is to keep the documentation relevant and up to date.
The GitHub Action is based on a Docker implementation.
The Dockerfile
contains the image building and the entrypoint.sh
, which acts as ENTRYPOINT
for the Docker image describes the execution part.
You can test the action locally by building the Docker image and running it against your project/repository.
First you have to build it.
Download or fork the spellcheck action repository.
Unpack or clone the source code and build the Docker image.
$ docker build -t github-action-spellcheck .
Run the newly build Docker image.
Do note the project/repository has to contain a configuration, please see the section on configuration above:
$ cd <your project/repository directory>
$ docker run -it -v $PWD:/tmp github-action-spellcheck
The original author of this GitHub Action is Robert Jordan (@rojopolis)
Here follows a list of contributors in alphabetical order:
- @aSemy
- Albert Volkman, @albertvolkman
- Byron Miller, @supernovae
- Isaac Muse, @facelessuser
- Jonas Brømsø, @jonasbn
- José Eduardo Montenegro Cavalcanti de Oliveira, @edumco
- Matt Calvert, @miff2000
- Michael Flaxman, @mflaxman
- Pavel Skipenes, @pavelskipenes
- Peter Petrik, @PeterPetrik
- Riccardo Porreca, @riccardoporreca
- Stephen Bates, @sbates130272
Do you want to be left out, or feel left out of this list or have a different representation of your name, please submit a pull request or raise an issue
This repository is licensed under the MIT license.