-
Notifications
You must be signed in to change notification settings - Fork 47
Spellchecker for English LP content #523
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
This is great, @spier ❗️ Thanks for this work. Very excited to take a look. |
This is great! Where does the output go after the job is run? |
If this were to raise a PR with the fixes it made it'd be great. |
@lenucksi the spell checker does not make any fixes itself. What it does is that it shows the spelling issues in the output of the GitHub Action. It is then up to the author to review those issues and decide how to fix them. The options are:
FYI for our patterns repo I am experimenting with another spell checker right now (Vale). I am not sure yet which spell checker we will go with for our patterns repo. So if you like, we can move this PR here back to Draft for the moment. Up to you. |
Most mistakes are very likely small errors like swapped out letters. All of that yields a very small edit distance.
Having to manually extract edits and a diff here really sounds like more new work created than time saved, so I'd not use such a feature. As for the style consistency there's still #222 which would address this. There's tooling to address this in a professional and effective way, it's not free though. Knowing those tools they are usually more interactive and less CI friendly, so that's probably not really first order of business - as was the conclusion of #222 as well. |
This is very 🆒 , @spier . I will bring this up in our next working group meeting.
I think that there's value in being on the same one, so let's see what you come up with.
As comments or something? That would be great.
I see it now. Great. |
@spier is there an issue or a pull request that we can follow in the InnerSourcePatterns repo for us to stay in touch with what spellchecker you are choosing? We are happy to follow your lead, here. Thanks for including us :). |
@rrrutledge good idea. here is the issue: InnerSourceCommons/InnerSourcePatterns#519 |
Thank you, @spier ❗️ I added myself as a watcher. |
We should update our |
Hey @spier! It seems like progress on this PR has slowed. But, we have a new group of articles that will be published soon and I think having a spell checker in place would be a great help with them. What, if anything, can I do to help get these changes pushed through? I can see from the conversation in InnerSourceCommons/InnerSourcePatterns#519 that you'd prefer to change the spell checker to Vale. Could you maybe list out the relevant config files (versus ones you may have been experimenting with) from InnerSourcePatterns repo that we should use over here? I could help get the code changes copied over. |
@marshmallowrobot thanks for reaching out. You are right, there is no further progress on this from my end. And yes, my last status was that Vale looks like the better option for us, given that the integrations into GitHub seemed nicer. See also the summary of the overall status of the experiments I did with vale. Thank you for your offer to help! Unfortunately I forgot where I was at exactly, so I cannot even ask for specific help right now. I have to look at this in more detail again to determine the most minimal set of configuration files that you would need to make vale useful for you. Will let you know what I find, however it may take a bit. |
Thanks for this work and this update, @spier ! We are very thankful and excited for spell checking. Let us know if you have a rough idea of when you might work on it. Any schedule is OK, but if you think that it will be a while before working on it some more then we may merge what's here so that at least we have some spell checking in the meantime until your fancy, new Vale work is finished. |
That is good clarification @rrrutledge. From previous comments I thought that the current solution would not be that useful for the LP content, which was one of the reasons for trying out vale. So I will keep the current solution as a fallback option. As for an estimate, I hope to have time around the Easter days, so within the next 10 days or so should be realistic. |
|
@spier we saw that you created https://github.com/InnerSourceCommons/isc-styles for some more Vale work - so it looks like this may be moving along? That's great! |
@rrrutledge hehe you got your eyes everywhere :) Yeah, I am trying to package up the styles/checks in a central place, so that it is easier to create some level of consistency across various ISC material. I am not 100% there yet but I have a basic idea of how it might work. So yeah, this is moving along, slowly but surely :) |
Closing this in favor of #560 |
I added a basic setup for a GitHub Action that runs a spellchecker for the LP content.
This is modelled after a very similar approach that we are just introducing for our Patterns (see InnerSourceCommons/InnerSourcePatterns#519).
A key difference here is that the tool used here, pyspelling, does seem to work a bit better with markdown, than it works with asciidocs. However I tweaked the configuration based on an example that I found, and it looks like it works well enough.
To judge the effect of the spellchecker, please compare the results of the 1st run, with the 2nd run. Before the 2nd run I had already fixed a couple of obvious spelling errors.
Left to do:
wordlist.txt
and fix the respective spellings (mostly just the word "organisation" 😄).pyspelling.yml
and thewordlist.txt
, so that the spellchecker passes on the current content (adapting the failing parts of the content is of course also another way to fix this)I am happy to answer your questions. Just note, that I am only using pyspelling/aspell for a couple of days myself now :)