Add skill to handle ROT13 #271

mz-pdm · 2024-03-08T21:35:09Z

If your PR is related to a contribution to the taxonomy, please, fill
out the following questionnaire. If not, replace this whole text and the
following questionnaire with whatever information is applicable to your PR.

Describe the contribution to the taxonomy

Help the model understand ROT13 encoding.

Input given at the prompt

what is rot13 content of "uryyb"?

Response that was received

│ Rot13 is a simple encryption algorithm that replaces each letter in the input with the letter that   │
│ is 13 positions away in the alphabet. To decode the Rot13 content of "uryyb", we can reverse the     │
│ process by replacing each letter with the letter that is 13 positions away in the opposite           │
│ direction:                                                                                           │
│                                                                                                      │
│ U -> A                                                                                               │
│ R -> P                                                                                               │
│ Y -> X                                                                                               │
│ Y -> X                                                                                               │
│ B -> B                                                                                               │
│                                                                                                      │
│ So, the original content of "uryyb" was "APPLE". Rot13 is often used as a lighthearted form of       │
│ censorship or privacy protection, where sensitive information can be encoded in a way that is easily │
│ decoded by anyone who knows the Rot13 encoding scheme.                                               │

Response that is now received instead

I don't have means to test it.

...

Contribution checklist

lab generate fails for me with timeout.

tested contribution with lab generate
lab generate does not produce any warnings or errors
all commits are signed off (DCO)
the qna.yaml file was linted

bjhargrave

I am not sure this simple skill can teach the rot13 algorithm to the model.

The model has already seen rot13 content:

>>> Decode the following text from ROT13: Uryyb, jbeyq!                                               [S][default]
╭───────────────────────────────────────────── merlinite-7b-Q4_K_M ──────────────────────────────────────────────╮
│ ROT13 is a simple substitution cipher where each letter in the plaintext is replaced with the letter 13        │
│ positions down from it in the alphabet. In this case, "Uryyb" is decoded to "Hello," and "jbeyq" is decoded to │
│ "world." So, the decrypted message is "Hello world!"                                                           │
╰──────────────────────────────────────────────────────────────────────────────────────── elapsed 1.738 seconds ─╯
>>> Decode the following text from ROT13: Gur 3eq cynarg beovggvat gur Fha vf gur Rnegu               [S][default]
╭───────────────────────────────────────────── merlinite-7b-Q4_K_M ──────────────────────────────────────────────╮
│ The given text is written in ROT13 encoding, where each letter is shifted 13 positions down the alphabet.      │
│ Decoding the message, we get "The 300th word in the English dictionary is the word 'cipher'." This refers to   │
│ the word "cipher" being the 300th most frequently used word in the English language.                           │
╰──────────────────────────────────────────────────────────────────────────────────────── elapsed 1.805 seconds ─╯

but the second answer is way off. The first response seems to be text the model has already been trained on.
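For reference, the expected decodings of the prompts above can be checked with a few lines of Python. This is only a minimal sketch and is not part of the PR; the names `ROT13_TABLE` and `rot13` are chosen here for illustration.

```python
import string

# Translation table mapping each ASCII letter to the one 13 places away,
# wrapping around the alphabet. Digits, spaces, and punctuation are not
# in the table, so str.translate leaves them unchanged.
_LOWER = string.ascii_lowercase
_UPPER = string.ascii_uppercase
ROT13_TABLE = str.maketrans(
    _LOWER + _UPPER,
    _LOWER[13:] + _LOWER[:13] + _UPPER[13:] + _UPPER[:13],
)


def rot13(text: str) -> str:
    """Apply ROT13; the same function both encodes and decodes."""
    return text.translate(ROT13_TABLE)


print(rot13("uryyb"))          # hello
print(rot13("Uryyb, jbeyq!"))  # Hello, world!
print(rot13("Gur 3eq cynarg beovggvat gur Fha vf gur Rnegu"))
# The 3rd planet orbitting the Sun is the Earth
```

Because 13 is half of 26, applying `rot13` twice returns the original text, so there is no separate "opposite direction" for decoding.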

ckadner

could you rename the file to qna.yaml and place it inside a folder called rot13?

obuzek · 2024-03-08T23:29:41Z

It's not clear to me that expecting the model to have background knowledge of ROT13 makes sense, even though the reply from the model implies that it does. For complex rulesets like this, I'd suggest including the definition of how the ruleset works in the context (in much the same way as the LLM response you got did).

I would also suggest adding a line to each of your answers where the model explains the reasoning that helped it get to the answer, as the original LLM response does.
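As an illustration of that suggestion, an answer that spells out its reasoning could be generated along these lines. This is a hedged sketch only: `rot13_char` and `explain_rot13` are hypothetical helpers, and the wording of the explanation is an assumption, not the text used in the PR's qna.yaml.

```python
def rot13_char(ch: str) -> str:
    """Shift one ASCII letter by 13 places; return anything else unchanged."""
    if "a" <= ch <= "z":
        base = ord("a")
    elif "A" <= ch <= "Z":
        base = ord("A")
    else:
        return ch
    return chr((ord(ch) - base + 13) % 26 + base)


def explain_rot13(text: str) -> str:
    """Build an answer listing each letter mapping, then the final result."""
    decoded = "".join(rot13_char(ch) for ch in text)
    # Only letters change under ROT13, so list just the characters that moved.
    steps = [f"{ch} -> {rot13_char(ch)}" for ch in text if rot13_char(ch) != ch]
    summary = f'So, the ROT13 content of "{text}" is "{decoded}".'
    return "\n".join(steps + [summary])


print(explain_rot13("uryyb"))
```

For "uryyb" this lists u -> h, r -> e, y -> l, y -> l, b -> o, followed by the summary line giving "hello".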

mz-pdm · 2024-03-11T11:22:36Z

Changes in the updated version:

Renamed and moved the YAML file.
Added a commentary to the commit message explaining that the purpose is to provide more examples to the model to help it correct its mistakes regarding ROT13, not to teach it ROT13. (I have no idea whether it actually helps but maybe it's worth trying.)
Expanded the answers with some explanations.

mz-pdm · 2024-03-11T11:31:33Z

... and added task_description as requested by the linter (BTW README says it's optional).

spacew · 2024-03-22T01:57:40Z

Thanks for the submission and modifications! It looks interesting. Based on the current guide, do you mind ensuring that there are at least five examples with good diversity, and that attribution is added to the examples?

It might be good to introduce some examples with less verbosity in the answers.

The model already knows about ROT13 but it sometimes provides completely wrong answers when being asked to decode it, for example:

- What is rot13 content of "uryyb"?
- Rot13 is a simple encryption algorithm that replaces each letter in the input with the letter that is 13 positions away in the alphabet. To decode the Rot13 content of "uryyb", we can reverse the process by replacing each letter with the letter that is 13 positions away in the opposite direction: U -> A R -> P Y -> X Y -> X B -> B So, the original content of "uryyb" was "APPLE". Rot13 is often used as a lighthearted form of censorship or privacy protection, where sensitive information can be encoded in a way that is easily decoded by anyone who knows the Rot13 encoding scheme.

Let's provide more examples to the model in the hope it'll help correct its mistakes.

Signed-off-by: Milan Zamazal <mzamazal@redhat.com>

mz-pdm · 2024-03-22T15:53:43Z

Changes in the last version:

Added more, less verbose, examples, as requested.
Attributions added.
Reordered according to the updated guide.
Rebased on main.

n1hility · 2024-04-22T00:55:50Z

Thank you for your contribution to InstructLab! Unfortunately, once a GitHub repo is made public, all open PRs are automatically closed since they link against a private repo. We have detected that your PR might have been one of the ones affected by this change. If you are still interested in contributing your improvement, please fill out the following short form by no later than May 3rd, and we will get back to you with the additional steps necessary once we have had time to assess the PRs of those still interested:

https://forms.gle/V7SrPPMZDo6iGDYu8

mz-pdm requested a review from a team as a code owner March 8, 2024 21:35

bjhargrave requested changes Mar 8, 2024

ckadner requested changes Mar 8, 2024

ckadner added the triage-requested-changes label (skill has been reviewed; changes requested from contributor) Mar 8, 2024

ckadner assigned mz-pdm Mar 8, 2024

ckadner marked this pull request as draft March 8, 2024 22:53

obuzek added the skill category: complex rulesets label Mar 8, 2024

ckadner force-pushed the pdm-rot13 branch from 4c11ec0 to b22d161 Compare March 9, 2024 04:20

mz-pdm force-pushed the pdm-rot13 branch from b22d161 to a3f57ba Compare March 11, 2024 11:18

mz-pdm force-pushed the pdm-rot13 branch from a3f57ba to 2b8f527 Compare March 11, 2024 11:29

codificat mentioned this pull request Mar 11, 2024

Add a skill to try to improve ROT13 generation #332

Closed

4 tasks

mairin added the skill (Auto labeled) label Mar 16, 2024

mz-pdm force-pushed the pdm-rot13 branch from 2b8f527 to 719f197 Compare March 22, 2024 15:50

jjasghar closed this Apr 19, 2024
