You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for nice work! I have two questions.
The first one is about length norm in calculating the conditional log probability. According to the paper and common practice, the denominator should be the length of response.
However, according to the code:
the denominator seems to include the padding part. Could you please check it?
The second question I wonder is the proper way to show experiment results.
The paper says,
Do you mean choosing the best result under each temperature , or choose the best temperature according to Pass@1 or something?
Thank you for reply.
The text was updated successfully, but these errors were encountered:
Hi,
Thanks a lot for your issue!
Regarding your first question, you are right and we should only include response part of the mask in the denominator.
For your second issue, we chose the best result under different temperature configurations for each metric, i.e., pass@1, pass@5, pass@10.
Thanks for nice work! I have two questions.
The first one is about length norm in calculating the conditional log probability. According to the paper and common practice, the denominator should be the length of response.
However, according to the code:
RTL-Coder/train/mle_scoring.py
Line 199 in 3394cce
the denominator seems to include the padding part. Could you please check it?
The second question I wonder is the proper way to show experiment results.
The paper says,
Do you mean choosing the best result under each temperature , or choose the best temperature according to Pass@1 or something?
Thank you for reply.
The text was updated successfully, but these errors were encountered: