You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
“Notably, except for ORPO, almost all approaches lead to consistent drops in one or more settings.”
But ARC shows that commonsense reasoning is improved.
However, the contradiction between reasoning tasks is counter-intuitive.
I wonder if this is caused by the difficulty difference.
So I am curious about the results on the ARC-C subset.
It would be so nice of you if you could curate related data and report the numbers.
The text was updated successfully, but these errors were encountered:
“Notably, except for ORPO, almost all approaches lead to consistent drops in one or more settings.”
But ARC shows that commonsense reasoning is improved.
However, the contradiction between reasoning tasks is counter-intuitive.
I wonder if this is caused by the difficulty difference.
So I am curious about the results on the ARC-C subset.
It would be so nice of you if you could curate related data and report the numbers.
The text was updated successfully, but these errors were encountered: