-
Current write up: TLDR
-
This looks fascinating! I'm particularly struck by the graph in the write-up labeled "The confidence/self-regulation/'wait' mechanism". It looks like, if you smoothed it a bit, you'd see large-scale structure, almost like a progress measure.
-
Hello, just checking in on whether this project is still active. If it is not, we would like to switch the project status to Inactive until there is activity again, so that our project statuses stay accurate.
-
Hello everyone! It has been a long time since the previous update, but I am excited to share some cool recent progress!
TLDR
The full write-up is here: https://docs.google.com/document/d/1ayrLFQaR58HPBU4YXk219vNTpCCBljdrE8Hq1Sbzncw/edit?tab=t.0
I would be happy to receive any feedback and suggestions! Unfortunately, I wasn't able to get the main results in time for NeurIPS, so I aim to expand the scope for ICLR.
-
Research Question
How do reasoning models solve BlocksWorld problems?
Can we extract their changing internal world representation from the CoT?
Can we find mechanisms for their action search?
How do they solve the obfuscated "mystery" BlocksWorld?
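For readers unfamiliar with the task, here is a minimal sketch of a BlocksWorld instance and a brute-force solver. It is only an illustration and not the setup used in this project: the state encoding, the single "move a clear block" action (a simplification of the usual pick-up/put-down/stack/unstack operators), and the function names `clear_blocks`, `successors`, and `plan` are all assumptions made for this example. It does not model the obfuscated "mystery" variant.

```python
from collections import deque

# Assumed encoding: a state maps each block to what it rests on,
# either another block's name or the string "table".

def clear_blocks(state):
    """Return the blocks that have nothing stacked on top of them."""
    supports = set(state.values())
    return [b for b in state if b not in supports]

def successors(state):
    """Yield (move, next_state) for every legal single-block move."""
    clear = clear_blocks(state)
    for block in clear:
        # Move a clear block down to the table, unless it is already there.
        if state[block] != "table":
            yield (block, "table"), {**state, block: "table"}
        # Move a clear block onto another clear block.
        for target in clear:
            if target != block and state[block] != target:
                yield (block, target), {**state, block: target}

def plan(start, goal):
    """Breadth-first search for a shortest move sequence from start to goal."""
    def key(s):
        return tuple(sorted(s.items()))

    frontier = deque([(start, [])])
    seen = {key(start)}
    while frontier:
        state, moves = frontier.popleft()
        if state == goal:
            return moves
        for move, nxt in successors(state):
            if key(nxt) not in seen:
                seen.add(key(nxt))
                frontier.append((nxt, moves + [move]))
    return None  # goal unreachable

# B sits on A; C is on the table. Goal: the tower A on B on C.
start = {"A": "table", "B": "A", "C": "table"}
goal = {"A": "B", "B": "C", "C": "table"}
steps = plan(start, goal)
```

Each entry in `steps` is a `(block, destination)` pair. A search like this tracks the full world state explicitly after every move, which is the kind of evolving representation the questions above ask about in a model's CoT.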
Owners
Dmitrii Kharlapenko (@kisate)
Project status
Work in progress
Current findings