You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
refactor(Llama): enhance error handling and cleanup in eval method
- Wrap `decode` in a try-except block to provide detailed error context (position, batch size) on failure.
- Capture and log the result of `memory_seq_rm` to assist in debugging KV cache issues.
- Add an early return for empty token lists.
- Refactor loop variables and state updates for better clarity.
- Remove dead code related to logits processing.
0 commit comments