Closed
Description
For certain models, the chat causes infinite newlines after a few turns. I had specifically observed while using Blue-Orchid-2x7b-Q5_K_M that was quantized from here. The infinite newlines are caused after 5 turns for this particular model.
This is a regression caused by PR #6127 . I verified that reverting the commit from that PR fixes the problem.