MTalk-Bench: Benchmarking Multi-Turn Speech Dialogue Benchmark In this paper, we want to test the multiple-turn dialogue abilities of Speech to Speech Large Language Models (S2S LLMs). Contents Benchmark focusing on text information Benchmark focusing on ambient sound, paralinguistic information, and robustness Benchmark focusing on text information Benchmark focusing on ambient sound, paralinguistic information, and robustness