
2nd MLC-SLM Challenge Launches, Advancing Multilingual Conversational Speech Understanding

MLC-SLM Challenge

LOS ANGELES, CA, UNITED STATES, April 13, 2026 /EINPresswire.com/ -- The 2nd Multilingual Conversational Speech Language Model (MLC-SLM) Challenge has officially opened for registration, inviting research teams and practitioners worldwide to participate. Built on a multilingual conversational speech training set covering 14 languages and approximately 2,100 hours of data, this year’s challenge focuses on key tasks including speaker segmentation, automatic speech recognition (ASR), and dialogue understanding, further pushing speech language model research from simple transcription toward deeper conversational understanding.

Targeting Real-World Multilingual Conversations

As speech language models continue to evolve, real-world multilingual conversations are becoming an increasingly important research direction. Unlike conventional ASR tasks, these scenarios involve multiple speakers, multi-turn interactions, and more complex acoustic and semantic information. Systems are expected not only to transcribe speech accurately, but also to determine who spoke when and ultimately understand the conversation as a whole.

The 2nd MLC-SLM Challenge is designed around this shift, focusing on multilingual conversational speech tasks that are closer to real application settings and providing an open benchmark and international platform for Speech LLM research.

Expanded Training Data: Around 2,100 Hours Across 14 Languages

One of the most significant highlights of this year’s challenge is the dataset. The training set contains approximately 2,100 hours of multilingual conversational speech spanning 14 languages: English, French, German, Italian, Portuguese, Spanish, Japanese, Korean, Russian, Thai, Vietnamese, Tagalog, Urdu, and Turkish.

Among them, English contributes around 500 hours and includes diverse regional varieties such as US, UK, Australian, Indian, and Philippine English, while each of the other languages contributes roughly 100 hours. This expansion strengthens the challenge’s foundation for multilingual conversational speech research in terms of scale, language coverage, and regional diversity.

Natural Two-Speaker Conversations Collected in Realistic Settings

The dataset is designed to better reflect real application scenarios. All recordings are natural two-speaker conversations, where participants discuss randomly assigned topics in a meaningful and fluent way. The audio was collected in quiet indoor environments using consumer devices such as iPhones, making the data closer to real-world collection conditions.

The dataset also includes real-time timestamps and speaker labels to support system development. In addition, Track 1 and Track 2 share the same training set, encouraging participants to explore unified modeling approaches across recognition, diarization, and conversational understanding.
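
As a rough illustration of what timestamped, speaker-labeled annotations make possible, the sketch below models a two-speaker conversation as a list of segments and totals the speaking time per speaker. The `Segment` schema, its field names, and the sample dialogue are assumptions made for this example only; the announcement does not describe the actual annotation format.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One annotated stretch of speech (hypothetical schema, not the official format)."""
    speaker: str   # speaker label, e.g. "A" or "B"
    start: float   # segment start time in seconds
    end: float     # segment end time in seconds
    text: str      # transcript of the segment


def speaking_time(segments):
    """Sum the annotated speech duration per speaker, in seconds."""
    totals = defaultdict(float)
    for seg in segments:
        totals[seg.speaker] += seg.end - seg.start
    return dict(totals)


# Invented two-speaker exchange, for illustration only.
conversation = [
    Segment("A", 0.0, 2.5, "Have you traveled anywhere recently?"),
    Segment("B", 2.5, 6.0, "Yes, I went hiking last month."),
    Segment("A", 6.0, 7.0, "Nice."),
]

print(speaking_time(conversation))
```

A structure along these lines keeps the "who spoke when" information (speaker, start, end) next to the "what was said" information (text), which is the pairing a joint diarization and recognition system would need to produce.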

Two Core Tasks: From “Who Spoke” to “What Was Understood”

The challenge includes two main tasks.
Track 1: Multilingual Conversational Speech Diarization and Recognition
Track 2: Multilingual Conversational Speech Understanding

Unlike traditional speech benchmarks that focus primarily on transcription, the 2nd MLC-SLM Challenge places greater emphasis on multilingual, multi-speaker, and dialogue-level understanding. The evaluation setting does not provide prior information such as pre-segmented utterances or speaker labels, making the tasks closer to real deployment conditions.

Building on the International Impact of the First Edition

The new edition builds on the success of the inaugural MLC-SLM Challenge, which was held as a satellite event of Interspeech 2025. The first challenge attracted 78 teams from 13 countries and regions, generated 489 valid leaderboard submissions across two tracks, and received 14 high-quality technical reports. Its summary paper has also been accepted by ICASSP 2026, further demonstrating the challenge’s academic value and growing international visibility.

Registration for the 2nd MLC-SLM Challenge is now open

● March 30, 2026: Registration opens
● April 10, 2026: Training data release
● April 24, 2026: Development set and baseline system release
● June 15, 2026: Evaluation set release and leaderboard open
● June 25, 2026: Leaderboard freeze and paper submission portal opens (CMT system)
● July 10, 2026: Paper submission deadline
● July 20, 2026: Notification of acceptance
● October 2, 2026: Workshop date

By offering open data, realistic tasks, and an international exchange platform, the challenge aims to bring together more research teams to advance multilingual conversational speech language modeling. The launch of the second edition also provides a new benchmark for pushing speech language models from simply “hearing clearly” toward genuinely “understanding” conversations.

Registration Link: https://forms.gle/jfAZ95abGy4ZiNHo7
Official Website: https://www.nexdata.ai/competition/mlc-slm

Nexdata
MLC-SLM Competition Committee
mlc-slmw@nexdata.ai

