Date
Friday November 8, 2024 from 3:30 PM to 4:30 PMLocation
Neuron 0.352Organizer
Industrial DesignCo-organizer
Eindhoven Artificial Intelligence Systems InstitutePrice
free
On November 8th, Emilia Barakova will host professor Sheng Li.
Title: Reinventing ASR: Multilingual, Security-Aware, Healthcare-Driven, and Beyond
Abstract:
Automatic speech recognition (ASR) transforms spoken audio into word, subword, or character sequences, serving as one of the most intuitive human-machine interfaces. It plays a vital role in complex tasks like speech-to-speech translation and robotic dialogue. With advances in deep neural networks, particularly large self-attention models, ASR accuracy has seen substantial gains. Despite this progress, ASR is far from a solved problem. Developers continue to face significant challenges, particularly in supporting low-resourced languages. Additionally, widespread adversarial attacks and data security concerns pose serious obstacles for real-world applications.
This talk will explore our research efforts to address these challenges, focusing on low-resourced multilingual modeling, enhancing security, and expanding ASR's capabilities to critical areas such as disordered speech, Alzheimer's detection, and beyond traditional language applications.
Bio:
Sheng LI received his BS and ME degrees in 2006 and 2009 from Nanjing University, Nanjing, China, and his Ph.D. from Kyoto University, Kyoto, Japan, in 2016. From 2009 to 2012, he worked at the joint lab of the Chinese University Hong Kong and Shenzhen City, researching speech technology-assisted language learning. From 2016 to 2017, he worked as a researcher at Kyoto University, studying speech recognition systems for humanoid robots. In 2017, he joined the National Institute of Information and Communications Technology, Kyoto, Japan, as a researcher working on speech recognition. He served as workshop co-organizer and chair in interspeech2020, coling2022, odyssey2022, ACM Multimedia Asia2023/2024, and ICASSP2024. He is a member of the Acoustic Society of Japan (ASJ), the International Speech Communication Association (ISCA), and IEEE. He is now a member of the Speech, Language, and Audio (SLA) Technical Committee for APSIPA.
Industrial Design
At the department of Industrial Design we design products and services that enable us to make optimal use of our environment and interact with it.