Bold ideas, rigorous research, and impactful publications: Sharing our discoveries with the global research community through international conferences and journals.
Metilda Sagaya Mary N J, S Umesh
INX-SpeakerHub: A 2000-Hour Indian Multilingual Speaker Verification Corpus, 2024 IEEE Spoken Language Technology Workshop (SLT), Macao, China, December 2024.
Metilda Sagaya Mary N J, S Umesh
Lite ASR Transformer: A Lightweight Transformer Architecture for Automatic Speech Recognition, 2024 IEEE Spoken Language Technology Workshop (SLT), Macao, China, December 2024.
Hamees Sayed, Advait Joglekar, Srinivasan Umesh
SPRING Lab IITM’s Submission to Low Resource Indic Language Translation Shared Task, Proc. of the Ninth Conference on Machine Translation, Miami, Florida, USA, November 2024, 10.18653/v1/2024.wmt-1.68.
Vasista Sai Lodagala, Abhishek Biswas, Shoutrik Das, Jordan F, S Umesh
“All Ears: Building Self-Supervised Learning based ASR models for Indian Languages at scale”, Proc. of InterSpeech 2024, Kos Island, Greece, September 2024.
Seth, A., Ghosh, S., Umesh, S., & Manocha, D.
“FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continued Self-Supervised Learning”, Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, April 2024, pp. 12572-12576.
Seth, A., Ghosh, S., Umesh, S., & Manocha, D.
“Stable Distillation: Regularizing Continued Pre-Training for Low-Resource Automatic Speech Recognition”, Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, April 2024, pp. 10821-10825.
Ramanan Sivaguru, Vasista Sai Lodagala, S Umesh
“SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis”, Proc. of InterSpeech 2023, August 2023, Dublin, Ireland, pp. 3033-3037, 10.21437/Interspeech.2023-2574.
K Jayakumar, VN Sukhadia, A Arunkumar, S Umesh
“The Tag-Team Approach: Leveraging CLS and Language Tagging for Enhancing Multilingual ASR”, Proc. of InterSpeech 2023, August 2023, Dublin, Ireland, 10.21437/Interspeech.2023-2406.
Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, KV Vikram, Metilda Sagaya Mary, Mohammad Wajahat, Mudit Batra, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, S Umesh, Rajeev Sangal
“Technology pipeline for large scale cross-lingual dubbing of lecture videos into multiple Indian languages”, Proc. of InterSpeech 2023, August 2023, Dublin, Ireland, pp. 3683-3684.
Seth, A., Ghosh, S., Umesh, S., & Manocha, D.
“Unfused: Unsupervised Finetuning Using Self Supervised Distillation”, Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), Rhodes Island, Greece, June 2023, pp. 1-5.