Multi-task learning and weighted cross-entropy for DNN-based keyword spotting S Panchapagesan, M Sun, A Khare, S Matsoukas, A Mandal, ... | 179 | 2016 |
Using system command utterances to generate a speaker profile V Krishnamoorthy, S Srinivasan, S Matsoukas, A Khare, A Mandal, ... US Patent 10,490,195, 2019 | 65 | 2019 |
Self-supervised learning with cross-modal transformers for emotion recognition A Khare, S Parthasarathy, S Sundaram 2021 IEEE Spoken Language Technology Workshop (SLT), 381-388, 2021 | 51 | 2021 |
Multiresolution and multimodal speech recognition with transformers G Paraskevopoulos, S Parthasarathy, A Khare, S Sundaram arXiv preprint arXiv:2004.14840, 2020 | 46 | 2020 |
Keyword spotting using multi-task configuration S Panchapagesan, B Hoffmeister, A Mandal, A Khare, SNP Vitaladevuni, ... US Patent 10,304,440, 2019 | 43 | 2019 |
Speech based user recognition S Matsoukas, A Khare, V Krishnamoorthy, S Somashekar, A Mandal US Patent 10,522,134, 2019 | 23 | 2019 |
ASR-aware end-to-end neural diarization A Khare, E Han, Y Yang, A Stolcke ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 22 | 2022 |
Multi-modal embeddings using multi-task learning for emotion recognition A Khare, S Parthasarathy, S Sundaram Interspeech 2020, 384-388, 2020 | 21 | 2020 |
Automatic collection of speaker name pronunciations A Khare, N Agrawal, SS Kajarekar, M Paulik US Patent 9,240,181, 2016 | 9 | 2016 |
Method and apparatus for discovering and labeling speakers in a large and growing collection of videos with minimal user effort S Kajarekar, A Sankar, S Gannu, A Khare US Patent App. 13/312,800, 2013 | 9 | 2013 |
Voice profile updating S Srinivasan, A Mandal, K Subramanian, S Matsoukas, A Khare, ... US Patent 11,004,454, 2021 | 8 | 2021 |
Audiovisual highlight detection in videos K Mundnich, A Fenster, A Khare, S Sundaram ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 5 | 2021 |
Voice profile updating S Srinivasan, A Mandal, K Subramanian, S Matsoukas, A Khare, ... US Patent 11,200,884, 2021 | 4 | 2021 |
Turn-taking and backchannel prediction with acoustic and large language model fusion J Wang, L Chen, A Khare, A Raju, P Dheram, D He, M Wu, A Stolcke, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Speech based user recognition S Matsoukas, A Khare, V Krishnamoorthy, S Somashekar, A Mandal US Patent 11,270,685, 2022 | 3 | 2022 |
Multi-channel acoustic modeling using mixed bitrate Opus compression A Khare, S Sundaram, M Wu arXiv preprint arXiv:2002.00122, 2020 | 3 | 2020 |
Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning S Wager, A Khare, M Wu, K Kumatani, S Sundaram ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 2 | 2020 |
Speech based user recognition SS Kopuri, J Moore, S Srinivasan, A Khare, A Mandal, S Matsoukas, ... US Patent 11,893,999, 2024 | 1 | 2024 |
Multi-stage multi-modal pre-training for automatic speech recognition Y Jain, D Chan, P Dheram, A Khare, O Shonibare, V Ravichandran, ... arXiv preprint arXiv:2403.19822, 2024 | | 2024 |
Two-Pass Endpoint Detection for Speech Recognition A Raju, A Khare, D He, I Sklyar, L Chen, S Alptekin, VA Trinh, Z Zhang, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | | 2023 |