Articles with public access mandates - Shinji WatanabeLearn more
Available somewhere: 96
Phasebook and friends: Leveraging discrete representations for source separation
J Le Roux, G Wichern, S Watanabe, A Sarroff, JR Hershey
IEEE Journal of Selected Topics in Signal Processing 13 (2), 370-382, 2019
Mandates: US National Science Foundation
Espresso: A fast end-to-end neural speech recognition toolkit
Y Wang, T Chen, H Xu, S Ding, H Lv, Y Shao, N Peng, L Xie, S Watanabe, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Mandates: US Office of the Director of National Intelligence
Espnet-slu: Advancing spoken language understanding through espnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandates: US National Science Foundation
Sequence summarizing neural network for speaker adaptation
K Veselý, S Watanabe, K Žmolíková, M Karafiát, L Burget, JH Černocký
2016 IEEE international conference on acoustics, speech and signal …, 2016
Mandates: US National Science Foundation
Findings of the IWSLT 2023 evaluation campaign
M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli, O Bojar, C Borg, ...
Association for Computational Linguistics, 2023
Mandates: US National Science Foundation, Science Foundation Ireland, European Commission
Reproducing whisper-style training using an open-source toolkit and publicly available data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Mandates: US National Science Foundation
A comparative study on non-autoregressive modelings for speech-to-text generation
Y Higuchi, N Chen, Y Fujita, H Inaguma, T Komatsu, J Lee, J Nozaki, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 47-54, 2021
Mandates: US National Science Foundation
Prompting the hidden talent of web-scale speech models for zero-shot task generalization
P Peng, B Yan
International Speech Communication Association, 2023
Mandates: US National Science Foundation
Improving massively multilingual asr with auxiliary ctc objectives
W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandates: US National Science Foundation
S3prl-vc: Open-source voice conversion framework with self-supervised speech representations
WC Huang, SW Yang, T Hayashi, HY Lee, S Watanabe, T Toda
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandates: Japan Science and Technology Agency
STFT-domain neural speech enhancement with very low algorithmic latency
ZQ Wang, G Wichern, S Watanabe, J Le Roux
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 397-410, 2022
Mandates: US National Science Foundation
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study
X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Mandates: US National Science Foundation
Speechlmscore: Evaluating speech generation using speech language model
S Maiti, Y Peng, T Saeki, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandates: US National Science Foundation
End-to-end dereverberation, beamforming, and speech recognition with improved numerical stability and advanced frontend
W Zhang, C Boeddeker, S Watanabe, T Nakatani, M Delcroix, K Kinoshita, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Mandates: National Natural Science Foundation of China
Joint acoustic and class inference for weakly supervised sound event detection
S Kothinti, K Imoto, D Chakrabarty, G Sell, S Watanabe, M Elhilali
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Mandates: US Department of Defense, US National Institutes of Health
EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers
S Maiti, Y Ueda, S Watanabe, C Zhang, M Yu, SX Zhang, Y Xu
2022 IEEE Spoken Language Technology Workshop (SLT), 480-487, 2023
Mandates: US National Science Foundation
BEATs-based audio captioning model with INSTRUCTOR embedding supervision and ChatGPT mix-up
SL Wu, X Chang, G Wichern, J Jung, F Germain, J Le Roux, S Watanabe
Proc. Conf. Detection Classification Acoust. Scenes Events, Challenge, 1-5, 2023
Mandates: US National Science Foundation
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
K Deng, Z Yang, S Watanabe, Y Higuchi, G Cheng, P Zhang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandates: National Natural Science Foundation of China
Towards low-distortion multi-channel speech enhancement: The ESPNET-SE submission to the L3DAS22 challenge
YJ Lu, S Cornell, X Chang, W Zhang, C Li, Z Ni, ZQ Wang, S Watanabe
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandates: US National Science Foundation
Acoustic modeling for overlapping speech recognition: JHU CHiME-5 challenge system
V Manohar, SJ Chen, Z Wang, Y Fujita, S Watanabe, S Khudanpur
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Mandates: US National Science Foundation, US Office of the Director of National …
Publication and funding information is determined automatically by a computer program