Public access

Articles with public access mandates - Shinji WatanabeLearn more

Available somewhere: 96

[PDF] arxiv.org

Review

Phasebook and friends: Leveraging discrete representations for source separation

J Le Roux, G Wichern, S Watanabe, A Sarroff, JR Hershey

IEEE Journal of Selected Topics in Signal Processing 13 (2), 370-382, 2019

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

Espresso: A fast end-to-end neural speech recognition toolkit

Y Wang, T Chen, H Xu, S Ding, H Lv, Y Shao, N Peng, L Xie, S Watanabe, ...

2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019

Mandates: US Office of the Director of National Intelligence

[PDF] arxiv.org

Review

Espnet-slu: Advancing spoken language understanding through espnet

S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...

ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022

Mandates: US National Science Foundation

[PDF] merl.com

Review

Sequence summarizing neural network for speaker adaptation

K Veselý, S Watanabe, K Žmolíková, M Karafiát, L Burget, JH Černocký

2016 IEEE international conference on acoustics, speech and signal …, 2016

Mandates: US National Science Foundation

[PDF] um.edu.mt

Review

Findings of the IWSLT 2023 evaluation campaign

M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli, O Bojar, C Borg, ...

Association for Computational Linguistics, 2023

Mandates: US National Science Foundation, Science Foundation Ireland, European Commission

[PDF] arxiv.org

Review

Reproducing whisper-style training using an open-source toolkit and publicly available data

Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...

2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

A comparative study on non-autoregressive modelings for speech-to-text generation

Y Higuchi, N Chen, Y Fujita, H Inaguma, T Komatsu, J Lee, J Nozaki, ...

2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 47-54, 2021

Mandates: US National Science Foundation

[PDF] nsf.gov

Review

Prompting the hidden talent of web-scale speech models for zero-shot task generalization

P Peng, B Yan

International Speech Communication Association, 2023

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

Improving massively multilingual asr with auxiliary ctc objectives

W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe

ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

S3prl-vc: Open-source voice conversion framework with self-supervised speech representations

WC Huang, SW Yang, T Hayashi, HY Lee, S Watanabe, T Toda

ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022

Mandates: Japan Science and Technology Agency

[PDF] arxiv.org

Review

STFT-domain neural speech enhancement with very low algorithmic latency

ZQ Wang, G Wichern, S Watanabe, J Le Roux

IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 397-410, 2022

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study

X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...

ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

Speechlmscore: Evaluating speech generation using speech language model

S Maiti, Y Peng, T Saeki, S Watanabe

ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

End-to-end dereverberation, beamforming, and speech recognition with improved numerical stability and advanced frontend

W Zhang, C Boeddeker, S Watanabe, T Nakatani, M Delcroix, K Kinoshita, ...

ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021

Mandates: National Natural Science Foundation of China

[PDF] arxiv.org

Review

Joint acoustic and class inference for weakly supervised sound event detection

S Kothinti, K Imoto, D Chakrabarty, G Sell, S Watanabe, M Elhilali

ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019

Mandates: US Department of Defense, US National Institutes of Health

[PDF] arxiv.org

Review

EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers

S Maiti, Y Ueda, S Watanabe, C Zhang, M Yu, SX Zhang, Y Xu

2022 IEEE Spoken Language Technology Workshop (SLT), 480-487, 2023

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

BEATs-based audio captioning model with INSTRUCTOR embedding supervision and ChatGPT mix-up

SL Wu, X Chang, G Wichern, J Jung, F Germain, J Le Roux, S Watanabe

Proc. Conf. Detection Classification Acoust. Scenes Events, Challenge, 1-5, 2023

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models

K Deng, Z Yang, S Watanabe, Y Higuchi, G Cheng, P Zhang

ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022

Mandates: National Natural Science Foundation of China

[PDF] arxiv.org

Review

Towards low-distortion multi-channel speech enhancement: The ESPNET-SE submission to the L3DAS22 challenge

YJ Lu, S Cornell, X Chang, W Zhang, C Li, Z Ni, ZQ Wang, S Watanabe

ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022

Mandates: US National Science Foundation

[PDF] arxiv.org

Review

Acoustic modeling for overlapping speech recognition: JHU CHiME-5 challenge system

V Manohar, SJ Chen, Z Wang, Y Fujita, S Watanabe, S Khudanpur

ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019

Mandates: US National Science Foundation, US Office of the Director of National …

Publication and funding information is determined automatically by a computer program

UploadMandatesProvide linkUpdate linkFix link

Public access

zproxy.org