Vyzkoušejte nový nástroj s podporou AI
Summon Research Assistant
BETA
Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis
Kai Yu, Young, Steve
Published in IEEE transactions on audio, speech, and language processing (01.07.2011)
Published in IEEE transactions on audio, speech, and language processing (01.07.2011)
Get full text
Journal Article
Identifying Cover Songs Using Information-Theoretic Measures of Similarity
Foster, Peter, Dixon, Simon, Klapuri, Anssi
Published in IEEE/ACM transactions on audio, speech, and language processing (01.06.2015)
Published in IEEE/ACM transactions on audio, speech, and language processing (01.06.2015)
Get full text
Journal Article
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Chen, Sanyuan, Wang, Chengyi, Wu, Yu, Zhang, Ziqiang, Zhou, Long, Liu, Shujie, Chen, Zhuo, Liu, Yanqing, Wang, Huaming, Li, Jinyu, He, Lei, Zhao, Sheng, Wei, Furu
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Get full text
Journal Article
Real-time speaker identification and verification
Kinnunen, T., Karpov, E., Franti, P.
Published in IEEE transactions on audio, speech, and language processing (01.01.2006)
Published in IEEE transactions on audio, speech, and language processing (01.01.2006)
Get full text
Journal Article
Explainable DNN-Based Beamformer With Postfilter
Cohen, Adi, Wong, Daniel, Lee, Jung-Suk, Gannot, Sharon
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Get full text
Journal Article
Estimation of Physiological Vocal Features From Neck Surface Acceleration Signals Using Probabilistic Bayesian Neural Networks
Sepulveda, Joaquin, Parra, Jesus A., Ibarra, Emiro J., Araya, Mauricio, Cuadra, Patricio De La, Zanartu, Matias
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Get full text
Journal Article
Open-Vocabulary Sound Event Localization and Detection With Joint Learning of CLAP Embedding and Activity-Coupled Cartesian DOA Vector
Shimada, Kazuki, Uchida, Kengo, Koyama, Yuichiro, Shibuya, Takashi, Takahashi, Shusuke, Mitsufuji, Yuki, Kawahara, Tatsuya
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Get full text
Journal Article
Pretraining and Fine-Tuning Techniques for Electrolaryngeal Speech Enhancement Based on Sequence-to-Sequence Voice Conversion
Ma, Ding, Violeta, Lester Phillip, Kobayashi, Kazuhiro, Toda, Tomoki
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Get full text
Journal Article
Blind Localization of Early Room Reflections Based on Microphone Arrays and Reverberant Speech
Hadadi, Yogev, Beit-On, Hanan, Tourbabin, Vladimir, Ben-Hur, Zamir, Alon, David Lou, Rafaely, Boaz
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Get full text
Journal Article
Noise and Reverberation-Controllable Voice Conversion
Choi, Yeonjong, Xie, Chao, Toda, Tomoki
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Published in IEEE Transactions on Audio, Speech and Language Processing (2025)
Get full text
Journal Article