PDF下载 分享
[1]李彤岩,裴浩延,裴 燕,等.基于注意力机制和掩码学习的GAN语音增强算法[J].成都信息工程大学学报,2025,40(02):137-142.[doi:10.16836/j.cnki.jcuit.2025.02.003]
 LI Tongyan,PEI Haoyan,PEI Yan,et al.GAN Speech Enhancement Algorithm based on Attention Mechanism and Mask Learning[J].Journal of Chengdu University of Information Technology,2025,40(02):137-142.[doi:10.16836/j.cnki.jcuit.2025.02.003]
点击复制

基于注意力机制和掩码学习的GAN语音增强算法

参考文献/References:

[1] Pascual Santiago,Antonio Bonafonte,Joan Serrà.SEGAN:Speech Enhancement Generative Adversarial Network[C].Interspeech,2017.
[2] Bahdanau Dzmitry,Jan Chorowski,Dmitriy Serdyuk,et al.End-to-end attention-based large vocabulary speech recognition[C].IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP),2015:4945-4949.
[3] Watanabe,Shinji,Takaaki Hori,et al.Hybrid CTC/Attention Architecture for End-to-End Speech Recognition[J].IEEE Journal of Selected Topics in Signal Processing,2017:1240-1253.
[4] Wang Kai,He Bengbeng,Zhu Weiping.TSTNN:Two-Stage Transformer Based Neural Network for Speech Enhancement in the Time Domain[C].IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP),2021:7098-7102.
[5] Qiuqiang Kong,Xu Yong,Wang Wenwu,et al.Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2019(28):2450-2460.
[6] Cao Ru,Sherif Abdulatif,Bin Yang.CMGAN:Conformer-based Metric GAN for Speech Enhancement[J].ArXiv,2022.
[7] 张恩琪,顾广华,赵晨,等.生成对抗网络GAN的研究进展[J].计算机应用研究,2021,38(4):968-974.
[8] 庞源焜,张宇山.句子级状态下LSTM对谣言鉴别的研究[J].计算机应用研究,2022,39(4):1064-1070.
[9] Zhang Shiqing,Zhao Xiaoming,Tian Qingxi.Spontaneous Speech Emotion Recognition Using Multiscale Deep Convolutional LSTM[J].IEEE Transactions on Affective Computing,2022(13):680-688.
[10] 刘继明,孙成,袁野.基于训练模型改进的语音问句信息抽取方法[J].科学技术与工程,2021,21(18):7635-7641.
[11] Sultana Sadia M,Zafar Iqbal,Mohammad Reza Selim,et al.Bangla Speech Emotion Recognition and Cross-Lingual Study Using Deep CNN and BLSTM Networks[J].IEEE Access,2022(10):564-578.
[12] Shim,Kyuhong, Jungwook Choi,et al.Understanding the Role of Self Attention for Efficient Speech Recognition[C].International Conference on Learning Representations,2022.
[13] Su BoHao,ChiChun Lee.Unsupervised Cross-Corpus Speech Emotion Recognition Using a Multi-Source Cycle-GAN[C].IEEE Transactions on Affective Computing,2022.
[14] Wu Bowen,Liu Chaoran,Carlos Toshinori Ishi,et al.Modeling the Conditional Distribution of Co-Speech Upper Body Gesture Jointly Using Conditional-GAN and Unrolled-GAN[C].Electronics,2021.
[15] Ju Lin,Niu Sufeng,Adriaan J,et al.Improved Speech Enhancement Using a Time-Domain GAN with Mask Learning[C].Interspeech,2020.
[16] 杨海涛,王华朋,楚宪腾,等.基于卷积循环神经网络的语音逻辑攻击检测[J].科学技术与工程,2022,22(18):7937-7944.
[17] Su Jiaqi,Jin Zeyu,Adam Finkelstein.HiFi-GAN-2: Studio-Quality Speech Enhancement via Generative Adversarial Networks Conditioned on Acoustic Features[C]. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics(WASPAA),2021:166-170.
[18] Beck,Gustavo Teodoro Döhler,Ulme Wennberg,et al.Wavebender GAN:An Architecture for Phonetically Meaningful Speech Manipulation[C].IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP),2022:6187-6191.
[19] Kim Minsu,Joanna Hong,Yong Man Ro.Lip to Speech Synthesis with Visual Context Attentional GAN[C].Neural Information Processing Systems,2022.
[20] Rix A W,Beerends J.G,Hollier M P,et al.Perceptual evaluation ofspeech quality(PESQ)—A new method for speech quality assessmentof telephone networks and codecs[C].Proc of the 26th IEEE IntConf on Acoustics,Speech,and Signal Processing.Piscataway,NJ:IEEE,2001:749-752.

相似文献/References:

[1]蔡 良,夏秀渝,陆 雄,等.基于基音跟踪的语音增强研究[J].成都信息工程大学学报,2019,(01):1.[doi:10.16836/j.cnki.jcuit.2019.01.001]
 CAI Liang,XIA Xiuyu,LU Xiong,et al.Research on Speech Enhancement based on Pitch Tracking[J].Journal of Chengdu University of Information Technology,2019,(02):1.[doi:10.16836/j.cnki.jcuit.2019.01.001]

备注/Memo

收稿日期:2023-09-25
基金项目:四川省科技厅资助项目(2023YFS0422)
通信作者:李彤岩.E-mail:lty@cuit.edu.cn

更新日期/Last Update: 2025-04-30