MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder
Journal
2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings
Pages
1245-1250
Date Issued
2021
Author(s)
Abstract
For speech-related applications in Internet of things environments, identifying effective methods to handle interference noises and compress the amount of data in transmissions is essential for achieving high-quality services. In this paper, we propose a novel multi-input multi-output speech compression and enhancement (MIMO-SCE) system based on a convolutional denoising autoencoder (CDAE) model to simultaneously improve speech quality and reduce the dimension of transmission data. Compared with conventional single-channel and multiinput single-output systems, MIMO systems can be employed for applications where multiple acoustic signals need to be handled. We investigated two CDAE models, fully convolutional network (FCN) and Sinc FCN, as the core models in MIMO systems. The experimental results confirm that the proposed MIMO-SCE framework effectively improves speech quality and intelligibility, and reduces the amount of recording data to one-seventh for transmission. © 2021 APSIPA.
Other Subjects
Convolution; Convolutional codes; Learning systems; Speech communication; Speech intelligibility; Speech transmission; Auto encoders; Convolutional networks; De-noising; High quality service; Interference noise; Multi-input multi-output; Multi-input-single-output systems; Single channels; Speech compression; Speech quality; MIMO systems
Type
conference paper
