北京生物医学工程

___________基于谱减法的变频电子喉语音增强方法对比研究_________

Contrastive study on the pitch enhancement of adjustable electrolarynx speech based on spectral subtraction technology

作者：李阳王立冯亦军牛海军

单位：           北京航空航天大学生物与医学工程学院（北京100191）    

关键词：电子喉；变频；谱减法；语音增强

分类号：           R318.04    

出版年·卷·期（页码）：2016·35·2（137-142）

摘要：

目的电子喉(electrolarynx, EL)是喉头切除患者最常用的言语发声辅具，但是现有电子喉发声存在辐射噪声大的缺陷，通过消除辐射噪声以改善语音质量成为诸多研究者的目标。谱减类方法是目前消除基频恒定电子喉语音辐射噪声最常用的方法，但是对于变频电子喉语音而言，该类方法的有效性尚不得而知。因此，本文对比研究了不同谱减法对变频电子喉语音的去噪效果。方法首先分别介绍了经典谱减法、多带谱减法、基于感知加权技术的改进谱减法以及基于加权函数的改进谱减法的原理，然后在安静的环境下，让1名经过训练能够熟练使用电子喉的受试者（男）使用变频电子喉朗读给定的20句日常汉语普通话语句并录制作为电子喉语音原始数据，分别采用上述谱减方法对变频电子喉语音进行去噪处理，并对去噪后语音进行主、客观评价，比较不同谱减方法的去噪效果。结果客观结果表明，经典谱减法虽然去除了部分辐射噪声，但增强后语音仍然存在明显噪声。多带谱减法、基于感知加权技术的改进谱减法以及基于加权函数的改进谱减法均有效减少了辐射噪声，尤其是语音间隔段。然而，前两种方法增强后的语音在语音段仍存在少量噪声，而后一种方法增强后的语音在高频段有少量语音损失。主观结果表明，与原始电子喉语音相比，不同谱减法增强后的语音的可接受度有所提高，其中，基于加权函数的改进谱减法可接受度得分最高，而可懂度变化不大。结论对于选定的变频电子喉语音，基于谱减法的语音增强方法可以有效减少变频电子喉辐射噪声，提高语音的可接受度，改善语音的听觉质量，其中，基于加权函数的改进谱减法的去噪效果最好，但对于语音可懂度的影响不大。

Objective Electrolarynx (EL) is the most common device to provide a voice for laryngectomees. However, the EL phonation has a serious drawback: the radiated noise. To improve the speech quality by eliminating the radiated noise becomes a goal for researchers. Spectral subtraction method is the most common way to eliminate the radiated noise of EL whose pitch is constant. However, for the pitch adjustable EL speech, the effectivity of this method is not known. Thus, this paper conducts a study on the noise reduction effects of different spectral subtraction methods for pitch adjustable EL speech. Methods The classical spectral subtraction, the multi-band spectral subtraction, the perceptual weighting spectral subtraction and the weighting function spectral subtraction were firstly introduced, respectively. Then, a male native speaker of Mandarin Chinese who had been trained to be familiar with using pitch-controlled EL was instructed to read twenty daily mandarin sentences in a soundproof room, which was recorded as the raw data of the EL speech. Furthermore, the methods above were used to remove the radiated noise of EL speech, and subjective and objective methods were used to evaluate and compare the enhanced speech. Results Objective results showed that the enhanced EL speech by classical spectral subtraction still remained noise obviously though a part of radiated noise was reduced. The other three methods including multi-band spectral subtraction, the perceptual weighting spectral subtraction and the weighting function spectral subtraction reduced the noise effectively, especially in speech-pause section. Nevertheless the enhanced EL speeches of the former two methods still had a little noise in speech section. Meanwhile, certain high frequency speech components of the normal speech might be eliminated by the weighting function spectral subtraction. Subjective results indicated that compared with the original EL speech, the acceptability of the enhanced speeches by different spectral subtraction methods increased and the weighting function spectral subtraction method obtained the highest score. Yet the intelligibility of the enhanced speeches had little change. Conclusions According to the selected EL speech whose pitch is adjustable, the enhancement methods based on spectral subtraction can effectively reduce the radiated noise of EL speech and improve the acceptability and speech quality, among which the weighting function spectral subtraction method obtains the best performance. However, these methods have little influence on speech intelligibility.

参考文献：

服务与反馈：

【文章下载】【加入收藏】

提示：您还未登录，请登录！点此登录