登录

双语推荐:语音增强

语音信号中,噪声的存在极大地影响了语音质量,必须采用语音增强技术在一定程度上抑制背景噪声,最大限度地改善语音通信质量,提高语音的可懂度。语音增强就是从带噪语音的信号中提取尽可能纯净的原始语音信号。谱减法由于其计算简单,实时性好,所以得到了广泛应用。文中介绍了基于谱减法的语音增强算法的基本原理,并针对传统谱减法残余"音乐噪声"过强的缺点,分析了两种典型的改进谱减法的原理。实验结果表明改进算法更好地抑制了音乐噪声,有效提高了语音增强效果,改善了语音质量。
For the speech signal,voice quality is greatly affected by the noise. Speech enhancement techniques must be used to suppress background noise,improving voice quality and voice clear degrees. The speech enhancement is to get the speech as original as possible from the speech signal with noise. The spectral subtraction speech enhancement is utilized broadly because it is simple and easy for the re-al-time processing. In this paper,the basic principle of algorithm of the spectral subtraction is presented. Aiming at reducing annoying musical noise in the standard spectral subtraction algorithm,the principle of two kinds of typical improved spectral subtraction is ana-lyzed. Experimental results show that the proposed algorithm reduces the musical noise effectively and improves the speech quality.

[ 可能符合您检索需要的词汇 ]

广义Gamma模型是近年来新提出的一种语音分布模型,相对于传统的高斯或超高斯模型具有更好的普适性和灵活性,提出一种基于广义Gamma语音模型和语音存在概率修正的语音增强算法。在假设语音和噪声的幅度谱系数分别服从广义Gamma分布和Gaussian分布的基础上,推导了语音信号对数谱的最小均方误差估计式;在该模型下进一步推导了语音存在概率,对最小均方误差估计进行修正。仿真结果表明,与传统的短时谱估计算法相比,该算法不仅能够进一步提高增强语音的信噪比,而且可以有效减小增强语音的失真度,提高增强语音的主观感知质量。
This paper presents a modified speech enhancement algorithm under signal presence probability. Generalized Gamma distribution priors are assumed for speech short-time spectral amplitudes, which is more flexible in capturing the statistical behavior of speech signals. It derives a Minimum Mean-Square Error(MMSE)estimator of the log-spectra am-plitude for speech signals, under the assumption of a generalized Gamma speech priors and additive Gaussian noise priors. Furthermore, modification under signal presence probability is obtained, which is estimated for each frequency bin and each frame consistent with the new model. The simulation results show that the proposed algorithm achieves better noise suppression and lower speech distortion compared to the conventional short-time spectral amplitude estimators, which are based on Gaussian and super-Gaussian speech model.
为了使得MELP声码器在高噪声环境下仍然获得较好的语音效果,需对含噪声语音进行语音增强。本文采用谱减法和独立分量分析相结合方法,对语音进行增强。该方法可以在不增加语音采样硬件的条件下,满足独立分量分析中观测信号的数目不少于源信号数目的约束条件。结果表明,该方法能较好的分离出噪声和语音信号,增强输入到MELP声码器中的语音信号,提高MELP声码器在高噪声环境下应用的语音效果。
To get better hearing performance in strong noise environment,MELP vocoder should have the ability to reduce the noise mixed into the speech. The method combining spectral subtraction technique with independent component analysis is employed for noise reduction in this paper. The restructured speech and the original noised speech act as the observed signals to fulfil the constraint condition of ICA that is the number of observed signals must be greater than or equal to source signals.Then,the ICA algorithm is applied to get purer speech from original noised speech. The presented method can obey the constraints for application of ICA,without increasing the complexity of data acquisition circuit.Simulation results show that good effect of noise reduction is achieved,and the presented method can be applied to MELP vocoder used in strong noise environment to improve the speech quality without additional hardware.

[ 可能符合您检索需要的词汇 ]

通过介绍语音增强的特点,详细分析了最小均方误差对数谱幅度估计(MMSE-LSA)算法,并提出了与MMSELSA算法相匹配的语音激活检测(VAD)算法。该方案计算简单、易于实现且语音增强效果好,能够动态地跟踪背景噪声的变化。最后通过仿真分析,比较了MMSE-LSA与其它几种语音增强算法的增强效果。
The minimum mean square error of log-spectral amplitude estimator (MMSE-LSA) algorithm is analyzed in detail by introducing the characteristics of speech enhancement, and voice activity detection (VAD) algorithm matching with MMSE-LSA algorithm is proposed. This scheme is simple and easy to implement and its speech enhancement effect is good. In addition, it can track the changes of background noise dynamically. Finally, the enhancement effect of MMSE-LSA is compared with that of other algorithms by the analysis of simulation.

[ 可能符合您检索需要的词汇 ]

针对固定阈值小波包语音增强算法在去噪时会损失语音信号的问题,文中提出了一种新的自适应阈值小波包语音增强算法。该算法先利用带噪语音的小波包变换系数估计出后验信噪比,再由含有后验信噪比因子的sigmoid函数作为平滑因子对随尺度变化的阈值进行相邻帧的平滑,最后由后验信噪比自适应修正平滑阈值,减少语音失真;仿真实验结果表明,该算法在去噪的同时减少了语音信号的损失,有效地提高了增强语音的信噪比和分段信噪比,较固定阈值小波包语音增强算法具有明显的优越性。
Aiming at the problem of the fixed threshold wavelet packet speech enhancement algorithm loses the speech signal in denoising, this paper proposed a new adaptive threshold of wavelet packet for speech enhancement algorithm. Firstly, the algo-rithm uses the wavelet packet coefficients of noisy speech to estimate the posteriori SNR. Secondly, the sigmoid function con-tains a posteriori the signal-to-noise ratio (SNR) factor as a smooth factor smoothes adjacent frame threshold with scale chang-ing. Finally, the posterior SNR adaptively adjusts the smoothed threshold, which reduced speech distortion. Simulation experi-mental results show that this algorithm in denoising reduces the loss of the speech signal, effectively improves the SNR and SS-NR of the enhanced speech, and has obvious superiority than a fixed threshold wavelet packet speech enhancement algorithm.

[ 可能符合您检索需要的词汇 ]

随着语音处理技术研究的不断深入和语音处理应用要求的不断提高,语音处理系统需要按实时方式工作。该文采用TMS320VC5416 DSP和TLV320AIC23 Codec组成实时语音增强系统,实现了语音信号的增强处理,并对设计中所涉及到的关键技术和解决方案进行了详细分析和说明。
With further study of speech processing technology and speech processing applications continue to improve , speech processing systems need to work in real time mode .This paper adopts TMS320VC5416 DSP and TLV320AIC23 Codec to form a real time speech enhancement system , enhan-cing the processing of speech signals , and the key technologies involved in the design and solutions were analyzed and explained .

[ 可能符合您检索需要的词汇 ]

提出一种具有较高可懂度的基于维纳滤波的语音增强算法。相比于其他语音增强算法,维纳滤波法可以明显提高语音质量且含有较少的音乐噪声,但是它和其他现有语音增强算法一样,都无法有效提高语音可懂度。因为维纳滤波法和其他现有算法都过多注重噪声减少,却忽略了SNR(信噪比)的估计误差和不同的语音幅度谱畸变对可懂度有更重要的影响。为改进这些缺点,此研究依据于先验SNR和增益函数来判定SNR估计误差和语音畸变区域,然后对先验SNR小于-10 d B区域的增益函数进行修正,以及幅度谱畸变大于6.02 d B区域语音进行限制。实验证明,该算法能有效提升增强语音可懂度NCM(归一化协方差方法)的评测值。
We propose a Wiener filtering-based speech enhancement algorithm with higher intelligibility.Compared with other speech en-hancement algorithms, the Wiener filtering can significantly improve the quality of speech but contains less musical noise, however the same as them, it is unable to effectively improve speech intelligibility either.It is because that the Wiener filtering and other existing algorithms pay too more attention to noise reduction while ignoring that the SNR ( signal-to-noise ratio) estimation errors and the different speech magnitude spectrum distortions have a greater influence on speech intelligibility.In order to make up these shortages, in our study we determine the SNR estimation errors and the speech distortions area relying on priori SNR and gain function, and then modify the gain functions in the areas where the priori SNR is less than -10 dB, and constrain the speech in the areas where the distortion of amplification spectrum is beyond 6.02 dB.Experi

[ 可能符合您检索需要的词汇 ]

研究表明超高斯分布更加贴近语音信号的实际分布,然而语音信号很难用单一的概率密度函数准确描述,针对这一情况,提出了一种用超高斯混合模型对语音信号幅度谱建模的新方法,并推导了基于此模型的幅度谱最小均方误差估的估计式。仿真结果表明:与传统的短时谱估计算法相比,该算法不仅能够进一步提高增强语音的信噪比,而且可以有效减小增强语音的失真度,提高增强语音的主观感知质量。
Recent research indicates that the speech spectral amplitude distributions could be fairly described with super-Gaussian probability density function. However, the complexity of speech signal determines that the distribution statistics of speech signal could not be well described by single simple function. Thus a super-Gaussian mixture model for speech spectral amplitude is proposed, and with this model, a minimum mean-square error (MMSE) estimator for speech signals spectral amplitude is derived. The simulation results show that this algorithm based on Gaussian and super-Gaussian speech model could achieve better noise suppression and lower speech distortion as compared with the conventional short-time spectral amplitude estimation algorithm.

[ 可能符合您检索需要的词汇 ]

提出一种可适应非平稳噪声环境的基于码本学习的改进谱减语音增强算法。该算法分为训练阶段和增强阶段。训练阶段,使用自回归模型对语音和噪声的频谱形状进行建模并构造语音和噪声码本;增强阶段,采用对数谱最小化算法估计出语音和噪声的频谱,通过谱相减消除噪声。算法在每个时间帧估计语音和噪声频谱,即使在语音存在时仍能够有效跟踪快速变化的非平稳噪声;采用自回归模型能得到噪声频谱的平滑估计,减少了音乐噪声。实验仿真表明,相比于传统谱减法和多带谱减法,改进的谱减法具有更好的噪声抑制性能并且语音失真更小。
An improved spectral subtraction algorithm based on codebook learning for speech enhancement in non-stationary noise conditions is proposed. The proposed algorithm contains two steps. The priori information about the spectrum of speech and noise is modeled using autoregressive model and the speech and noise codebooks are constructed. The speech and noise are estimated in each time frame by solving a log-spectral distortion minimization problem. The proposed algorithm can adapt to varying levels of noise even while speech is present. On the other hand, autoregressive modeling results in smooth frequency spectrums and thus reduces musical noise. Experimental results show that the proposed algorithm outperforms the traditional spectral subtraction algorithm and multiband spectral subtraction algorithm.

[ 可能符合您检索需要的词汇 ]

针对频域受限子空间语音增强在构造增强矩阵时,采用固定拉格朗日乘子,使得减小语音畸变和提高语音可懂度的过程中,有音乐噪声残留,提出一种变拉格朗日乘子的算法。利用听觉特性中较强的频率成分对噪声进行掩蔽,通过掩蔽阈值的频率域与子空间特征值之间的变换算法,用变量控制子空间拉格朗日乘子计算增益函数的对角矩阵。对比实验和试听结果表明,提出算法增强语音信号不仅信噪比有较大提高,语音质量主观感知度也有明显改善。
Frequency domain subspace speech enhancement using fixed Lagrange multipliers to construct enhanced matrix, it can reduce the speech distortion and improve the speech intelligibility, but the major drawback is the residual musical noise. In the auditory masking, noise is masked by the strong frequency component near it in frequency domain. With the relationship between the frequency domain and the Eigen domain of signal subspace, the diagonal matrix of the gain function is calculated by using a variable Lagrange multiplier. Both comparative experiments and audition results show that the proposed algorithm can not only enhance the signal-to-noise ratio greatly, but also improve the subjective perception of voice quality significantly.

[ 可能符合您检索需要的词汇 ]