Welcome![Sign In][Sign Up]
Location:
Search - Speech Perception

Search list

[Speech/Voice recognition/combinetraditionalsp

Description: 语音信号的频域处理,语音虽然是一个时变、非平稳的随机过程。但在短时间内可近似看作是平稳的。因此如果能从带噪语音的短时谱中估计出“纯净”语音的短时谱,即可达到语音增强的目的。由于噪声也是随机过程,因此这种估计只能建立在统计模型基础上。利用人耳感知对语音频谱分量的相位不敏感的特性,这类语音增强算法主要针对短时谱的幅度估计。 -voice signals in the frequency domain processing, voice is a time-varying, nonstationary random process. But in a short period of time can be approximated as smooth. So if Noisy Speech from the short-term spectrum estimate "pure" voice of the short-term spectrum, and reached speech enhancement purposes. As the noise is random process, which can only be estimated based on statistical models based on. Use ear perception of voice spectrum component of the phase sensitive to the characteristics of such speech enhancement algorithms targeted at the rate of short-term spectral estimation.
Platform: | Size: 1158 | Author: 罗飞 | Hits:

[Speech/Voice recognition/combinetraditionalsp

Description: 语音信号的频域处理,语音虽然是一个时变、非平稳的随机过程。但在短时间内可近似看作是平稳的。因此如果能从带噪语音的短时谱中估计出“纯净”语音的短时谱,即可达到语音增强的目的。由于噪声也是随机过程,因此这种估计只能建立在统计模型基础上。利用人耳感知对语音频谱分量的相位不敏感的特性,这类语音增强算法主要针对短时谱的幅度估计。 -voice signals in the frequency domain processing, voice is a time-varying, nonstationary random process. But in a short period of time can be approximated as smooth. So if Noisy Speech from the short-term spectrum estimate "pure" voice of the short-term spectrum, and reached speech enhancement purposes. As the noise is random process, which can only be estimated based on statistical models based on. Use ear perception of voice spectrum component of the phase sensitive to the characteristics of such speech enhancement algorithms targeted at the rate of short-term spectral estimation.
Platform: | Size: 1024 | Author: 罗飞 | Hits:

[Speech/Voice recognition/combinenoise

Description: 语音噪声压缩新算法,一种基于vad的语音噪声抑制方法。这种新算法源于vad 技术及谱减法,它先对含噪声信号进 行vad处理,而后是基于感知掩蔽阈值的谱减法运算,降低噪声,获得清晰的增强信号。与其它算法相比,其效果更好,语 音听测试证明了这一点。-New algorithm for speech compression noise, a noise reduction of speech based vad method. This new algorithm derived from vad technology and spectral subtraction, it firstly deal with the noise signal vad, then the masking threshold is based on perception of spectral subtraction to reduce noise, increase access to clear signals. Compared with other algorithms, the better, speech test proves this point.
Platform: | Size: 101376 | Author: lorry | Hits:

[Speech/Voice recognition/combinecs_bp

Description: 基于压缩感知的语音信号的重构算法。使用的bp算法-Based on perception of speech signal compression algorithm of reconstruction. Using the bp algorithm
Platform: | Size: 1024 | Author: 刘bing | Hits:

[File FormatCS-and-its-applications

Description: 有关于压缩感知CS非常详细的PPT讲稿~~出自天津大学杨敬钰老师之手-On the perception of compressed speech PPT CS very detailed ~ ~ Yang from the hands of Tianjin University
Platform: | Size: 2207744 | Author: | Hits:

[Speech/Voice recognition/combinesci-speech-recognition

Description: 语音识别相关的sci论文,包括 Production and perception of speaker-specific phonetic detail Self-learning speaker identification for enhanced speech recognition Sources ofillusioninconsonantclusterperception Speech listening specifically modulates the excitability of tongue muscles-Speech systems work together in harmony Structure and function of auditory cortex Unilateral tongue angioedema caused by angiotensin-converting enzyme inhibitor
Platform: | Size: 2362368 | Author: | Hits:

[Algorithmspraal1

Description: An Automatic Gain Controller (AGC) for speech signals embedded in additive noise requires Voice Activity Detection (VAD) to avoid noise amplification, a peak level detector for computing gain, and a gain controller for adjusting gain. This paper describes a low computational-intensive software AGC for use in handheld devices. The AGC provides options for static and dynamic noise floor estimation in a VAD module. Further, this paper describes analog and digital gain adjustment with gain curve selection to allow for distance perception during the AGC operation.
Platform: | Size: 724992 | Author: azza | Hits:

[OtherPESQ

Description: 本文介绍了基于听觉感知模型 的PEQ客观音质 评价方法,它在通信网络的设计维护和运营中具有广泛的应用。-This paper introduces a model based on auditory perception PEQ objective speech quality evaluation methods in the design of the communication network in the maintenance and operation of a wide range of applications.
Platform: | Size: 190464 | Author: 小飞侠 | Hits:

[Delphi VCL1

Description: 语音信号的压缩感知及其重构 代码 dct-Perception of speech signal compression and reconstruction code dct
Platform: | Size: 1024 | Author: slf | Hits:

[AI-NN-PRSpeech-quality-evaluation-algorithm

Description: 一种低复杂度的基于听觉感知的高效心理声学模型语音质量客观评估算法,目前已经被ITU-T标准采纳为P.862标准, 对大家深入理解P.862标准有很大的帮助,在学术工作者和从事这方面的工作工程技术人员都很有用-An efficient auditory perception based psychoacoustic model objective speech quality assessment of low complexity algorithms, has now been adopted by ITU-T standard P.862 standard, P.862 standard for our in-depth understanding of a great help in academic Workers engaged in this work and engineering and technical staff are very useful! ! !
Platform: | Size: 4483072 | Author: 岑岑 | Hits:

[matlabdp

Description: 共振峰是反映声道谐振特性的重要特征,它代表了发音信息的最直接的来源,而且人在语音感知中利用了共振峰信息。 提取共振峰-Channel formant resonance characteristics reflect an important feature, which represents the most direct source of pronunciation information, and the human use of the formant information of speech perception. Formant extraction
Platform: | Size: 1024 | Author: 许仁君 | Hits:

[File Formattain

Description: 耳蜗实质上相当于一个滤波器组,耳蜗的滤波作用是在对数频率尺度上进行的,在1000HZ下,人耳的感知能力与频率成线性关系;而在1000HZ以上,人耳的感知能力与频率不构成线性关系,而更偏向于对数关系,这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-Cochlear substantially equivalent to a filter set, cochlear filter is used on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 1024 | Author: 朱健晨 | Hits:

[Technology Managementtese

Description: 耳蜗实质上相当于一个滤波器组,耳蜗的滤波作用是在对数频率尺度上进行的,在1000HZ下,人耳的感知能力与频率成线性关系;而在1000HZ以上,人耳的感知能力与频率不构成线性关系,而更偏向于对数关系,这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-Cochlear substantially equivalent to a filter set, cochlear filter is used on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 1024 | Author: 朱健晨 | Hits:

[OtherJudge2_mfcc

Description: 耳蜗实质上相当于一个滤波器组,耳蜗的滤波作用是在对数频率尺度上进行的,在1000HZ下,人耳的感知能力与频率成线性关系;而在1000HZ以上,人耳的感知能力与频率不构成线性关系,而更偏向于对数关系,这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 1024 | Author: 朱健晨 | Hits:

[Otherfenlie2_mian

Description: 耳蜗实质上相当于一个滤波器组,耳蜗的滤波作用是在对数频率尺度上进行的,在1000HZ下,人耳的感知能力与频率成线性关系;而在1000HZ以上,人耳的感知能力与频率不构成线性关系,而更偏向于对数关系,这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 1024 | Author: 朱健晨 | Hits:

[Othermelfb

Description: 耳蜗实质上相当于一个滤波器组,耳蜗的滤波作用是在对数频率尺度上进行的,在1000HZ下,人耳的感知能力与频率成线性关系;而在1000HZ以上,人耳的感知能力与频率不构成线性关系,而更偏向于对数关系,这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 2048 | Author: 朱健晨 | Hits:

[Process-Threadrcvoice

Description: 耳蜗实质上相当于一个滤波器组,耳蜗的滤波作用是在对数频率尺度上进行的,在1000HZ下,人耳的感知能力与频率成线性关系;而在1000HZ以上,人耳的感知能力与频率不构成线性关系,而更偏向于对数关系,这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 4096 | Author: 朱健晨 | Hits:

[Process-Threadlearn

Description: 耳蜗实质上相当于一个滤波器组,耳蜗的滤波作用是在对数频率尺度上进行的,在1000HZ下,人耳的感知能力与频率成线性关系;而在1000HZ以上,人耳的感知能力与频率不构成线性关系,而更偏向于对数关系,这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 156672 | Author: 朱健晨 | Hits:

[Embeded-SCM DevelopSTM32-Speech-Recognition-Master

Description: 于市售 STM32 开发板上实现特定人语音识别处理项目。识别流程是:预滤波、ADC、分帧、端点检测、预加重、加窗、特征提取、特征匹配。端点检测(VAD)采用短时幅度和短时过零率相结合。检测出有效语音后,根据人耳听觉感知特性,计算每帧语音的 Mel 频率倒谱系数(MFCC)。然后采用动态时间弯折(DTW)算法与特征模板相匹配,最终输出识别结果。先用Matlab对上述算法进行仿真,经数次试验求得算法内所需各系数的最优值。而后将算法移植到 STM32 开发板上,移植过程中根据 STM32 上存储空间相对较小、计算能力也相对较弱的实际情况,对算法进行优化。最终完成于 STM32 微处理器上的特定人语音识别系统。-Implement speech recognition processing project in commercially available STM32 development board. Identification is the process: pre-filter, ADC, framing, endpoint detection, pre-emphasis, windowing, feature extraction, feature matching. Endpoint detection (VAD) short-time amplitude and short-term zero rate combined. After detecting an effective voice, according to the characteristics of human auditory perception, calculated for each frame of speech Mel Frequency Cepstral Coefficients (MFCC). Then dynamic time warping (DTW) algorithm and feature template matches the final output recognition result. First with Matlab simulation algorithm described above, after several trials to get the optimal value of each coefficient within the desired algorithm. The algorithm will migrate to STM32 development board, the porting process according to the STM32 relatively small storage space, computing power is relatively weak situation of the optimization algorithm. Finally completed on the STM32 micr
Platform: | Size: 325632 | Author: Chenkly | Hits:

[OtherSpeech-Coders

Description: 全面讲述了数字音频压缩的方法,是数字音频压缩领域的红宝书。-This book covers the basics of speech production, perception, and digital signal analysis techniques. These serve as building blocks to understand the various speech coding methods and their particular implementations. The presentations assume no prior knowledge of speech processing and are designed to be accessible to anyone with a technical background.
Platform: | Size: 3860480 | Author: yuzhiqiang | Hits:
« 12 »

CodeBus www.codebus.net