Description: Reads a WAVE file and extracts the short-time average energy, zero-crossing rate, short-time average magnitude, and the maximum and minimum value of each audio frame. Platform: |
Size: 61440 |
Author:daisiguang |
Hits:
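The per-frame features named in the entry above can be sketched in a few lines of Python. This is a minimal illustration, not the code from the download: the function name `frame_features`, the default frame length and hop size, and the sign-change definition of a zero crossing are all assumptions made for the example.

```python
def frame_features(samples, frame_len=4, hop=4):
    """Per-frame short-time features for a list of numeric samples.

    frame_len (must be >= 2) and hop are illustrative defaults,
    not values taken from the listing.
    """
    features = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        # Short-time average energy: mean of the squared samples.
        energy = sum(x * x for x in frame) / frame_len
        # Short-time average magnitude: mean of the absolute values.
        magnitude = sum(abs(x) for x in frame) / frame_len
        # Zero-crossing rate: fraction of adjacent sample pairs whose
        # signs differ (zero is treated as non-negative here).
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
        )
        zcr = crossings / (frame_len - 1)
        features.append({
            "energy": energy,
            "magnitude": magnitude,
            "zcr": zcr,
            "max": max(frame),
            "min": min(frame),
        })
    return features
```

In practice the samples would first be decoded from the WAVE file, for example with Python's standard-library `wave` module, and the frame length would be chosen to cover roughly 10 to 30 ms of audio.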
Description: Speech feature extraction, including zero-crossing rate, energy ratio, and short-time energy. A WAV audio file must be imported. Platform: |
Size: 1024 |
Author:李 |
Hits:
Description: Reads a speech file, splits it into frames, applies a window, and computes the short-time energy and zero-crossing rate. Useful for developing and studying query-by-humming music retrieval systems. Platform: |
Size: 36864 |
Author:wdn |
Hits:
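The framing and windowing step mentioned in the entry above might look like the following Python sketch. The entry does not say which window is used, so the Hamming window here is an assumption, as are the function names `hamming` and `windowed_frames` and the choice to drop a trailing partial frame.

```python
import math


def hamming(n_points):
    """Hamming window coefficients (assumed window; the listing names none)."""
    if n_points == 1:
        return [1.0]
    return [
        0.54 - 0.46 * math.cos(2 * math.pi * n / (n_points - 1))
        for n in range(n_points)
    ]


def windowed_frames(samples, frame_len, hop):
    """Split samples into overlapping frames and weight each by the window.

    A trailing partial frame is dropped rather than zero-padded.
    """
    window = hamming(frame_len)
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frames.append(
            [samples[start + i] * window[i] for i in range(frame_len)]
        )
    return frames
```

A hop smaller than the frame length gives overlapping frames, which is the usual choice because the window attenuates the samples near each frame edge.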
Description: MATLAB source code, written by the author, for extracting time-domain features of an audio signal: the short-time average zero-crossing rate and the short-time average energy. Provided for reference. Platform: |
Size: 219136 |
Author:席学云 |
Hits:
Description: This project describes the work done on the development of an audio segmentation and classification system. Many existing works on audio classification deal with the problem of classifying known homogeneous audio segments. In this work, audio recordings are divided into acoustically similar regions and classified into basic audio types such as speech, music or silence. Audio features used in this project include Mel Frequency Cepstral Coefficients (MFCC), Zero Crossing Rate and Short Term Energy (STE). These features were extracted from audio files stored in WAV format. The possible use of features extracted directly from MPEG audio files is also considered. Statistical methods are used to segment and classify audio signals using these features. The classification methods used include the Gaussian Mixture Model (GMM) and the k-Nearest Neighbour (k-NN) algorithms. It is shown that the implemented system achieves an accuracy rate of more than 95% for discrete audio classification. Platform: |
Size: 653312 |
Author:kvga |
Hits:
Description: Speech feature extraction. Reads a speech recording and outputs features of the speech file such as the short-time log energy and the short-time zero-crossing rate. Platform: |
Size: 283648 |
Author:万玉龙 |
Hits:
Description: PAV (H265) is an audio and video compression and decompression protocol that differs substantially from H264/MPEG4. The basic algorithms of ZPAV (H265) are wavelets, multi-level tree set grouping, generalized wavelets, mathematical morphological wavelets, ...
ZPAV (H265) basic algorithms:
1, Image and sound decomposition and synthesis: wavelets;
2, Image and sound pre-processing: wavelet sub-band zero-crossing noise reduction, target texture processing, speech processing;
3, Rate control: wavelet sub-band entropy rate control;
4, Quantization and inverse quantization: wavelet sub-band entropy quantization and inverse quantization;
5, Dimensionality reduction of low-frequency and high-frequency components: wavelet sub-band neighborhood cross dimensionality reduction;
6, Decomposition and synthesis of motion vectors and quantization tables: generalized wavelets;
7, Bit-plane coding: mathematical morphological wavelets, multi-level tree set grouping, embedded zerotrees, bit-plane dimensionality reduction;
8, Bitstream coding: arithmetic coding, entropy coding;
9, Motion estimation: optimal macroblock decision, motion vector prediction;
A, Motion search: diamond, large diamond, small diamond, square;
B, Image and sound post-processing: low-pass filtering, breakpoint reconstruction, macroblock smoothing;
C, Error correction: vector simulation, inter-frame simulation.
Platform: |
Size: 4826112 |
Author:李阳 |
Hits:
Description: Short-time analysis of a speech signal. Requirements: 1, the speech signal must include unvoiced, voiced, and silent segments; 2, compute its short-time average energy, short-time average zero-crossing rate, short-time autocorrelation function, and short-time power spectrum; 3, plot the results and compare how the four short-time parameters differ across the three types of segment; 4, estimate the fundamental frequency of voiced speech using the short-time autocorrelation. Written and plotted in MATLAB. Platform: |
Size: 2048 |
Author:houjam |
Hits:
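Estimating the fundamental frequency of a voiced frame from its short-time autocorrelation, as the entry above describes, can be sketched as follows. This is an illustrative Python version rather than the MATLAB code in the download; the function name `autocorr_pitch` and the 50 to 500 Hz search range are assumptions.

```python
import math


def autocorr_pitch(frame, sample_rate, f_min=50.0, f_max=500.0):
    """Estimate the fundamental frequency (Hz) of one frame.

    Picks the lag with the largest short-time autocorrelation inside the
    assumed pitch range [f_min, f_max]. Returns None when no lag has a
    positive autocorrelation (for example, a silent frame).
    """
    n = len(frame)
    lag_min = max(int(sample_rate / f_max), 1)
    lag_max = min(int(sample_rate / f_min), n - 1)
    best_lag, best_val = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Short-time autocorrelation at this lag (unnormalized sum).
        r = sum(frame[i] * frame[i + lag] for i in range(n - lag))
        if r > best_val:
            best_lag, best_val = lag, r
    return sample_rate / best_lag if best_lag else None


# Usage: a synthetic 100 Hz tone sampled at 8000 Hz, 50 ms long.
tone = [math.sin(2 * math.pi * 100 * i / 8000) for i in range(400)]
estimate = autocorr_pitch(tone, 8000)
```

Because the sum runs over fewer terms at larger lags, the peak at one pitch period outweighs the peaks at its multiples, which helps avoid reporting half the true frequency. Unvoiced and silent frames give no clear peak, so a real system would first screen frames using energy and zero-crossing rate.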
Description: Audio annotation package containing several demos that use zero-crossing rate, short-term energy, and feature extraction. Platform: |
Size: 630784 |
Author:john |
Hits:
Description: With the rapid development of science and technology, bank cards are used ever more widely, which raises card security concerns. This paper describes a bank card password disclosure vulnerability and gives a technical analysis of the illegal practice of using recording equipment on keypad password-entry terminals, such as mobile phones and telephones, to steal bank card passwords. The key tones of such terminals are analyzed with MATLAB: endpoint detection based on short-time energy and zero-crossing rate is applied to the captured audio files to extract the individual dial tones, and the processed audio is then analyzed and compared in the frequency domain to identify the key tones. Platform: |
Size: 4096 |
Author:张校非 |
Hits:
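Endpoint detection based on short-time energy and zero-crossing rate, as named in the entry above, can be sketched as a simple threshold rule. This Python version is an illustration under stated assumptions, not the paper's MATLAB method: the function name `detect_segments`, the non-overlapping frames, and the three thresholds are hypothetical, and real thresholds would be tuned on recorded data.

```python
def detect_segments(samples, frame_len, energy_high, energy_low, zcr_min):
    """Return (start, end) sample ranges judged to contain sound.

    A frame counts as active when its energy reaches energy_high, or when
    its energy reaches energy_low and its zero-crossing rate reaches
    zcr_min. All three thresholds are assumed, illustrative parameters.
    """
    active = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
        )
        zcr = crossings / (frame_len - 1)
        active.append(
            energy >= energy_high or (energy >= energy_low and zcr >= zcr_min)
        )

    # Merge runs of consecutive active frames into sample ranges.
    segments = []
    seg_start = None
    for idx, flag in enumerate(active):
        if flag and seg_start is None:
            seg_start = idx * frame_len
        elif not flag and seg_start is not None:
            segments.append((seg_start, idx * frame_len))
            seg_start = None
    if seg_start is not None:
        segments.append((seg_start, len(active) * frame_len))
    return segments
```

Each returned range could then be passed to a spectrum analysis step to identify the individual key tone it contains.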
Description: Short-time zero-crossing rate analysis of audio, a simple time-domain method for speech analysis. Platform: |
Size: 2048 |
Author:陈鹏鹏 |
Hits: