Introduction
The nonstationary signal is segmented into short frames, and each frame is processed with the short-time Fourier transform (STFT). The spectral and frequency characteristics of the frames are then analyzed to locate the time points at which the signal changes, so that the acoustic characteristics of each event period can be examined separately. The result is displayed as an image (a spectrogram), with the ultimate aim of distinguishing different sounds.
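The processing chain described above can be sketched roughly as follows. This is only an illustrative sketch in Python with NumPy, not the packaged code itself: the function names (`stft_magnitude`, `dominant_frequencies`, `change_points`), the Hann window, the frame length, the hop size, the 100 Hz jump threshold, and the synthetic two-tone test signal are all assumptions chosen for the example. Tracking the dominant frequency is just one simple way to find time nodes; the original may use a different criterion.

```python
import numpy as np


def stft_magnitude(x, frame_len=256, hop=128):
    """Split x into overlapping Hann-windowed frames and return |FFT| per frame."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # One row per frame, one column per non-negative frequency bin.
    return np.abs(np.fft.rfft(frames, axis=1))


def dominant_frequencies(mag, fs, frame_len):
    """Frequency (Hz) of the strongest bin in each frame."""
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    return freqs[np.argmax(mag, axis=1)]


def change_points(dom_freqs, hop, fs, min_jump_hz=100.0):
    """Start times (s) of frames whose dominant frequency jumps by more than min_jump_hz."""
    jumps = np.abs(np.diff(dom_freqs)) > min_jump_hz
    return (np.flatnonzero(jumps) + 1) * hop / fs


# Synthetic nonstationary signal (assumed for illustration):
# a 500 Hz tone for the first half second, then a 1500 Hz tone.
fs = 8000
t = np.arange(fs) / fs
x = np.where(t < 0.5, np.sin(2 * np.pi * 500 * t), np.sin(2 * np.pi * 1500 * t))

mag = stft_magnitude(x, frame_len=256, hop=128)
dom = dominant_frequencies(mag, fs, frame_len=256)
cps = change_points(dom, hop=128, fs=fs)
print("estimated change points (s):", cps)
```

To view the result as an image, the transposed magnitude array (`mag.T`, frequency on the vertical axis and frame index on the horizontal axis) can be passed to a plotting routine such as matplotlib's `imshow` or `pcolormesh`. A shorter frame gives finer time resolution for locating the change, at the cost of coarser frequency resolution.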