Pitch extraction using modified higher order moments
2010
This paper proposes a set of higher-order modified moments as alternative objective criteria for pitch extraction and explores the impact of the speech window length on pitch estimation error. To obtain the K th order modified moment, each speech frame is split into a positive-valued signal and a negative-valued signal. The magnitudes of the K th order moments for the positive and the negative valued signals are obtained and combined. The proposed objective criteria form a relatively sharp peak around the true pitch value compared to the correlation function. For calculation of errors, pitch reference (‘ground truth’) values are calculated from manually-corrected estimates of the periods obtained from laryngograph signals. The results obtained for the third order modified moment are compared with the results for correlation and magnitude difference criteria and the YIN method. The modified moments provide improved pitch accuracy with less occurrence of large errors (e.g. half or double pitch estimation errors).
Keywords:
- Speech recognition
- Statistics
- Speech processing
- Speech enhancement
- Signal-to-noise ratio
- Correlation function
- Mathematical optimization
- Magnitude (mathematics)
- Pitch detection algorithm
- Harmonic analysis
- Computer science
- Correlation
- Artificial intelligence
- Third order
- Mathematical analysis
- Ground truth
- Pattern recognition
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
11
References
3
Citations
NaN
KQI