GROUP DELAY BASED MELODY EXTRACTION FOR INDIAN MUSIC
December 21, 2013
Rajeev Rajan and Hema A. Murthy Department of Computer Science and Engineering Indian Institute of Technology,Madras e-mail:rajeevrajan002@gmail.com
Slide 1/21
GROUP DELAY BASED MELODY EXTRACTION FOR INDIAN MUSIC December 21, - - PowerPoint PPT Presentation
GROUP DELAY BASED MELODY EXTRACTION FOR INDIAN MUSIC December 21, 2013 Rajeev Rajan and Hema A. Murthy Department of Computer Science and Engineering Indian Institute of Technology,Madras e-mail:rajeevrajan002@gmail.com Slide 1/21 Outline
Slide 1/21
Slide 1/21
1Graham E.Poliner, Daniel P . W. Ellis, Andreas F. Ehmann, Emilia Gomez, Sebastian Strich and Beesuan Ong,Melody Transcription From Music Audio :Approaches and Evaluations“,IEEE Transactions on Audio, Speech, and Language Processing ,pp-1247–1256,Vol-15,No-4,May 2007 Slide 2/21
audio signals”, Working Notes of the IJCAI-99 Workshop on Computational Auditory Scene Analysis, pp 31-40
International Society for Music Information Retrieval (ISMIR),No.4, 2007. 3Justin Salamon and Emilia Gomez, “Melody extraction from polyphonic music signals using pitch contours characteristics,” In IEEE Trans. on Audio Speech and Language Processing, vol. 20, no. 6, pp. 1759- 1770, August 2012.
. Rao,“Vocal melody extraction in the presence of pitched accompaniment in polyphonic music” In
Slide 3/21
Slide 4/21
Slide 5/21
= XR(ejω)YR(ejω) + YI(ejω)XI(ejω) | S(ejω) |2 (3)
Steps Algorithm 1 Let x[n] be the given sequence. 2 Compute the DFT X[k] , Y [k], of x[n] and nx[n] respectively 3 Group delay function is τx[k] = XR[k]YR[k]+XI[k]YI[k]
|X[k]|2
R and I represents real and imaginary respectively. 4 Modified group delayτ[k] = XR[k]YR[k]+XI[k]YI[k]
|S[k]|2
, where S[k] is the smoothed version of X[k] 5 Two new parameters α and γ are introduced in Equation of τ[k] τm[k] =
τ[k] |τ[k]|(|τ[k]|)α
τm[k] = XR[k]YR[k]+XI[k]YI[k]
|S[k]|2γ
1Hema A. Murthy, Algorithms for Processing Fourier Transform Phase of Signals, PhD dissertation, Indian Institute of Technology, Department of Computer Science and Engg., Madras, India, December 1991. Slide 6/21
Slide 7/21
Slide 8/21
Slide 9/21
1700 1800 1900 2000 2100 2200 2300 220 240 260 280 300 320 340 360 380
Frame Index Pitch (b)
MODGD Pitch REF
Slide 10/21
Slide 11/21
Slide 12/21
Slide 13/21
Slide 14/21
KNF0
Slide 15/21
Slide 16/21
s)2 − e2
s is the detected pitch, N is the
s)
Slide 17/21
4200 4250 4300 4350 4400 4450 4500 4550 4600 4650 4700 100 150 200 250 300
Frame Index Pitch (b) MODGD Pitch REF
Original audio-GNB Kamboji4b 039GNB.wav Synthesized audio Slide 18/21
Method OA RPA RCA VR VF V.Arora et al 69.06 81.41 85.92 76.51 23.56 Sam Meyer 60.34 64.23 71.21 77.36 32.96 Bin Liao et al(1) 46.24 55.87 66.71 99.98 97.76 Bin Liao et al(2) 41.54 48.32 59.90 99.96 95.37 Bin Liao et al(3) 41.54 48.32 59.90 99.96 95.37 Salamon et al 73.55 76.34 78.71 80.55 15.09 Liao et al 73.05 84.50 86.16 98.60 87.37 Tachibana et al 59.62 73.03 81.43 74.98 29.37 MODGD 60.78 67.80 75.95 82.26 26.00 Slide 19/21
OA RPA RCA VR VF V.Arora et al 67.95 85.85 86.79 70.76 15.58 Sam Meyer 50.06 49.31 59.48 63.52 30.23 Bin Liao et al(1) 70.25 81.94 82.17 100.00 100.00 Bin Liao et al(2) 51.21 59.59 67.95 100.00 100.00 Bin Liao et al(3) 51.51 59.59 67.95 100.00 100.00 Salamon et al 82.78 87.55 88.02 89.26 17.86 Chien et al 68.88 71.75 74.67 89.60 44.81 Stacy et al 63.57 67.64 73.20 78.69 34.25 MODGD 58.21 64.44 66.05 82.88 27.94
Method σe RPA RCA YIN 2.94 74.20 85.00 MODGD 2.67 75.16 80.49 1Ref: Melodia Slide 20/21
Slide 21/21