The Audio Degradation Toolbox - - PowerPoint PPT Presentation

the audio degradation toolbox
SMART_READER_LITE
LIVE PREVIEW

The Audio Degradation Toolbox - - PowerPoint PPT Presentation

The Audio Degradation Toolbox http://code.soundsoftware.ac.uk/projects/audio-degradation-toolbox/ and its Application to Robustness Evaluation Sebastian Ewert and Matthias Mauch Friday, 1 November 13 reverb photo by steveleenow Friday, 1


slide-1
SLIDE 1

The Audio Degradation Toolbox

and its Application to Robustness Evaluation Sebastian Ewert and Matthias Mauch

http://code.soundsoftware.ac.uk/projects/audio-degradation-toolbox/

Friday, 1 November 13

slide-2
SLIDE 2

reverb

photo by steveleenow

Friday, 1 November 13

slide-3
SLIDE 3

lossy compression

photo by dan taylor

Friday, 1 November 13

slide-4
SLIDE 4

bad analog-to-digital conversion

photo by emilio di fabio

Friday, 1 November 13

slide-5
SLIDE 5

low quality microphone

photo by JeffaCubed

Friday, 1 November 13

slide-6
SLIDE 6

Environmental noise

Friday, 1 November 13

slide-7
SLIDE 7

...and many other things degrade audio.

irregular tape playback dynamic range compression in radio and tv broadcasts audio speedup on the radio noise clipping and other distortion ... and yet more.

Friday, 1 November 13

slide-8
SLIDE 8

Audio Collection Quality

most audio collections contain some audio of low quality contain recordings of different qualities contain recording of unknown quality

bad

  • k
  • k

bad bad

  • k

alright-ish bad

  • k

Friday, 1 November 13

slide-9
SLIDE 9

Audio Collection Quality

most audio collections contain some audio of low quality contain recordings of different qualities contain recording of unknown quality

Friday, 1 November 13

slide-10
SLIDE 10

Impact on Music Informatics

methods are usually tested only on one (or few) audio collections, hence: feature extractors (etc.) might fail in the real world affects MIR researchers’ work if feature extractors work, it is not clear if they corrleate with content or audio quality affects ‘digital musicologists’ and industry

Friday, 1 November 13

slide-11
SLIDE 11

Audio Degradation Toolbox

most comprehensive collection of Matlab code for audio degradation designed to make it easy to degrade audio in many different ways aim: encourage MIR researchers to test their algorithms under many different conditions

GPL open source

  • n SoundSoftware

Friday, 1 November 13

slide-12
SLIDE 12

Degradation Units

Add Noise Add Sound Attenuation Aliasing Clipping Delay Dynamic Range Compr. Apply Impulse Response High-pass filter Low-pass filter MP3 Compression Saturation Speedup Wow Resampling

Friday, 1 November 13

slide-13
SLIDE 13

Degradation Units

Add Noise Add Sound Attenuation Aliasing Clipping Delay Dynamic Range Compr. Apply Impulse Response High-pass filter Low-pass filter MP3 Compression Saturation Speedup Wow Resampling sounds included: pub sound env., vinyl crackle

Friday, 1 November 13

slide-14
SLIDE 14

Degradation Units

Add Noise Add Sound Attenuation Aliasing Clipping Delay Dynamic Range Compr. Apply Impulse Response High-pass filter Low-pass filter MP3 Compression Saturation Speedup Wow Resampling sounds included: pub sound env., vinyl crackle room, microphone, speaker and vinyl player IRs

Friday, 1 November 13

slide-15
SLIDE 15

Degradation Unit Example

example sound before / after why “timestamps” — we’ll see later.

parameter.noiseColor = ’brown’; [audio_out, timestamps_out] = degradationUnit_addNoise(audio, samplingFreq, timestamps, parameter)

Friday, 1 November 13

slide-16
SLIDE 16

Degradations

to make complex “Degradations” we can make chains from degradation units ... like audio effects! Example: Radio Broadcast Degradation Dynamic Range Compr. Speedup

Friday, 1 November 13

slide-17
SLIDE 17

Degradations

to make complex “Degradations” we can make chains from degradation units ... like audio effects! Example: Radio Broadcast Degradation Radio Broadcast Dynamic Range Compr. Speedup

Friday, 1 November 13

slide-18
SLIDE 18

Degradations — examples

Lots of audio examples (file://localhost/Users/ matthiasm/code/audio-degradation-toolbox/ html/audio_examples.html) Examples with spectrogram: Wow resampling on cello (file6) Live Recording on file1

Friday, 1 November 13

slide-19
SLIDE 19

Comparing to Ground Truth

  • ne main purpose:

evaluate methods under different degradations problem — we have time-distorting degradations solution: every degradation can also transform ground truth to the time line of the degraded audio example: beat tracking ground truth after “Speedup” degradation

Friday, 1 November 13

slide-20
SLIDE 20

Comparing to Ground Truth

  • ne main purpose:

evaluate methods under different degradations problem — we have time-distorting degradations solution: every degradation can also transform ground truth to the time line of the degraded audio example: beat tracking ground truth after “Speedup” degradation

time

  • riginal ground truth

Friday, 1 November 13

slide-21
SLIDE 21

Comparing to Ground Truth

  • ne main purpose:

evaluate methods under different degradations problem — we have time-distorting degradations solution: every degradation can also transform ground truth to the time line of the degraded audio example: beat tracking ground truth after “Speedup” degradation

time

  • riginal ground truth

transformed

Friday, 1 November 13

slide-22
SLIDE 22

Revisit Example

Radio Broadcast Dynamic Range Compr. Speedup

audio ground truth timestamps transformed ground truth transformed audio degradation unit degradation unit degradation

Friday, 1 November 13

slide-23
SLIDE 23

Experiments on ‘Real-World’ Degradations

Live Recording Vinyl Recording Smartphone Recording Radio Broadcast Smartphone Playback Strong MP3 Compression

Friday, 1 November 13

slide-24
SLIDE 24

Results I — Audio ID

audio ID fails for most “Real-World” degradations, not for mp3 robustness to pink noise is ok

correct incorrect not identified Original 100 Live 100 Radio 3 3 94 PhonePlay 1 99 PhoneRec 5 7 88 MP3 100 Vinyl 4 96

Friday, 1 November 13

slide-25
SLIDE 25

Results I — Audio ID

audio ID fails for most “Real-World” degradations, not for mp3 robustness to pink noise is ok

correct incorrect not identified Original 100 Live 100 Radio 3 3 94 PhonePlay 1 99 PhoneRec 5 7 88 MP3 100 Vinyl 4 96 20 40 60 80 100 dB SNR correct

  • rig

40 30 20 10 5

Friday, 1 November 13

slide-26
SLIDE 26

Results II — Score-to-audio alignment

pretty much falls over for “Live” and “Phone Playback” degradations explanations: onset duplication; bass harmony missing

percentage in 50ms window 30 40 50 60 70 80 90 100 Original Live Radio PhonePlay PhoneRec MP3 Vinyl

Friday, 1 November 13

slide-27
SLIDE 27

Results III — Beat-tracking

compare two methods: BeatRoot, Davies very similar, but Davies more robust to “Live” degradation

F measure 0.0 0.2 0.4 0.6 0.8 1.0 Original Live Radio PhonePlay PhoneRec MP3 Vinyl BeatRoot Davies

Friday, 1 November 13

slide-28
SLIDE 28

Results IV — Chord recognition

compare two methods: Chordino, HPA HPA usually better, Chordino more robust

  • n “Phone Play”

relative correct overlap 0.0 0.2 0.4 0.6 0.8 1.0 Original Live Radio PhonePlay PhoneRec MP3 Vinyl Chordino HPA

Friday, 1 November 13

slide-29
SLIDE 29

Results IV — Chord recognition

compare two methods: Chordino, HPA HPA usually better, Chordino more robust

  • n “Phone Play”

relative correct overlap 0.0 0.2 0.4 0.6 0.8 1.0 Original Live Radio PhonePlay PhoneRec MP3 Vinyl Chordino HPA relative correct overlap 0.0 0.2 0.4 0.6 0.8 1.0 Original HP 50 HP 100 HP 200 HP 400 HP 800 Chordino HPA

Friday, 1 November 13

slide-30
SLIDE 30

Results IV — Chord recognition

compare two methods: Chordino, HPA HPA usually better, Chordino more robust

  • n “Phone Play”

relative correct overlap 0.0 0.2 0.4 0.6 0.8 1.0 Original Live Radio PhonePlay PhoneRec MP3 Vinyl Chordino HPA relative correct overlap 0.0 0.2 0.4 0.6 0.8 1.0 Original HP 50 HP 100 HP 200 HP 400 HP 800 Chordino HPA

C D E F G A B

  • riginal

400Hz High-pass

time time

Friday, 1 November 13

slide-31
SLIDE 31

Summary

Audio Degradation Toolbox offers easy-to-use degradations more comprehensive than other existing toolboxes ground truth time-line transform to evaluate on time- warping degradations Results show: ADT is useful to detect strengths and weaknesses of MIR methods For paper, audio examples, source code: http://code.soundsoftware.ac.uk/projects/audio- degradation-toolbox

Friday, 1 November 13

slide-32
SLIDE 32

What’s up next?

convince everyone to use the ADT :) work with it ourselves... degraded audio as additional training data affect of degradation on human ground truth labelling

Friday, 1 November 13