AES Dublin Paper Session P18: MIR

AES Dublin 2019
Paper Session P18

P18 - MIR


Friday, March 22, 16:30 — 18:00 (Meeting Room 2)

Chair:
Konstantinos Tsioutas, Athens University of Economics and Business - Athens, Greece

P18-1 Evaluating White Noise Degradation on Sonic Quick Response Code (SQRC) Decode EfficacyMark Sheppard, Anglia Ruskin University - Cambridge, Cambridgeshire, UK; Rob Toulson, University of Westminster - London, UK
With the advent of high-resolution recording and playback systems, a proportion of the ultrasonic frequency spectrum can potentially be utilized as a carrier for imperceptible data, which can be used to trigger events or to hold metadata in the form of, for example, an ISRC (International Standard Recording Code), a website address or audio track liner notes. The Sonic Quick Response Code (SQRC) algorithm was previously proposed as a method for encoding inaudible acoustic metadata within a 96 kHz audio file in the 30–35 kHz range. This paper demonstrates the effectiveness of the SQRC decode algorithm when acoustically transmitted over distance while evaluating the degradation effect of adding ultrasonic banded white noise to the pre and post transmission SQRC signal.
Convention Paper 10197 (Purchase now)

P18-2 Tagging and Retrieval of Room Impulse Responses Using Semantic Word Vectors and Perceptual Measures of ReverberationEmmanouil Theofanis Chourdakis, Queen Mary University London - London, UK; Joshua D. Reiss, Queen Mary University of London - London, UK
This paper studies tagging and retrieval of room impulse responses from a labelled library. A similarity-based method is introduced that relies on perceptually relevant characteristics of reverberation. This method is developed using a publicly available dataset of algorithmic reverberation settings. Semantic word vectors are introduced to exploit semantic correlation among tags and allow for unseen words to be used for retrieval. Average precision is reported on a subset of the dataset as well as tagging of recorded room impulse responses. The developed approach manages to assign downloaded room impulse responses to tags that match their short descriptions. Furthermore, introducing semantic word vectors allows it to perform well even when large portions of the training data have been replaced by synonyms.
Convention Paper 10198 (Purchase now)

P18-3 A Custom Integrated Circuit Based Audio-to-CV and Audio-to-MIDI SolutionBrian Kaczynski, Second Sound, LLC - Miami, FL, USA
A new synthesizer technology is demonstrated that tracks the fundamental frequency of virtually any acoustic or electric instrument played monotonically. This technology relies on a mixed analog-digital application-specific integrated circuit (ASIC), which contains a very fast frequency-locked loop (FLL) that tracks with the minimum physically achievable latency of one audio cycle. The ASIC also contains a novel fundamental frequency detection circuit composed of two switched-capacitor peak detectors with decay time proportional to the fundamental period of the audio signal and a novel switched-capacitor, zero-ripple envelope follower. This frequency-tracking technology is fast enough to implement an audio-to-CV or even, with the addition of a simple microcontroller, an audio-to-MIDI solution in real time with very high accuracy and negligible latency.
Convention Paper 10199 (Purchase now)


Return to Paper Sessions