[Feature] Although machines are getting better at understanding audio signals, the tasks that are being undertaken are still relatively basic. Identifying a solo on a particular instrument, working out who is talking, working out when someone is singing, and so forth—these are tasks we mostly do relatively easily as humans. Coming up with an algorithm that enables a machine to do it reliably is surprisingly challenging. Selected papers from the recent conference on semantic audio are summarized.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.