Generating melodic dictations using Markov Chains and LSTM neural networks

Stefanowska, Emilia; Kacprzak, Stanis?aw; Ksi??ek, Piotr

AES E-Library

Generating melodic dictations using Markov Chains and LSTM neural networks

Melodic dictations are aural training exercises that require students to transcribe the melody they hear into musical notation. In this paper, we propose three algorithms that generate single-voice melodies that could be serve as melodic dictations. The first algorithm utilizes a higher-order Markov Chain model to generate melodic patterns based on a given data set of training set dictations. The second algorithm employs a neural network with Long Short-Term Memory (LSTM) layers and the Bahdanau attention mechanism. The third algorithm generates melodies by choosing each note randomly. We analyzed the generated dictations using the dissimilarity index based on the cross-correlation, to demonstrate that the algorithms generate novel and diverse melodic dictations. To evaluate the musical quality of the melodies, we conducted a survey in which professional music theory teachers graded the dictations from the training set and those generated by the algorithms. The results indicate that some of the generated dictations are comparable in quality to those in the training set and could find potential applications in musical education.

Authors: Stefanowska, Emilia; Kacprzak, Stanis?aw; Ksi??ek, Piotr
Affiliations: AGH University of Science and Technology, Kraków, Poland; AGH University of Science and Technology, Kraków, Poland; AGH University of Science and Technology, Kraków, Poland(See document for exact affiliation information.)
AES Convention: 154 (May 2023) Paper Number: 10647
Publication Date: May 13, 2023 Import into BibTeX
Subject: Music AI
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=22054

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: /conv/154/10647.pdf

Start a discussion about this Music AI!

AES E-Library

Generating melodic dictations using Markov Chains and LSTM neural networks

ABOUT AES

Contact Us