Evaluating language models of tonal harmony

David R.W. Sears; Filip Korzeniowski; Gerhard Widmer

Evaluating language models of tonal harmony

David R.W. Sears, Filip Korzeniowski, Gerhard Widmer

Visual and Performing Arts

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

This study borrows and extends probabilistic language models from natural language processing to discover the syntactic properties of tonal harmony. Language models come in many shapes and sizes, but their central purpose is always the same: to predict the next event in a sequence of letters, words, notes, or chords. However, few studies employing such models have evaluated the most state-of-the-art architectures using a large-scale corpus of Western tonal music, instead preferring to use relatively small datasets containing chord annotations from contemporary genres like jazz, pop, and rock. Using symbolic representations of prominent instrumental genres from the common-practice period, this study applies a flexible, data-driven encoding scheme to (1) evaluate Finite Context (or n-gram) models and Recurrent Neural Networks (RNNs) in a chord prediction task; (2) compare predictive accuracy from the best-performing models for chord onsets from each of the selected datasets; and (3) explain differences between the two model architectures in a regression analysis. We find that Finite Context models using the Prediction by Partial Match (PPM) algorithm outperform RNNs, particularly for the piano datasets, with the regression model suggesting that RNNs struggle with particularly rare chord types.

Original language	English
Title of host publication	Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018
Editors	Emilia Gomez, Xiao Hu, Eric Humphrey, Emmanouil Benetos
Publisher	International Society for Music Information Retrieval
Pages	211-217
Number of pages	7
ISBN (Electronic)	9782954035123
State	Published - 2018
Event	19th International Society for Music Information Retrieval Conference, ISMIR 2018 - Paris, France Duration: Sep 23 2018 → Sep 27 2018

Publication series

Name	Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018

Conference

Conference	19th International Society for Music Information Retrieval Conference, ISMIR 2018
Country/Territory	France
City	Paris
Period	09/23/18 → 09/27/18

Cite this

Sears, D. R. W., Korzeniowski, F., & Widmer, G. (2018). Evaluating language models of tonal harmony. In E. Gomez, X. Hu, E. Humphrey, & E. Benetos (Eds.), Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018 (pp. 211-217). (Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018). International Society for Music Information Retrieval.

Sears, David R.W. ; Korzeniowski, Filip ; Widmer, Gerhard. / Evaluating language models of tonal harmony. Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018. editor / Emilia Gomez ; Xiao Hu ; Eric Humphrey ; Emmanouil Benetos. International Society for Music Information Retrieval, 2018. pp. 211-217 (Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018).

@inproceedings{28d82aeb817e4048842574888c5cfaeb,

title = "Evaluating language models of tonal harmony",

abstract = "This study borrows and extends probabilistic language models from natural language processing to discover the syntactic properties of tonal harmony. Language models come in many shapes and sizes, but their central purpose is always the same: to predict the next event in a sequence of letters, words, notes, or chords. However, few studies employing such models have evaluated the most state-of-the-art architectures using a large-scale corpus of Western tonal music, instead preferring to use relatively small datasets containing chord annotations from contemporary genres like jazz, pop, and rock. Using symbolic representations of prominent instrumental genres from the common-practice period, this study applies a flexible, data-driven encoding scheme to (1) evaluate Finite Context (or n-gram) models and Recurrent Neural Networks (RNNs) in a chord prediction task; (2) compare predictive accuracy from the best-performing models for chord onsets from each of the selected datasets; and (3) explain differences between the two model architectures in a regression analysis. We find that Finite Context models using the Prediction by Partial Match (PPM) algorithm outperform RNNs, particularly for the piano datasets, with the regression model suggesting that RNNs struggle with particularly rare chord types.",

author = "Sears, {David R.W.} and Filip Korzeniowski and Gerhard Widmer",

note = "Publisher Copyright: {\textcopyright} Sears, Korzeniowski, Widmer.; 19th International Society for Music Information Retrieval Conference, ISMIR 2018 ; Conference date: 23-09-2018 Through 27-09-2018",

year = "2018",

language = "English",

series = "Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018",

publisher = "International Society for Music Information Retrieval",

pages = "211--217",

editor = "Emilia Gomez and Xiao Hu and Eric Humphrey and Emmanouil Benetos",

booktitle = "Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018",

}

Sears, DRW, Korzeniowski, F & Widmer, G 2018, Evaluating language models of tonal harmony. in E Gomez, X Hu, E Humphrey & E Benetos (eds), Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018. Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, International Society for Music Information Retrieval, pp. 211-217, 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, 09/23/18.

Evaluating language models of tonal harmony. / Sears, David R.W.; Korzeniowski, Filip; Widmer, Gerhard.
Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018. ed. / Emilia Gomez; Xiao Hu; Eric Humphrey; Emmanouil Benetos. International Society for Music Information Retrieval, 2018. p. 211-217 (Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Evaluating language models of tonal harmony

AU - Sears, David R.W.

AU - Korzeniowski, Filip

AU - Widmer, Gerhard

N1 - Publisher Copyright: © Sears, Korzeniowski, Widmer.

PY - 2018

Y1 - 2018

N2 - This study borrows and extends probabilistic language models from natural language processing to discover the syntactic properties of tonal harmony. Language models come in many shapes and sizes, but their central purpose is always the same: to predict the next event in a sequence of letters, words, notes, or chords. However, few studies employing such models have evaluated the most state-of-the-art architectures using a large-scale corpus of Western tonal music, instead preferring to use relatively small datasets containing chord annotations from contemporary genres like jazz, pop, and rock. Using symbolic representations of prominent instrumental genres from the common-practice period, this study applies a flexible, data-driven encoding scheme to (1) evaluate Finite Context (or n-gram) models and Recurrent Neural Networks (RNNs) in a chord prediction task; (2) compare predictive accuracy from the best-performing models for chord onsets from each of the selected datasets; and (3) explain differences between the two model architectures in a regression analysis. We find that Finite Context models using the Prediction by Partial Match (PPM) algorithm outperform RNNs, particularly for the piano datasets, with the regression model suggesting that RNNs struggle with particularly rare chord types.

AB - This study borrows and extends probabilistic language models from natural language processing to discover the syntactic properties of tonal harmony. Language models come in many shapes and sizes, but their central purpose is always the same: to predict the next event in a sequence of letters, words, notes, or chords. However, few studies employing such models have evaluated the most state-of-the-art architectures using a large-scale corpus of Western tonal music, instead preferring to use relatively small datasets containing chord annotations from contemporary genres like jazz, pop, and rock. Using symbolic representations of prominent instrumental genres from the common-practice period, this study applies a flexible, data-driven encoding scheme to (1) evaluate Finite Context (or n-gram) models and Recurrent Neural Networks (RNNs) in a chord prediction task; (2) compare predictive accuracy from the best-performing models for chord onsets from each of the selected datasets; and (3) explain differences between the two model architectures in a regression analysis. We find that Finite Context models using the Prediction by Partial Match (PPM) algorithm outperform RNNs, particularly for the piano datasets, with the regression model suggesting that RNNs struggle with particularly rare chord types.

UR - http://www.scopus.com/inward/record.url?scp=85069862591&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85069862591

T3 - Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018

SP - 211

EP - 217

BT - Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018

A2 - Gomez, Emilia

A2 - Hu, Xiao

A2 - Humphrey, Eric

A2 - Benetos, Emmanouil

PB - International Society for Music Information Retrieval

T2 - 19th International Society for Music Information Retrieval Conference, ISMIR 2018

Y2 - 23 September 2018 through 27 September 2018

ER -

Sears DRW, Korzeniowski F, Widmer G. Evaluating language models of tonal harmony. In Gomez E, Hu X, Humphrey E, Benetos E, editors, Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018. International Society for Music Information Retrieval. 2018. p. 211-217. (Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018).

Evaluating language models of tonal harmony

Abstract

Publication series

Conference

Other files and links

Fingerprint

Cite this