The power of deep without going deep? A study of HDPGMM music representation learning

Jaehun Kim; C.C.S. Liem

Multimedia Computing

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Over the past decade, Deep Learning (DL) has proven to be one of the most effective machine learning methods for tackling a wide range of Music Information Retrieval (MIR) tasks. It offers highly expressive learning capacity that can fit any music representation needed for MIR-relevant downstream tasks. However, it has been criticized for sacrificing interpretability. On the other hand, the Bayesian nonparametric (BN) approach promises positive properties similar to those of DL, such as high flexibility, while being robust to overfitting and preserving interpretability. Therefore, the primary motivation of this work is to explore the potential of Bayesian nonparametric models in comparison to DL models for music representation learning. More specifically, we assess the music representation learned from the Hierarchical Dirichlet Process Gaussian Mixture Model (HDPGMM), an infinite mixture model based on the Bayesian nonparametric approach, on MIR tasks including classification, auto-tagging, and recommendation. The experimental results suggest that the HDPGMM music representation can outperform DL representations in certain scenarios and is comparable overall.

Original language	English
Title of host publication	Proceedings of the 23rd International Society for Music Information Retrieval Conference
Pages	116 - 124
Number of pages	9
Publication status	Published - 2022
Event	23rd International Society for Music Information Retrieval Conference - Bengaluru, India; Duration: 4 Dec 2022 → 8 Dec 2022; Conference number: 23

Conference

Conference	23rd International Society for Music Information Retrieval Conference
Abbreviated title	ISMIR 2022
Country/Territory	India
City	Bengaluru
Period	4/12/22 → 8/12/22

Access to Document

Final published version, 317 KB, Licence: CC BY

Supplementary material of the paper "The power of deep without going deep? A study of HDPGMM music representation learning"
Kim, J. H. (Creator) & Liem, C. C. S. (Creator), TU Delft - 4TU.ResearchData, 6 Feb 2023
DOI: 10.4121/21981442
Dataset/Software: Dataset

Cite this

@inproceedings{6485898ba36b41b4a958b917339a7ef9,
  title = "The power of deep without going deep? A study of HDPGMM music representation learning",
  abstract = "Over the past decade, Deep Learning (DL) has proven to be one of the most effective machine learning methods for tackling a wide range of Music Information Retrieval (MIR) tasks. It offers highly expressive learning capacity that can fit any music representation needed for MIR-relevant downstream tasks. However, it has been criticized for sacrificing interpretability. On the other hand, the Bayesian nonparametric (BN) approach promises positive properties similar to those of DL, such as high flexibility, while being robust to overfitting and preserving interpretability. Therefore, the primary motivation of this work is to explore the potential of Bayesian nonparametric models in comparison to DL models for music representation learning. More specifically, we assess the music representation learned from the Hierarchical Dirichlet Process Gaussian Mixture Model (HDPGMM), an infinite mixture model based on the Bayesian nonparametric approach, on MIR tasks including classification, auto-tagging, and recommendation. The experimental results suggest that the HDPGMM music representation can outperform DL representations in certain scenarios and is comparable overall.",
  author = "Jaehun Kim and C.C.S. Liem",
  year = "2022",
  language = "English",
  pages = "116--124",
  booktitle = "Proceedings of the 23rd International Society for Music Information Retrieval Conference",
  note = "23rd International Society for Music Information Retrieval Conference, ISMIR 2022 ; Conference date: 04-12-2022 Through 08-12-2022",
}

TY - GEN

T1 - The power of deep without going deep? A study of HDPGMM music representation learning

AU - Kim, Jaehun

AU - Liem, C.C.S.

N1 - Conference code: 23

PY - 2022

Y1 - 2022

AB - Over the past decade, Deep Learning (DL) has proven to be one of the most effective machine learning methods for tackling a wide range of Music Information Retrieval (MIR) tasks. It offers highly expressive learning capacity that can fit any music representation needed for MIR-relevant downstream tasks. However, it has been criticized for sacrificing interpretability. On the other hand, the Bayesian nonparametric (BN) approach promises positive properties similar to those of DL, such as high flexibility, while being robust to overfitting and preserving interpretability. Therefore, the primary motivation of this work is to explore the potential of Bayesian nonparametric models in comparison to DL models for music representation learning. More specifically, we assess the music representation learned from the Hierarchical Dirichlet Process Gaussian Mixture Model (HDPGMM), an infinite mixture model based on the Bayesian nonparametric approach, on MIR tasks including classification, auto-tagging, and recommendation. The experimental results suggest that the HDPGMM music representation can outperform DL representations in certain scenarios and is comparable overall.

M3 - Conference contribution

SP - 116

EP - 124

BT - Proceedings of the 23rd International Society for Music Information Retrieval Conference

T2 - 23rd International Society for Music Information Retrieval Conference

Y2 - 4 December 2022 through 8 December 2022

ER -
