MTL Music Representation, data underlying the publication: One deep music representation to rule them all? A comparative analysis of different representation learning strategies
Datacite citation style:
Kim, Jaehun; Urbano, J. (Julián); Liem, C.C.S. (Cynthia); Hanjalic, A. (Alan) (2019): MTL Music Representation, data underlying the publication: One deep music representation to rule them all? A comparative analysis of different representation learning strategies. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/uuid:3c7d3086-bfec-407d-a33c-0a7a9c8d7ec0
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
usage stats
1105
views
412
downloads
licence
CC BY-NC-SA 4.0
MTL Music Representation dataset is the collection of 384 neural network that are trained on 8 learning tasks and datasets (learning sources) from music domain. The data used in the training consists of subset of the Million Song Dataset (MSD). The neural network architecture is based on the VGG architecture. To host multiple learning sources, we adopted multi-task architecture where the task-specific layers branches out from the shared layer.
Main dataset file consists of multiple directories, where the model checkpoint and the learning curve data is saved in two separate files. Each model parameter is saved in compressed binary file serialized by the `Pytorch` python package. Each learning curve data is saved in `.csv` file with the model idenfier, where each row indicates individual record for the loss function for either training or validation.
We are planning to provide a number of utilities for instance: 1) extracting features for given audio file 2) visualizing and save the learning curves.
For more information, please visit our github page.
history
- 2019-03-13 first online, published, posted
publisher
4TU.Centre for Research Data
format
media types: application/octet-stream, application/zip, text/csv, text/markdown, text/plain
organizations
TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Science, Department of Intelligent Systems, Multimedia Computing Group
DATA
files (2)
- 2,571 bytesMD5:
420ebb486451362d9e3d4a4c7143f422
README.md - 8,746,183,778 bytesMD5:
73f22b1f9bbd1704d3b9f4ce8e2b5203
data.zip -
download all files (zip)
8,746,186,349 bytes unzipped