Steelpan-specific pitch detection: a dataset and deep learning model

Colin Malloy, and George Tzanetakis

Proceedings of the International Conference on New Interfaces for Musical Expression

Abstract:

The steelpan is a pitched percussion instrument that although generally known by listeners is typically not included in music instrument audio datasets. This means that it is usually underrepresented in existing data-driven deep learning models for fundamental frequency estimation. Furthermore, the steelpan has complex acoustic properties that make fundamental frequency estimation challenging when using deep learning models for general fundamental frequency estimation for any music instrument. Fundamental frequency estimation or pitch detection is a fundamental task in music information retrieval and it is interesting to explore methods that are tailored to specific instruments and whether they can outperform general methods. To address this, we present SASS, the Steelpan Analysis Sample Set that can be used to train steel-pan specific pitch detection algorithms as well as propose a custom-trained deep learning model for steelpan fundamental frequency estimation. This model outperforms general state-of-the-art methods such as pYin and CREPE on steelpan audio - even while having significantly fewer parameters and operating on a shorter analysis window. This reduces minimum system latency, allowing for deployment to a real-time system that can be used in live music contexts.

Citation:

Colin Malloy, and George Tzanetakis. 2023. Steelpan-specific pitch detection: a dataset and deep learning model. Proceedings of the International Conference on New Interfaces for Musical Expression. DOI: 10.5281/zenodo.11189232

BibTeX Entry:

  @inproceedings{nime2023_59,
 abstract = {The steelpan is a pitched percussion instrument that although generally known by listeners is typically not included in music instrument audio datasets.  This means that it is usually underrepresented in existing data-driven deep learning models for fundamental frequency estimation. Furthermore, the steelpan has complex acoustic properties that make fundamental frequency estimation challenging when using deep learning models for general fundamental frequency estimation for any music instrument. Fundamental frequency estimation or pitch detection is a fundamental task in music information retrieval and it is interesting to explore methods that are tailored to  specific instruments and whether they can outperform general methods.  To address this, we present SASS, the Steelpan Analysis Sample Set that can be used to train steel-pan specific pitch detection algorithms as well as propose a custom-trained deep learning model for steelpan fundamental frequency estimation. This model outperforms general state-of-the-art methods such as pYin and CREPE on steelpan audio - even while having significantly fewer parameters and operating on a shorter analysis window. This reduces minimum system latency, allowing for deployment to a real-time system that can be used in live music contexts.},
 address = {Mexico City, Mexico},
 articleno = {59},
 author = {Colin Malloy and George Tzanetakis},
 booktitle = {Proceedings of the International Conference on New Interfaces for Musical Expression},
 doi = {10.5281/zenodo.11189232},
 editor = {Miguel Ortiz and Adnan Marquez-Borbon},
 issn = {2220-4806},
 month = {May},
 numpages = {8},
 pages = {428--435},
 title = {Steelpan-specific pitch detection: a dataset and deep learning model},
 track = {Papers},
 url = {http://nime.org/proceedings/2023/nime2023_59.pdf},
 year = {2023}
}