claVision: Visual Automatic Piano Music Transcription

Mohammad Akbari and Howard Cheng

Proceedings of the International Conference on New Interfaces for Musical Expression

Abstract

One important problem in Musical Information Retrieval is Automatic Music Transcription, which is an automated conversion process from played music to a symbolic notation such as sheet music. Since the accuracy of previous audio-based transcription systems is not satisfactory, we propose an innovative visual-based automatic music transcription system named claVision to perform piano music transcription. Instead of processing the music audio, the system performs the transcription only from the video performance captured by a camera mounted over the piano keyboard. claVision can be used as a transcription tool, but it also has other applications such as music education. The claVision software has a very high accuracy (over 95%) and a very low latency in real-time music transcription, even under different illumination conditions.

Citation

Mohammad Akbari and Howard Cheng. 2015. claVision: Visual Automatic Piano Music Transcription. Proceedings of the International Conference on New Interfaces for Musical Expression. DOI: 10.5281/zenodo.1179002

BibTeX Entry

@inproceedings{makbari2015,
 abstract = {One important problem in Musical Information Retrieval is Automatic Music Transcription, which is an automated conversion process from played music to a symbolic notation such as sheet music. Since the accuracy of previous audio-based transcription systems is not satisfactory, we propose an innovative visual-based automatic music transcription system named claVision to perform piano music transcription. Instead of processing the music audio, the system performs the transcription only from the video performance captured by a camera mounted over the piano keyboard. claVision can be used as a transcription tool, but it also has other applications such as music education. The claVision software has a very high accuracy (over 95\%) and a very low latency in real-time music transcription, even under different illumination conditions.},
 address = {Baton Rouge, Louisiana, USA},
 author = {Mohammad Akbari and Howard Cheng},
 booktitle = {Proceedings of the International Conference on New Interfaces for Musical Expression},
 doi = {10.5281/zenodo.1179002},
 editor = {Edgar Berdahl and Jesse Allison},
 issn = {2220-4806},
 month = {May},
 pages = {313--314},
 publisher = {Louisiana State University},
 title = {{claVision}: Visual Automatic Piano Music Transcription},
 url = {http://www.nime.org/proceedings/2015/nime2015_105.pdf},
 urlsuppl1 = {http://www.nime.org/proceedings/2015/105/0105-file1.avi},
 year = {2015}
}