claVision: Visual Automatic Piano Music Transcription

Mohammad Akbari and Howard Cheng

Proceedings of the International Conference on New Interfaces for Musical Expression

Abstract:

One important problem in Music Information Retrieval is Automatic Music Transcription, which is an automated conversion process from played music to a symbolic notation such as sheet music. Since the accuracy of previous audio-based transcription systems is not satisfactory, we propose an innovative visual-based automatic music transcription system named claVision to perform piano music transcription. Instead of processing the music audio, the system performs the transcription only from the video performance captured by a camera mounted over the piano keyboard. claVision can be used as a transcription tool, but it also has other applications such as music education. The claVision software has a very high accuracy (over 95%) and a very low latency in real-time music transcription, even under different illumination conditions.

Citation:

Mohammad Akbari and Howard Cheng. 2015. claVision: Visual Automatic Piano Music Transcription. Proceedings of the International Conference on New Interfaces for Musical Expression. DOI: 10.5281/zenodo.1179002

BibTeX Entry:

@inproceedings{makbari2015,
 abstract = {One important problem in Music Information Retrieval is Automatic Music Transcription, which is an automated conversion process from played music to a symbolic notation such as sheet music. Since the accuracy of previous audio-based transcription systems is not satisfactory, we propose an innovative visual-based automatic music transcription system named claVision to perform piano music transcription. Instead of processing the music audio, the system performs the transcription only from the video performance captured by a camera mounted over the piano keyboard. claVision can be used as a transcription tool, but it also has other applications such as music education. The claVision software has a very high accuracy (over 95%) and a very low latency in real-time music transcription, even under different illumination conditions.},
 address = {Baton Rouge, Louisiana, USA},
 author = {Mohammad Akbari and Howard Cheng},
 booktitle = {Proceedings of the International Conference on New Interfaces for Musical Expression},
 doi = {10.5281/zenodo.1179002},
 editor = {Edgar Berdahl and Jesse Allison},
 issn = {2220-4806},
 month = {May},
 pages = {313--314},
 publisher = {Louisiana State University},
 title = {claVision: Visual Automatic Piano Music Transcription},
 url = {http://www.nime.org/proceedings/2015/nime2015_105.pdf},
 urlsuppl1 = {http://www.nime.org/proceedings/2015/105/0105-file1.avi},
 year = {2015}
}