20
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Improving Perceptual Quality of Drum Transcription with the Expanded Groove MIDI Dataset

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Classifier metrics, such as accuracy and F-measure score, often serve as proxies for performance in downstream tasks. For the case of generative systems that use predicted labels as inputs, accuracy is a good proxy only if it aligns with the perceptual quality of generated outputs. Here, we demonstrate this effect using the example of automatic drum transcription (ADT). We optimize classifiers for downstream generation by predicting expressive dynamics (velocity) and show with listening tests that they produce outputs with improved perceptual quality, despite achieving similar results on classification metrics. To train expressive ADT models, we introduce the Expanded Groove MIDI dataset (E-GMD), a large dataset of human drum performances, with audio recordings annotated in MIDI. E-GMD contains 444 hours of audio from 43 drum kits and is an order of magnitude larger than similar datasets. It is also the first human-performed drum dataset with annotations of velocity. We make this new dataset available under a Creative Commons license along with open source code for training and a pre-trained model for inference.

          Related collections

          Author and article information

          Journal
          31 March 2020
          Article
          2004.00188
          0056c251-1b34-4fbd-885a-51ca3d452511

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          Examples available at https://goo.gl/magenta/e-gmd-examples
          cs.SD cs.LG

          Artificial intelligence,Graphics & Multimedia design
          Artificial intelligence, Graphics & Multimedia design

          Comments

          Comment on this article