The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Signal Process.
Sec. Audio and Acoustic Signal Processing
Volume 4 - 2024 |
doi: 10.3389/frsip.2024.1494864
Sine, Transient, Noise Neural Modeling of Piano Notes
Provisionally accepted- University of Oslo, Oslo, Norway
This article introduces a novel method for emulating piano sounds. We propose to exploit the sine, transient, and noise decomposition to design a differentiable spectral modeling synthesizer replicating piano notes. Three sub-modules learn these components from piano recordings and generate the corresponding harmonic, transient, and noise signals. Splitting the emulation into three independently trainable models reduces the modeling tasks' complexity. The quasi-harmonic content is produced using a differentiable sinusoidal model guided by physics-derived formulas, whose parameters are automatically estimated from audio recordings. The noise sub-module uses a learnable time-varying filter, and the transients are generated using a deep convolutional network. From singular notes, we emulate the coupling between different keys in trichords with a convolutional-based network. Results show the model matches the partial distribution of the target while predicting the energy in the higher part of the spectrum presents more challenges.The energy distribution in the spectra of the transient and noise components is accurate overall.While the model is more computationally and memory efficient, perceptual tests reveal limitations in accurately modeling the attack phase of notes. Despite this, it generally achieves perceptual accuracy in emulating single notes and trichords.
Keywords: Sound source modeling, physics-informed modeling, Acoustic modeling, deep learning, Piano synthesis, differentiable digital signal processing
Received: 11 Sep 2024; Accepted: 30 Dec 2024.
Copyright: © 2024 Simionato and Fasciani. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Riccardo Simionato, University of Oslo, Oslo, Norway
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.