Conference Papers
Inference-Time Structured Pruning for Real-Time Neural Network Audio Effects
- Author(s)
- C. J. Clarke and J. Chowdhury
- Conference
- DAFx25
- Publication Year
- 2025
To appear.
Abstract
Structured pruning is a technique for reducing the computational load and memory footprint of neural networks by removing structured subsets of parameters according to a predefined schedule or ranking criterion. This paper investigates the application of structured pruning to real-time neural network audio effects, focusing on both feedforward networks and recurrent architectures. We evaluate multiple pruning strategies at inference time, without retraining, and analyze their effects on model performance. To quantify the trade-off between parameter count and audio fidelity, we construct a theoretical model of the approximation error as a function of network architecture and pruning level. The resulting bounds establish a principled relationship between pruning-induced sparsity and functional error, enabling informed deployment of neural audio effects in constrained real-time environments.
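As a rough illustration of what inference-time structured pruning can look like, and not the authors' method, the sketch below removes whole hidden units from a small two-layer feedforward network without any retraining. The ranking criterion (the L2 norm of each unit's incoming weights), the function names `prune_hidden_units` and `forward`, and the toy weights are all assumptions made for this example; the paper evaluates its own strategies and architectures.

```python
import math


def prune_hidden_units(w1, b1, w2, keep_ratio):
    """Drop the lowest-ranked hidden units of a two-layer network.

    w1: hidden x inputs weights, b1: hidden biases, w2: outputs x hidden weights.
    Ranking criterion (assumed for this sketch): L2 norm of each unit's
    incoming weight row. No retraining or fine-tuning is performed.
    """
    n_hidden = len(w1)
    n_keep = max(1, round(keep_ratio * n_hidden))
    scores = [math.sqrt(sum(w * w for w in row)) for row in w1]
    # Pick the n_keep highest-scoring units, then restore their original order.
    ranked = sorted(range(n_hidden), key=lambda i: scores[i], reverse=True)
    keep = sorted(ranked[:n_keep])
    # Removing a unit deletes a row of w1, an entry of b1, and a column of w2,
    # so the pruned network is genuinely smaller rather than merely sparse.
    w1_pruned = [w1[i] for i in keep]
    b1_pruned = [b1[i] for i in keep]
    w2_pruned = [[row[i] for i in keep] for row in w2]
    return w1_pruned, b1_pruned, w2_pruned


def forward(w1, b1, w2, x):
    """Evaluate y = w2 * tanh(w1 * x + b1) for a single input vector x."""
    hidden = [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(w1, b1)
    ]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


# Toy weights (hypothetical): units 1 and 3 contribute very little.
w1 = [[1.0, 0.0], [0.01, 0.0], [0.0, 2.0], [0.0, 0.02]]
b1 = [0.0, 0.0, 0.0, 0.0]
w2 = [[1.0, 1.0, 1.0, 1.0]]

w1_p, b1_p, w2_p = prune_hidden_units(w1, b1, w2, keep_ratio=0.5)

x = [0.5, -0.3]
full = forward(w1, b1, w2, x)[0]
pruned = forward(w1_p, b1_p, w2_p, x)[0]
print(f"hidden units: {len(w1)} -> {len(w1_p)}")
print(f"absolute output error: {abs(full - pruned):.6f}")
```

Because entire units are removed, both the matrix dimensions and the per-sample multiply count shrink, which is the property that matters under a real-time budget. Comparing the full and pruned outputs on a test input gives an empirical counterpart to the kind of sparsity-versus-error relationship the abstract describes, though it is not a substitute for the paper's theoretical bounds.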