国際会議
-
Inference-Time Structured Pruning for Real-Time Neural Network Audio Effects
- 著者
- C. J. Clarke and J. Chowdhury
- 会議名
- DAFx25
- 発行年
- 2025
To appear.Abstract
Structured pruning is a technique for reducing the computational load and memory footprint of neural networks by removing structured subsets of parameters according to a predefined schedule or ranking criterion. This paper investigates the application of structured pruning to real-time neural network audio effects, focusing on both feedforward networks and recurrent architectures. We evaluate multiple pruning strategies at inference time, without retraining, and analyze their effects on model performance. To quantify the trade-off between parameter count and audio fidelity, we construct a theoretical model of the approximation error as a function of network architecture and pruning level. The resulting bounds establish a principled relationship between pruning-induced sparsity and functional error, enabling informed deployment of neural audio effects in constrained real-time environments.