Low Rank Affine Adversarial Attacks on Image Classifiers
Aizhan Issagali,
Catherine F. Higham,
Desmond J. Higham,
Francesco Tudisco,
Springer INdAM Series,
(2026)
Abstract
Neural networks for image classification are vulnerable to adversarial attacks; an imperceptible perturbation to an image can cause a change in classification. Standard attack algorithms use explicit or approximate partial derivative information with respect to the input data. Here, we explore the idea of using a less expensive, universal affine surrogate. We find that this approach can match, or even outperform, a traditional gradient-based algorithm. Training the affine attack model leads us naturally towards transformations that are close to low rank, reflecting the structure of the problem. Truncating to a precisely low rank transformation does not degrade the performance of the model.
This paper is to appear in the Springer INdAM Series, following the INdAM Workshop on Low-Rank Structures and Numerical Methods in Matrix and Tensor Computations.
Keywords:
low rank transformations
image classifiers
adversarial robustness
deep learning
neural networks