Low Rank Affine Adversarial Attacks on Image Classifiers

Aizhan Issagali, Catherine F. Higham, Desmond J. Higham, Francesco Tudisco,
Springer INdAM Series, (2026)

Abstract

Neural networks for image classification are vulnerable to adversarial attacks; an imperceptible perturbation to an image can cause a change in classification. Standard attack algorithms use explicit or approximate partial derivative information with respect to the input data. Here, we explore the idea of using a less expensive, universal affine surrogate. We find that this approach can match, or even outperform, a traditional gradient-based algorithm. Training the affine attack model leads us naturally towards transformations that are close to low rank, reflecting the structure of the problem. Truncating to a precisely low rank transformation does not degrade the performance of the model.

This paper is to appear in the Springer INdAM Series, following the INdAM Workshop on Low-Rank Structures and Numerical Methods in Matrix and Tensor Computations.

Keywords: low-rank image classifiers adversarial robustness deep learning neural networks