Adversarial Quantum Machine Learning: An Information-Theoretic Generalization Analysis

Georgiou, Petros; Jose, Sharu Theresa; Simeone, Osvaldo

Full-text links:

Download:

Current browse context:

quant-ph

< prev | next >

new | recent | 2402

Quantum Physics

Title: Adversarial Quantum Machine Learning: An Information-Theoretic Generalization Analysis

Authors: Petros Georgiou, Sharu Theresa Jose, Osvaldo Simeone

(Submitted on 31 Jan 2024 (v1), last revised 15 Feb 2024 (this version, v2))

Abstract: In a manner analogous to their classical counterparts, quantum classifiers are vulnerable to adversarial attacks that perturb their inputs. A promising countermeasure is to train the quantum classifier by adopting an attack-aware, or adversarial, loss function. This paper studies the generalization properties of quantum classifiers that are adversarially trained against bounded-norm white-box attacks. Specifically, a quantum adversary maximizes the classifier's loss by transforming an input state $\rho(x)$ into a state $\lambda$ that is $\epsilon$-close to the original state $\rho(x)$ in $p$-Schatten distance. Under suitable assumptions on the quantum embedding $\rho(x)$, we derive novel information-theoretic upper bounds on the generalization error of adversarially trained quantum classifiers for $p = 1$ and $p = \infty$. The derived upper bounds consist of two terms: the first is an exponential function of the 2-R\'enyi mutual information between classical data and quantum embedding, while the second term scales linearly with the adversarial perturbation size $\epsilon$. Both terms are shown to decrease as $1/\sqrt{T}$ over the training set size $T$ . An extension is also considered in which the adversary assumed during training has different parameters $p$ and $\epsilon$ as compared to the adversary affecting the test inputs. Finally, we validate our theoretical findings with numerical experiments for a synthetic setting.

Comments:	10 pages, 2 figures. Fixed a typo (wrong inequality sign) in lemma 2 and extended to cover the whole range of values of p. Added reference on inequalities in trace norms
Subjects:	Quantum Physics (quant-ph); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
Cite as:	arXiv:2402.00176 [quant-ph]
	(or arXiv:2402.00176v2 [quant-ph] for this version)

Submission history

From: Petros Georgiou [view email]
[v1] Wed, 31 Jan 2024 21:07:43 GMT (180kb,D)
[v2] Thu, 15 Feb 2024 13:18:04 GMT (181kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> quant-ph > arXiv:2402.00176

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Quantum Physics

Title: Adversarial Quantum Machine Learning: An Information-Theoretic Generalization Analysis

Submission history