In this problem set you will prove an exponential separation between single-copy
and two-copy algorithms for purity testing. Each exercise builds
on the last; proofs of supporting lemmas are provided, while the key arguments
are left for you.
Part II. Establish that any single-copy algorithm requires \(\Omega(2^{n/2})\) copies — an exponential separation.
Purity Testing with Two-Copy Measurements
Recall that the swap test on two copies of \(\rho\) accepts with probability
Prove that Purity can be tested with \(O(1/\epsilon)\) copies of \(\rho\).
(a) Completeness. Show that if \(\rho\) is pure, the swap test accepts with certainty.
(b) Soundness. Suppose \(\rho\) is \(\epsilon\)-far from every pure state. Show \(\operatorname{tr}(\rho^2) \le 1-\epsilon\), and conclude \(p_{\mathrm{acc}} \le 1-\epsilon/2\).
(c) Repetition. By running \(T\) independent swap tests (accepting iff all pass), show \(T = O(1/\epsilon)\) suffices for error \(\le 1/3\).
Write \(\rho = \sum_i \lambda_i |\psi_i\rangle\!\langle\psi_i|\) with \(\lambda_1 \ge \lambda_2 \ge \cdots\). The closest pure state is \(|\psi_1\rangle\), so the distance assumption gives \(\lambda_1 \le 1-\epsilon\). Then use \(\sum_i \lambda_i^2 \le \max_i \lambda_i \cdot \sum_j \lambda_j\).
Each test accepts with probability \(\le 1-\epsilon/2\), so all \(T\) accept with probability \(\le (1-\epsilon/2)^T \le e^{-\epsilon T/2}\). Set \(\le 1/3\).
A Matching Lower Bound
We now show \(\Omega(1/\epsilon)\) copies are necessary.
Prove that testing purity requires \(T = \Omega(1/\epsilon)\) copies.
(a) Consider \(\rho_0 = |0\rangle\!\langle 0|\) and \(\rho_\epsilon = (1-\epsilon)|0\rangle\!\langle 0| + \epsilon|1\rangle\!\langle 1|\). Verify \(\rho_\epsilon\) is \(\epsilon\)-far from \(\rho_0\), so any purity tester distinguishes \(\rho_0^{\otimes T}\) from \(\rho_\epsilon^{\otimes T}\) with probability \(\ge 2/3\).
(b) Using the Holevo–Helstrom theorem and the Fuchs–van de Graaf inequality, show
You may use that fidelity is multiplicative and \(\mathrm{F}(\rho_0,\rho_\epsilon)^2 = 1-\epsilon\).
(c) Combine (a) and (b) to conclude \(T \ge \Omega(1/\epsilon)\).
Holevo–Helstrom: \(p_{\mathrm{succ}} \le \tfrac{1}{2}(1+d_{\mathrm{tr}})\). Fuchs–van de Graaf: \(d_{\mathrm{tr}} \le \sqrt{1-F^2}\). Multiplicativity: \(F^2=(1-\epsilon)^T\). Impose \(p_{\mathrm{succ}}\ge 2/3\) to get \((1-\epsilon)^T\le 8/9\), then take logs.
Exponential Lower Bound for Single-Copy Testers
Any single-copy algorithm for purity testing on \(n\)-qubit states (\(d=2^n\)) must use \(\Omega(d^{1/2})=\Omega(2^{n/2})\) copies. The key ingredient is a lower bound on the permanent of a Gram matrix.
Permanents of PSD Matrices
Let \(\{|\psi_t\rangle\}_{t=1}^T\) be pure states in \(\mathbb{C}^d\) with Gram matrix \(G_{tt'}=\langle\psi_t|\psi_{t'}\rangle\). Using Lemma 3, prove:
Lemma 3 gives \(\mathrm{per}(G)\ge\frac{1}{T}\|G\|_F^2\). Since \(\|G\|_F^2 = T + \sum_{i\ne j}|\langle\psi_i|\psi_j\rangle|^2 \ge T\), we get \(\mathrm{per}(G)\ge 1\). An orthonormal set shows tightness.
Let \(r=\mathrm{rank}(G)\le d\). Cauchy–Schwarz: \(T^2=(\sum_k\lambda_k)^2\le r\sum_k\lambda_k^2 = r\|G\|_F^2\le d\|G\|_F^2\). Hence \(\|G\|_F^2\ge T^2/d\), giving \(\mathrm{per}(G)\ge T/d\).
From Permanents to Permutation Overlaps
The action of \(P_d(\pi)\) permutes tensor factors:
where \(G\) is the Gram matrix. Since \(G\) is PSD with unit diagonal, Exercise 3 gives \(\mathrm{per}(G)\ge 1\). ■
The Main Event
Prove Proposition 5 following this outline:
(a) Set up the likelihood ratio. Consider distinguishing \(\rho=\mathbb{I}/d\) from \(\rho=|v\rangle\!\langle v|\) with \(|v\rangle\) Haar-random. After \(T\) adaptive single-copy measurements producing outcome string \(\ell\), show
(b) Apply the Haar integral. Using the identity
and Lemma 4, show
(c) Bound the product. Show the RHS is at least \(1 - \frac{T(T-1)}{2d}\).
(d) Conclude. Via Le Cam's method, deduce \(d_{\mathrm{TV}}\le O(T^2/d)\) and \(T\ge\Omega(d^{1/2})\).
Take logs: \(-\sum_{t=0}^{T-1}\log(1+t/d)\ge -\sum_{t=0}^{T-1}t/d = -T(T-1)/(2d)\). Then \(e^{-x}\ge 1-x\).
Le Cam's one-sided method: if the likelihood ratio is \(\ge 1-\delta\) for all \(\ell\), then \(d_{\mathrm{TV}}\le\delta\). A successful algorithm needs \(\Omega(1)\le d_{\mathrm{TV}}\), forcing \(T^2/d=\Omega(1)\).
This problem set accompanies the MATH 595 lecture notes. If you find errors or have questions, please reach out.