ad_hoc_data

ad_hoc_data(training_size, test_size, n, gap, plot_data=False, one_hot=True, include_sample_total=False)

Generates a toy dataset that can be fully separated with ZZFeatureMap according to the procedure outlined in [1]. To construct the dataset, we first sample uniformly distributed vectors $\vec{x} \in (0, 2 π]^{n}$ and apply the feature map

| Φ (\vec{x}) ⟩ = U_{Φ (\vec{x})} H^{\otimes n} U_{Φ (\vec{x})} H^{\otimes n} | 0^{\otimes n} ⟩

where

U_{Φ (\vec{x})} = \exp (i \sum_{S \subseteq [n]} ϕ_{S} (\vec{x}) \prod_{i \in S} Z_{i})

and

\begin{cases} ϕ_{\{i, j\}} = (π - x_{i}) (π - x_{j}) \\ ϕ_{\{i\}} = x_{i} \end{cases}
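As a rough illustration of the construction above, the following NumPy sketch builds $| Φ (\vec{x}) ⟩$ for n = 2 directly from the formulas as written. It is not the library's implementation: the helper name `feature_state`, the basis ordering, and the omission of the empty-set term (a global phase) are assumptions made for this example, and the library's circuit may use different phase conventions.

```python
import numpy as np


def feature_state(x):
    """Return |Phi(x)> for n = 2 as a length-4 complex vector (illustrative helper)."""
    x1, x2 = x
    # Eigenvalues of Z_1 and Z_2 on the basis states |00>, |01>, |10>, |11>.
    z1 = np.array([1, 1, -1, -1])
    z2 = np.array([1, -1, 1, -1])
    # Exponent of U_Phi: phi_{1} Z_1 + phi_{2} Z_2 + phi_{1,2} Z_1 Z_2.
    phase = x1 * z1 + x2 * z2 + (np.pi - x1) * (np.pi - x2) * z1 * z2
    u_diag = np.exp(1j * phase)  # U_Phi is diagonal in the computational basis

    h = np.array([[1, 1], [1, -1]]) / np.sqrt(2)
    h2 = np.kron(h, h)  # Hadamard on both qubits

    state = np.zeros(4, dtype=complex)
    state[0] = 1.0  # |00>
    # Apply H, U_Phi, H, U_Phi in order (right to left in the formula).
    state = u_diag * (h2 @ state)
    state = u_diag * (h2 @ state)
    return state


rng = np.random.default_rng(seed=0)
x = 2 * np.pi * (1 - rng.random(2))  # uniform sample from (0, 2*pi]^2
phi = feature_state(x)
print(np.linalg.norm(phi))  # should be 1 up to rounding, since every step is unitary
```

Because $U_{Φ (\vec{x})}$ only contains products of $Z$ operators, it is diagonal, so the sketch multiplies by a vector of phases instead of building a 4x4 matrix.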

We then attribute labels to the vectors according to the rule

m (\vec{x}) = \begin{cases} 1 & ⟨ Φ (\vec{x}) | V^{†} \prod_{i} Z_{i} V | Φ (\vec{x}) ⟩ > Δ \\ - 1 & ⟨ Φ (\vec{x}) | V^{†} \prod_{i} Z_{i} V | Φ (\vec{x}) ⟩ < - Δ \end{cases}

where $Δ$ is the separation gap, and $V \in SU (4)$ is a random unitary.
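The labelling rule can be sketched in NumPy for n = 2 as follows. This is a minimal illustration under stated assumptions, not the function's actual code: `feature_state`, `random_su4`, and `label` are hypothetical helpers, the random unitary is drawn via a QR decomposition and rescaled to unit determinant, and points whose expectation value falls inside the gap (which the rule above leaves unlabelled) are marked with 0 purely for the example.

```python
import numpy as np


def feature_state(x):
    """|Phi(x)> for n = 2, built from the formulas as written (illustrative)."""
    x1, x2 = x
    z1 = np.array([1, 1, -1, -1])
    z2 = np.array([1, -1, 1, -1])
    u_diag = np.exp(1j * (x1 * z1 + x2 * z2 + (np.pi - x1) * (np.pi - x2) * z1 * z2))
    h = np.array([[1, 1], [1, -1]]) / np.sqrt(2)
    h2 = np.kron(h, h)
    state = np.zeros(4, dtype=complex)
    state[0] = 1.0
    state = u_diag * (h2 @ state)
    return u_diag * (h2 @ state)


def random_su4(rng):
    """Draw a random 4x4 unitary and rescale it to determinant 1."""
    z = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
    q, r = np.linalg.qr(z)
    d = np.diag(r)
    q = q * (d / np.abs(d))  # fix the column phases left free by QR
    return q / np.linalg.det(q) ** 0.25  # divide out a fourth root of det(q)


def label(x, v, gap):
    """Return +1 or -1 per the rule, or 0 if the point lies inside the gap."""
    parity = np.diag([1.0, -1.0, -1.0, 1.0])  # Z_1 Z_2 in the computational basis
    observable = v.conj().T @ parity @ v
    phi = feature_state(x)
    value = np.real(np.vdot(phi, observable @ phi))
    if value > gap:
        return 1
    if value < -gap:
        return -1
    return 0  # unlabelled under the rule; 0 is a placeholder for this sketch


rng = np.random.default_rng(seed=7)
v = random_su4(rng)
points = 2 * np.pi * (1 - rng.random((5, 2)))  # samples from (0, 2*pi]^2
labels = [label(x, v, gap=0.3) for x in points]
print(labels)
```

A larger gap rejects more points, so more samples must be drawn to fill each class.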

The current implementation only works with n = 2 or 3.

References:

[1] Havlíček V, Córcoles AD, Temme K, Harrow AW, Kandala A, Chow JM, Gambetta JM. Supervised learning with quantum-enhanced feature spaces. Nature. 2019 Mar;567(7747):209-12. arXiv:1804.11326

Parameters:

training_size (int) – the number of training samples.
test_size (int) – the number of testing samples.
n (int) – number of qubits (dimension of the feature space). Must be 2 or 3.
gap (int) – separation gap ( $Δ$ ).
plot_data (bool) – whether to plot the data. Requires matplotlib.
one_hot (bool) – if True, return the data in one-hot format.
include_sample_total (bool) – if True, return all points in the uniform grid in addition to training and testing samples.
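To illustrate what a one-hot label format looks like, the sketch below converts labels in the ±1 convention of the rule above into two-column indicator rows. The mapping of +1 to the first column and -1 to the second is an assumption made for this example; the column order and label values actually returned by the function may differ.

```python
import numpy as np

# Labels in the +1 / -1 convention of the labelling rule.
m = np.array([1, -1, -1, 1])

# Map +1 -> class index 0 and -1 -> class index 1 (assumed ordering).
class_index = (1 - m) // 2

# Each row of the identity matrix is the one-hot vector of one class.
one_hot = np.eye(2, dtype=int)[class_index]
print(one_hot)
# [[1 0]
#  [0 1]
#  [0 1]
#  [1 0]]
```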

Returns:

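Training features, training labels, test features, and test labels, in that order. If include_sample_total is True, a fifth array holding all points in the uniform grid is appended.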

Raises:

ValueError – if n is not 2 or 3.

Return type:

Tuple[ndarray, ndarray, ndarray, ndarray] | Tuple[ndarray, ndarray, ndarray, ndarray, ndarray]