Quantum Cluster Algorithm for Data classification

Junxu Li, Sabre Kais
DOI: https://doi.org/10.48550/arXiv.2106.07078
2021-06-14
Abstract: We present a quantum algorithm for data classification based on the nearest-neighbor learning algorithm. The classification algorithm has two steps. First, data in the same class is divided into smaller groups with sublabels, which help build boundaries between data with different labels. Second, we construct a quantum circuit for classification that contains multi-control gates. The algorithm is easy to implement and efficient in predicting the labels of test data. To illustrate the power and efficiency of this approach, we construct the phase transition diagram for the metal-insulator transition of $VO_2$ using limited experimental training data; $VO_2$ is a typical strongly correlated electron material, and its metal-insulator phase transition has drawn much attention in condensed matter physics. Moreover, we demonstrate our algorithm on the classification of randomly generated data and the classification of entanglement for various Werner states, where the training sets cannot be divided by a single curve; instead, more than one curve is required to separate them perfectly. Our preliminary results show considerable potential for various classification problems, particularly for constructing different phases in materials.
Quantum Physics
What problem does this paper attempt to address?
The paper aims to develop a quantum data classification algorithm, particularly for data sets whose classes cannot be separated by a single curve. Specifically, the authors propose a two-step quantum classification algorithm:

1. **Data Grouping and Sub-label Generation**:
   - First, divide the data of the same category into smaller groups and assign sub-labels that help construct the boundaries between different categories of data.
   - Use Lloyd's algorithm for initial clustering, then apply two adjustment algorithms to reduce redundant sub-labels and ensure that groups with different main labels do not overlap.
2. **Quantum Circuit Construction**:
   - Based on the information obtained from the learning process above, construct a quantum classification circuit containing multi-control gates.
   - Predict the label of new data by measuring the inner product between the new data and the center vector of each sub-label.

The algorithm aims to improve classification of data with complex distributions, and it shows particular potential for scientific problems such as constructing the different phases of materials. The paper demonstrates the algorithm in the following application scenarios:

- **Classification of the Metal-Insulator Phase Transition in Vanadium Dioxide (VO₂)**: Using limited experimental training data, the phase diagram of VO₂, a typical strongly correlated electron material, is constructed.
- **Classification of Randomly Generated Data**: For data sets that cannot be perfectly separated by a single curve, the algorithm also shows good classification performance.
- **Entanglement Classification of Werner States**: By analyzing four-dimensional vectors (four correlation functions calculated from measurement results), it distinguishes between entangled and non-entangled states.
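The two steps described above can be sketched classically. The following is a minimal NumPy illustration, not the authors' implementation: it runs plain Lloyd's algorithm inside each class with a fixed number of sub-labels, omits the two adjustment algorithms, and replaces the quantum circuit with a direct inner product between normalized vectors (assuming the data would be encoded as unit vectors). The names `lloyd`, `fit_sublabels`, `predict`, and `on_circle`, and the toy data, are hypothetical.

```python
import numpy as np

def lloyd(points, k, iters=100, seed=0):
    """Plain Lloyd's algorithm (k-means); returns k cluster centers."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        # Assign every point to its nearest center.
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        assign = dists.argmin(axis=1)
        # Move each center to the mean of its points (keep it if it has none).
        updated = np.array([
            points[assign == j].mean(axis=0) if np.any(assign == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(updated, centers):
            break
        centers = updated
    return centers

def fit_sublabels(X, y, k_per_class=2):
    """Split each class into sub-clusters; remember each center's main label."""
    centers, main_labels = [], []
    for label in np.unique(y):
        class_centers = lloyd(X[y == label], k_per_class)
        centers.append(class_centers)
        main_labels.extend([label] * len(class_centers))
    return np.vstack(centers), np.array(main_labels)

def predict(x, centers, main_labels):
    """Return the main label of the center with the largest overlap with x."""
    unit = lambda v: v / np.linalg.norm(v, axis=-1, keepdims=True)
    overlaps = unit(centers) @ unit(x)  # classical stand-in for the measured inner product
    return main_labels[overlaps.argmax()]

def on_circle(angles):
    """Toy data: points on the unit circle at the given angles (radians)."""
    a = np.asarray(angles)
    return np.column_stack([np.cos(a), np.sin(a)])

# Class 0 sits on both sides of class 1, so one center per class is not enough.
X = on_circle([0.1, 0.15, 0.2, 2.9, 2.95, 3.0, 1.5, 1.55, 1.6, 1.65])
y = np.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1])
centers, main_labels = fit_sublabels(X, y, k_per_class=2)
print(predict(on_circle([0.3])[0], centers, main_labels))
```

In this toy set, a single center for class 0 would point roughly toward class 1, which is the situation sub-labels are meant to handle. Because the vectors are normalized, only direction matters here; zero vectors are not handled.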
Overall, this research provides a new quantum machine-learning method that classifies data without relying on a gradient-based optimization process. This may be useful when dealing with large-scale scientific data sets.
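For the Werner-state example mentioned above, training labels (entangled or not) can be generated numerically. The sketch below is an assumption-laden illustration rather than the paper's procedure: it uses the common two-qubit parameterization $\rho = p\,|\psi^-\rangle\langle\psi^-| + (1-p)\,I/4$ and the positive-partial-transpose (PPT) criterion, which is necessary and sufficient for two qubits. It does not compute the four correlation functions used as features, since the summary does not specify them. The function names are hypothetical.

```python
import numpy as np

def werner_state(p):
    """Two-qubit Werner state: p |psi-><psi-| + (1 - p) I/4 (assumed parameterization)."""
    psi_minus = np.array([0.0, 1.0, -1.0, 0.0]) / np.sqrt(2)
    return p * np.outer(psi_minus, psi_minus) + (1 - p) * np.eye(4) / 4

def is_entangled(rho, tol=1e-12):
    """PPT test: a two-qubit state is entangled iff its partial transpose
    has a negative eigenvalue."""
    r = rho.reshape(2, 2, 2, 2)                  # indices (a, b, a', b')
    pt = r.transpose(0, 3, 2, 1).reshape(4, 4)   # swap b and b' (second qubit)
    return np.linalg.eigvalsh(pt).min() < -tol

# Label a few Werner states; the smallest partial-transpose eigenvalue is (1 - 3p)/4.
for p in (0.2, 0.5, 0.9):
    print(p, is_entangled(werner_state(p)))
```

With this parameterization the state is entangled exactly when $p > 1/3$, so the labels change at a single threshold in $p$ even though the feature vectors seen by the classifier are four-dimensional.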