Neural Network-based Quantization for Network Automation

Marton Kajo,Stephen S. Mwanje,Benedek Schultz,Georg Carle
DOI: https://doi.org/10.48550/arXiv.2103.04764
2021-03-04
Abstract:Deep Learning methods have been adopted in mobile networks, especially for network management automation where they provide means for advanced machine cognition. Deep learning methods utilize cutting-edge hardware and software tools, allowing complex cognitive algorithms to be developed. In a recent paper, we introduced the Bounding Sphere Quantization (BSQ) algorithm, a modification of the k-Means algorithm, that was shown to create better quantizations for certain network management use-cases, such as anomaly detection. However, BSQ required a significantly longer time to train than k-Means, a challenge which can be overcome with a neural network-based implementation. In this paper, we present such an implementation of BSQ that utilizes state-of-the-art deep learning tools to achieve a competitive training speed.
Machine Learning
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to increase the training speed of the Bounding Sphere Quantization (BSQ) algorithm, making it more practical in real - world applications. Specifically, BSQ is an improved k - Means quantization algorithm and shows better performance in some network management tasks (such as anomaly detection), but its training time is significantly longer than that of the traditional k - Means algorithm. To solve this problem, the author proposes a BSQ algorithm implemented based on neural networks, using modern deep - learning tools (such as PyTorch and TensorFlow) to accelerate the training process. ### Background and Problem Description of the Paper 1. **Deep Learning in Network Management**: - Deep - learning methods have been widely used in mobile network management automation, especially in tasks requiring advanced machine cognition. - However, network management tasks usually involve semi - supervised or unsupervised learning scenarios, where the expected output is not available in the training data. Therefore, machine learning is required to provide intuition / proxy capabilities, enabling the system to deal with unforeseen situations. 2. **Quantization Algorithms**: - Quantization (clustering) is one of the unsupervised learning tasks. Traditional methods such as k - Means are still widely used, but these methods are difficult to integrate into modern deep - learning frameworks and cannot fully utilize hardware acceleration. - BSQ optimizes quantization by minimizing the maximum distance of each cluster, which makes it perform better than k - Means in some tasks but has a longer training time. 3. **Neural Network Implementation**: - To overcome the problem of the overly long BSQ training time, the author proposes an implementation method based on neural networks, rewriting the k - Means and BSQ algorithms into forms suitable for modern deep - learning frameworks. - By using Stochastic Gradient Descent (SGD) instead of the traditional Expectation - Maximization (EM) algorithm and combining cross - batch accumulation techniques, the author achieves a faster training speed. ### Formulas and Technical Details - **Distance Calculation Layer**: \[ D' = B' - Q^T' \] where \(B\) is a tensor of shape \((n, d)\) containing training points; \(Q\) is a tensor of shape \((k, d)\) containing quantum center points. Through the broadcasting operation, the distances between all training points and quantum center points can be calculated: \[ D=\text{p - norm}(D', d) \] - **Distance Selection Layer**: The distance from each training point to the nearest quantum center point is selected through the mask tensor \(M\): \[ D_m = D\odot M \] - **Cross - batch Accumulation**: For BSQ, the results of multiple batches need to be accumulated to ensure that the global farthest point is found. For k - Means, only the average value of each batch needs to be calculated: \[ T[i]=\frac{W[i]\cdot T[i]+W'[i]\cdot T'[i]}{W[i]+W'[i]} \] where \(T\) and \(W\) are the target tensor and weight tensor respectively, and \(T'\) and \(W'\) are the results of the current batch. ### Conclusion The author verifies the effectiveness of the neural - network - based BSQ and k - Means implementations through experiments, especially the significant improvement in training speed on large - scale data sets. This implementation not only improves training efficiency but also simplifies integration with existing deep - learning frameworks, providing strong support for future network management automation.