Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification

Junbo Jacob Lian
2024-11-30
Abstract:Twisted Convolutional Networks (TCNs) are introduced as a novel neural network architecture designed to effectively process one-dimensional data with arbitrary feature order and minimal spatial relationships. Unlike traditional Convolutional Neural Networks (CNNs), which excel at handling structured two-dimensional data like images, TCNs reduce dependency on feature order by combining input features in innovative ways to create new representations. By explicitly enhancing feature interactions and employing diverse feature combinations, TCNs generate richer and more informative representations, making them especially effective for classification tasks on datasets with arbitrary feature arrangements. This paper details the TCN architecture and its feature combination strategy, providing a comprehensive comparison with traditional CNNs, DeepSets, Transformers, and Graph Neural Networks (GNNs). Extensive experiments on benchmark datasets demonstrate that TCNs achieve superior performance, particularly in classification scenarios involving one-dimensional data.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the limitations of traditional convolutional neural networks (CNNs) when processing non - spatial data. Specifically, CNNs rely heavily on the spatial order of input features, which makes them perform poorly when dealing with data without a clear spatial structure or feature relationships. For example, in application scenarios such as gene expression data, customer demographic information, and sensor readings, the relationships between features are not strictly spatial or sequential, and the order of feature arrangement may not carry important information. To overcome these limitations, the paper proposes a new neural network architecture - Twisted Convolutional Networks (TCNs). The main goal of TCNs is to combine input features in an innovative way, reduce the dependence on feature order, and generate richer and more informative feature representations. This makes TCNs particularly effective when dealing with data sets with arbitrary feature order, especially in classification tasks. ### Main problem summary: 1. **Limitations of traditional CNNs**: CNNs depend on the spatial order of features and are not effective for non - spatial data (such as one - dimensional data). 2. **Importance of feature combination**: Existing methods fail to fully utilize the potential information in feature combinations. 3. **Improving model robustness**: By enhancing feature interaction, the model is made more robust when dealing with data with arbitrary feature arrangements. ### Solutions: - **Introducing TCNs**: Combine input features in multiple ways to generate new feature representations, thereby reducing the dependence on feature order. - **Feature combination strategies**: Include two methods, multiplicative combination and pairwise product summation, to capture high - order interactions between features. - **Improved network architecture**: Include a feature interaction module, residual connections, and regularization techniques to improve the generalization ability and training stability of the model. Through these improvements, TCNs can achieve better classification performance on various benchmark data sets, especially in classification scenarios involving one - dimensional data.