Abstract:Exploring whether Enriched Category Theory could provide the foundation of an alternative approach to Machine Learning. This paper is the first to construct and motivate a Machine Learning algorithm solely with Enriched Category Theory. In order to supplement evidence that Category Theory can be used to motivate robust and explainable algorithms, it is shown that a series of reasonable assumptions about a dataset lead to the construction of the Nearest Neighbours Algorithm. In particular, as an extension of the original dataset using profunctors in the category of Lawvere metric spaces. This leads to a definition of an Enriched Nearest Neighbours Algorithm, which consequently also produces an enriched form of the Voronoi diagram. This paper is intended to be accessible without any knowledge of Category Theory

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: **How to use Enriched Category Theory to provide a first - principles - based design method for machine learning, and naturally derive the classical Nearest Neighbours Algorithm (NNA) through this theory.** Specifically, the goals of the paper are: 1. **To explore whether Enriched Category Theory can be used as a basic framework for machine learning**. Through this method, machine - learning algorithms can be designed and explained more transparently, making the assumptions of the algorithms more explicit, and the learning process can be interpreted as comparative reasoning on observed data. 2. **To show how basic data - set assumptions can naturally lead to the Nearest Neighbours Classification Algorithm**. By introducing concepts such as Lawvere metric spaces and profunctors, the paper closely combines the construction of NNA with Enriched Category Theory, proving a direct path from data representation to optimal classification selection. ### Core problems of the paper - **Improving the interpretability and transparency of machine - learning algorithms**: Many existing machine - learning algorithms are regarded as "black boxes", and it is difficult to understand their internal working principles. By using Enriched Category Theory, data structures and algorithm mechanisms can be expressed more clearly. - **Theoretically verifying the application potential of Enriched Category Theory in machine learning**: By constructing NNA, it is proved that Enriched Category Theory can not only be used to describe existing algorithms, but also can provide a theoretical basis for the design of new algorithms. ### Main contributions - **For the first time, a machine - learning algorithm is completely constructed and explained based on Enriched Category Theory**: This is the first example showing how to derive NNA only through reasonable data - set assumptions and the tools of Enriched Category Theory. - **Providing a new perspective to understand and design machine - learning algorithms**: By explicitly encoding the comparison relationships between data, Enriched Category Theory provides a more intuitive and rigorous framework for the design of machine - learning algorithms. ### Presentation of formulas in Markdown format For example, when constructing NNA, the key formulas mentioned in the paper are as follows: \[ \text{NNA}(y, x) = \exists i \in N [T_i = y \text{ and } d(F_i, x) = \inf_{i' \in N} d(F_{i'}, x)] \] where: - \( T_i \) represents the class label of the \( i\) - th data point. - \( F_i \) represents the feature vector of the \( i\) - th data point. - \( d(a, b) \) represents the distance between two points \( a\) and \( b\) in the metric space. In this way, the paper not only shows how to theoretically construct NNA, but also provides new ideas and tools for future research.

Using Enriched Category Theory to Construct the Nearest Neighbour Classification Algorithm

Exploring Explainable AI: Category Theory Insights into Machine Learning Algorithms

Nearest Feature Line Classifier Based on Collaborative Representation with Nearest Neighbour Search Algorithm

Learning from Exemplars and Prototypes in Machine Learning and Psychology

Category Theory in Machine Learning

On high-dimensional modifications of the nearest neighbor classifier

PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans

In defense of Nearest-Neighbor based image classification

A Bayesian reassessment of nearest-neighbour classification

Towards a Categorical Foundation of Deep Learning: A Survey

A Novel Pseudo Nearest Neighbor Classification Method Using Local Harmonic Mean Distance

Algorithms that Defy the Gravity of Learning Curve

Rethinking Nearest Neighbors for Visual Classification

Categorification of Group Equivariant Neural Networks

k-Nearest Neighbour Classifiers: 2nd Edition (with Python examples)

The Nearest Neighbor Algorithm of Local Probability Centers

Category-Theoretical and Topos-Theoretical Frameworks in Machine Learning: A Survey

Natural Learning

Characterizing the invariances of learning algorithms using category theory

Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery