Scalable Linear Algebra on a Relational Database System

Shangyu Luo, Zekai J. Gao, Michael Gubanov, Luis L. Perez, Christopher Jermaine
DOI: https://doi.org/10.1145/3277006.3277013
2018-09-10
ACM SIGMOD Record
Abstract: Scalable linear algebra is important for analytics and machine learning (including deep learning). In this paper, we argue that a parallel or distributed database system is actually an excellent platform upon which to build such functionality. Most relational systems already support cost-based optimization, which is vital to scaling linear algebra computations, and it is well known how to make relational systems scale. We show that, with just a few changes, a parallel/distributed relational database system can be a competitive platform for scalable linear algebra. Our results suggest that brand new systems supporting scalable linear algebra are not absolutely necessary, and that such systems could instead be built on top of existing relational technology.
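
To make the abstract's claim concrete: a matrix product can be expressed as a relational join followed by a grouped aggregation, which is the kind of query a cost-based optimizer already knows how to plan. The sketch below is an illustration only and is not taken from the paper. It assumes matrices are stored one entry per tuple, and it uses Python's built-in sqlite3 module, a single-node engine, as a stand-in for the parallel/distributed system the authors describe. The table names A and B, the column names, and the function name matmul_sql are all hypothetical.

```python
import sqlite3


def matmul_sql(a, b):
    """Multiply two dense matrices (lists of rows) with a SQL join and SUM.

    Hypothetical helper for illustration; each matrix entry is stored as
    one (row_idx, col_idx, val) tuple.
    """
    conn = sqlite3.connect(":memory:")
    try:
        for name, matrix in (("A", a), ("B", b)):
            conn.execute(
                f"CREATE TABLE {name} (row_idx INTEGER, col_idx INTEGER, val REAL)"
            )
            conn.executemany(
                f"INSERT INTO {name} VALUES (?, ?, ?)",
                [
                    (i, j, v)
                    for i, row in enumerate(matrix)
                    for j, v in enumerate(row)
                ],
            )

        # C[i][j] = sum over k of A[i][k] * B[k][j]:
        # join on the shared index k, then aggregate per output cell (i, j).
        rows = conn.execute(
            """
            SELECT A.row_idx, B.col_idx, SUM(A.val * B.val)
            FROM A JOIN B ON A.col_idx = B.row_idx
            GROUP BY A.row_idx, B.col_idx
            """
        ).fetchall()
    finally:
        conn.close()

    # Rebuild a dense result; the loop is order-independent because each
    # (i, j) pair appears exactly once in the grouped output.
    result = [[0.0] * len(b[0]) for _ in a]
    for i, j, v in rows:
        result[i][j] = v
    return result


product = matmul_sql([[1, 2], [3, 4]], [[5, 6], [7, 8]])
```

Storing one entry per tuple keeps the example short, but it is only one possible layout; nothing here reflects the specific changes the authors make to their database system.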