Ten Pillars for Data Meshes

Robert L. Grossman,Ceilyn Boyd,Nhan Do,Danne C. Elbers,Michael S. Fitzsimons,Maryellen L. Giger,Anthony Juehne,Brienna Larrick,Jerry S. H. Lee,Dawei Lin,Michael Lukowski,James D. Myers,L. Philip Schumm,Aarti Venkat
2024-11-08
Abstract:Over the past few years, a growing number of data platforms have emerged, including data commons, data repositories, and databases containing biomedical, environmental, social determinants of health and other data relevant to improving health outcomes. With the growing number of data platforms, interoperating multiple data platforms to form data meshes, data fabrics and other types of data ecosystems reduces data silos, expands data use, and increases the potential for new discoveries. In this paper, we introduce ten principles, which we call pillars, for data meshes. The goals of the principles are 1) to make it easier, faster, and more uniform to set up a data mesh from multiple data platforms; and, 2) to make it easier, faster, and more uniform, for a data platform to join one or more data meshes. The hope is that the greater availability of data through data meshes will accelerate research and that the greater uniformity of meshes will lower the cost of developing meshes and connecting a data platform to them.
Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?