Excel Spreadsheet Analyzer

Amir Nassereldine,Patrick Chen,Jinjun Xiong
DOI: https://doi.org/10.48550/arXiv.2211.06333
2022-11-01
Software Engineering
Abstract:Spreadsheets are widely used in various fields to do large numerical analysis. While several companies have relied on spreadsheets for decades, data scientists are going in the direction of using scientific programming languages such as python to do their data analysis due to the support, community, and vast amount of libraries. While using python to analyze a company's spreadsheets, some information such as the formulas and dependencies of a cell are lost. We propose a tool that creates an abstract intermediate representation (AIR) of a spreadsheet. This representation facilitates the transfer from spreadsheets into scientific programming languages while preserving inter-dependency information about data. In addition to that, we build a python library on top of our tool to perform some data analysis in python.
What problem does this paper attempt to address?