TypeEvalPy: A Micro-benchmarking Framework for Python Type Inference Tools

Ashwin Prasad Shivarpatna Venkatesh,Samkutty Sabu,Jiawei Wang,Amir M. Mir,Li,Eric Bodden
DOI: https://doi.org/10.1145/3639478.3640033
2023-01-01
Abstract:In light of the growing interest in type inference research for Python, bothresearchers and practitioners require a standardized process to assess theperformance of various type inference techniques. This paper introducesTypeEvalPy, a comprehensive micro-benchmarking framework for evaluating typeinference tools. TypeEvalPy contains 154 code snippets with 845 typeannotations across 18 categories that target various Python features. Theframework manages the execution of containerized tools, transforms inferredtypes into a standardized format, and produces meaningful metrics forassessment. Through our analysis, we compare the performance of six typeinference tools, highlighting their strengths and limitations. Our findingsprovide a foundation for further research and optimization in the domain ofPython type inference.
What problem does this paper attempt to address?