3DGen: AI-Assisted Generation of Provably Correct Binary Format Parsers

Sarah Fakhoury,Markus Kuppe,Shuvendu K. Lahiri,Tahina Ramananandro,Nikhil Swamy
2024-05-07
Abstract:Improper parsing of attacker-controlled input is a leading source of software security vulnerabilities, especially when programmers transcribe informal format descriptions in RFCs into efficient parsing logic in low-level, memory unsafe languages. Several researchers have proposed formal specification languages for data formats from which efficient code can be extracted. However, distilling informal requirements into formal specifications is challenging and, despite their benefits, new, formal languages are hard for people to learn and use.
Software Engineering
What problem does this paper attempt to address?