CONCORD: Towards a DSL for Configurable Graph Code Representation

Mootez Saad,Tushar Sharma
2024-02-01
Abstract:Deep learning is widely used to uncover hidden patterns in large code corpora. To achieve this, constructing a format that captures the relevant characteristics and features of source code is essential. Graph-based representations have gained attention for their ability to model structural and semantic information. However, existing tools lack flexibility in constructing graphs across different programming languages, limiting their use. Additionally, the output of these tools often lacks interoperability and results in excessively large graphs, making graph-based neural networks training slower and less scalable.
Software Engineering,Machine Learning
What problem does this paper attempt to address?