AST-Transformer: Encoding Abstract Syntax Trees Efficiently for Code Summarization

Ze Tang,Chuanyi Li,Jidong Ge,Xiaoyu Shen,Zheling Zhu,Bin Luo
DOI: https://doi.org/10.1109/ase51524.2021.9678882
2021-01-01
Abstract:Code summarization aims to generate brief natural language descriptions for source code. As source code is highly structured and follows strict programming language grammars, its Abstract Syntax Tree (AST) is often leveraged to inform the encoder about the structural information. However, ASTs are usually much longer than the source code. Current approaches ignore the size limit and simply feed the whole linearized AST into the encoder. To address this problem, we propose AST-Transformer to efficiently encode tree-structured ASTs. Experiments show that AST-Transformer outperforms the state-of-arts by a substantial margin while being able to reduce 90 ~ 95% of the computational complexity in the encoding process.
What problem does this paper attempt to address?