Rethinking Positional Encoding in Tree Transformer for Code Representation.

Han Peng,Ge Li,Yunfei Zhao,Zhi Jin
DOI: https://doi.org/10.18653/v1/2022.emnlp-main.210
2022-01-01
Abstract:Transformers are now widely used in code representation, and several recent works further develop tree Transformers to capture the syntactic structure in source code.Specifically, novel tree positional encodings have been proposed to incorporate inductive bias into Transformer.In this work, we propose a novel tree Transformer encoding node positions based on our new description method for tree structures.Technically, local and global soft bias shown in previous works is both introduced as positional encodings of our Transformer model.Our model finally outperforms strong baselines on code summarization and completion tasks across two languages, demonstrating our model's effectiveness.Besides, extensive experiments and ablation study shows that combining both local and global paradigms is still helpful in improving model performance.We release our code at https://github.com/AwdHanPeng/TreeTransformer.
What problem does this paper attempt to address?