Abstract: Millions of programs are now written every day, in many languages, all over the world. A deep neural network-based intelligent support model for source code completion would therefore be of great benefit in software engineering and programming education. Source code continues to contain vast numbers of syntax, logic, and other critical errors that ordinary compilers cannot detect, so an intelligent evaluation methodology that does not rely on manual compilation has become essential. Even experienced programmers often have to analyze an entire program to find a single error, and thus waste valuable time debugging. With this in mind, we propose an intelligent model for source code completion that combines long short-term memory (LSTM) with an attention mechanism. The proposed model can detect errors in source code, report their locations, and predict the correct words. It can also classify source code as erroneous or not. We trained the model on source code and then evaluated its performance; all data used in our experiments were extracted from the Aizu Online Judge (AOJ) system. The experimental results show that the proposed model achieves an accuracy of approximately 62% for error detection and prediction and approximately 96% for source code classification, outperforming a standard LSTM and other state-of-the-art models. Moreover, compared with state-of-the-art models, the proposed model achieved a notable level of success in error detection, prediction, and classification on long source code sequences. Overall, these results indicate the usefulness of the proposed model in software engineering and programming education.
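As a rough illustration of the attention step mentioned in the abstract, the following sketch computes a softmax-weighted summary of a sequence of hidden states. It is not the authors' implementation: the abstract does not say which attention variant is used, so plain dot-product attention is assumed here, and the hidden states are hard-coded stand-ins for the outputs of an LSTM. The names `softmax`, `attend`, `hidden_states`, and `query` are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(hidden_states, query):
    """Dot-product attention (an assumed variant, not taken from the paper).

    hidden_states: one vector per token position (stand-ins for LSTM outputs).
    query: the vector used to score each position.
    Returns the attention weights and the weighted-sum context vector.
    """
    scores = [sum(h_d * q_d for h_d, q_d in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(query)
    context = [
        sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)
    ]
    return weights, context


# Toy hidden states for a three-token sequence (made-up values).
hidden_states = [[0.1, 0.3], [0.7, -0.2], [0.4, 0.9]]
query = hidden_states[-1]  # assumption: use the last state as the query
weights, context = attend(hidden_states, query)
```

In a full model, a context vector of this kind would typically be combined with the current LSTM state before predicting the next token. Because every earlier position contributes directly to the context vector, this is one plausible reason an attention mechanism could help on long source code sequences.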
