CoRT: Transformer-based code representations with self-supervision by predicting reserved words for code smell detection

Amal Alazba,Hamoud Aljamaan,Mohammad Alshayeb
DOI: https://doi.org/10.1007/s10664-024-10445-9
IF: 3.762
2024-04-09
Empirical Software Engineering
Abstract:Code smell detection is the process of identifying poorly designed and implemented code pieces. Machine learning-based approaches require enormous amounts of manually labeled data, which are costly and difficult to scale. Unsupervised semantic feature learning, or learning without manual annotation, is vital for effectively harvesting an enormous amount of available data.
computer science, software engineering
What problem does this paper attempt to address?