Abstract: Code written by developers often suffers from efficiency problems and contains various performance bugs. These inefficiencies motivate research on automated refactoring methods for code optimization. Early research in code optimization employs rule-based methods that target specific inefficiency issues; these methods are labor-intensive and suffer from low coverage. Recent work treats the task as a sequence generation problem and resorts to deep learning (DL) techniques such as large language models (LLMs). These methods typically prompt LLMs to directly generate optimized code. Although they show state-of-the-art performance, such a one-step generation paradigm struggles to reach an optimal solution. First, complex optimization methods, such as combinatorial ones, are hard for LLMs to capture. Second, the one-step generation paradigm makes it difficult to precisely infuse the knowledge required for effective code optimization into LLMs, resulting in under-optimized code. To address these problems, we propose to model this task from the search perspective, and propose a search-based LLM framework named SBLLM that enables iterative refinement and discovery of improved optimization methods.
SBLLM synergistically integrates LLMs with evolutionary search and consists of three key components: 1) an execution-based representative sample selection component that evaluates the fitness of each existing optimized code candidate and prioritizes promising ones to guide the generation of improved code; 2) an adaptive optimization pattern retrieval component that infuses targeted optimization patterns into the model, guiding LLMs towards rectifying and progressively enhancing their optimization methods; and 3) a genetic operator-inspired chain-of-thought prompting component that aids LLMs in combining different optimization methods and generating improved ones.
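
The three components described above can be pictured as one iteration loop. The sketch below is only an illustration under stated assumptions, not the SBLLM implementation: every function name is hypothetical, the LLM is replaced by a stub that returns fixed candidates, fitness is a toy stand-in for execution-based measurement (shorter text counts as "faster"), and pattern retrieval is a simple token-overlap ranking.

```python
from typing import Callable, List


def select_representatives(population: List[str],
                           fitness: Callable[[str], float],
                           k: int) -> List[str]:
    """Component 1 (sketch): keep the k fittest distinct candidates."""
    unique = list(dict.fromkeys(population))  # drop duplicates, keep order
    return sorted(unique, key=fitness, reverse=True)[:k]


def retrieve_patterns(representatives: List[str],
                      pattern_bank: List[str],
                      top_n: int = 1) -> List[str]:
    """Component 2 (sketch): rank patterns by token overlap with candidates.

    Assumption: a real system would use a learned or execution-aware
    similarity; token overlap is only a placeholder.
    """
    tokens = set(" ".join(representatives).split())

    def overlap(pattern: str) -> int:
        return len(tokens & set(pattern.split()))

    return sorted(pattern_bank, key=overlap, reverse=True)[:top_n]


def build_prompt(source: str,
                 representatives: List[str],
                 patterns: List[str]) -> str:
    """Component 3 (sketch): a prompt phrased after genetic operators."""
    parts = ["Original code:", source, "Current best candidates:"]
    parts += representatives
    parts += ["Relevant optimization patterns:"] + patterns
    parts.append("Step 1 (crossover): combine the strengths of the candidates.")
    parts.append("Step 2 (mutation): apply a pattern to produce improved code.")
    return "\n".join(parts)


def search(source: str,
           generate: Callable[[str], List[str]],
           fitness: Callable[[str], float],
           pattern_bank: List[str],
           iterations: int = 3,
           k: int = 2) -> str:
    """Iteratively select, retrieve, prompt, and regenerate candidates."""
    population = [source]
    for _ in range(iterations):
        reps = select_representatives(population, fitness, k)
        patterns = retrieve_patterns(reps, pattern_bank)
        prompt = build_prompt(source, reps, patterns)
        population = reps + generate(prompt)  # survivors plus new candidates
    return max(population, key=fitness)


def toy_fitness(code: str) -> float:
    # Hypothetical stand-in for execution-based scoring.
    return -len(code)


def stub_generate(prompt: str) -> List[str]:
    # Hypothetical stand-in for an LLM call; ignores the prompt.
    return ["total = sum(xs)", "total = 0\nfor x in xs: total += x"]


slow = "total = 0\nfor i in range(len(xs)):\n    total = total + xs[i]"
best = search(slow, stub_generate, toy_fitness,
              ["use built-in sum", "cache repeated calls"])
print(best)
```

In a realistic setting, `toy_fitness` would be replaced by running each candidate against test cases and timing it, and `stub_generate` by a call to an actual model; both substitutions are left open here because the abstract does not specify them.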

CodeRefine: A Pipeline for Enhancing LLM-Generated Code Implementations of Research Papers

Rethinking Code Refinement: Learning to Judge Code Efficiency

CodeSift: An LLM-Based Reference-Less Framework for Automatic Code Validation

Towards Large Language Model Aided Program Refinement

Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs

Together We Go Further: LLMs and IDE Static Analysis for Extract Method Refactoring

LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback

TextRefine: A Novel approach to improve the accuracy of LLM Models

Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines

Search-Based LLMs for Code Optimization

REINFOREST: Reinforcing Semantic Code Similarity for Cross-Lingual Code Search Models

LLM-Assisted Code Cleaning For Training Accurate Code Generators

Effi-Code: Unleashing Code Efficiency in Language Models

Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

A Survey on Large Language Models for Code Generation

Improving Natural Language Capability of Code Large Language Model

Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search

CodeLutra: Boosting LLM Code Generation via Preference-Guided Refinement

Better Python Programming for all: With the focus on Maintainability

LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models