Skip to main navigation Skip to search Skip to main content

Imperfect Code Generation: UncoveringWeaknesses in Automatic Code Generation by Large Language Models

  • Xiaoli Lian
  • , Shuaisong Wang
  • , Jieping Ma
  • , Xin Tan
  • , Fang Liu
  • , Lin Shi
  • , Li Zhang
  • , Cuiyun Gao
  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The task of code generation has received significant attention in recent years, especially when the pre-trained large language models (LLMs) for code have consistently achieved state-of-the-art performance. However, there is currently a lack of a comprehensive weakness taxonomy in the field, uncovering weaknesses in automatic code generation by LLMs. This may lead the community to invest excessive efforts into well-known hotspots while neglecting many crucial yet unrecognized issues that deserve more attention. To bridge this gap, we conduct a systematic study on analyzing the weaknesses based on three state-of-the-art LLMs across three widely-used code generation datasets. Our study identifies eight types of weaknesses and assesses their prevalence across each LLM and dataset, aiming to inform and shape the trajectory of future research in the domain.

Original languageEnglish
Title of host publicationProceedings - 2024 ACM/IEEE 46th International Conference on Software Engineering
Subtitle of host publicationCompanion, ICSE-Companion 2024
PublisherIEEE Computer Society
Pages422-423
Number of pages2
ISBN (Electronic)9798400705021
DOIs
StatePublished - 23 May 2024
Event46th International Conference on Software Engineering: Companion, ICSE-Companion 2024 - Lisbon, Portugal
Duration: 14 Apr 202420 Apr 2024

Publication series

NameProceedings - International Conference on Software Engineering
ISSN (Print)0270-5257

Conference

Conference46th International Conference on Software Engineering: Companion, ICSE-Companion 2024
Country/TerritoryPortugal
CityLisbon
Period14/04/2420/04/24

Fingerprint

Dive into the research topics of 'Imperfect Code Generation: UncoveringWeaknesses in Automatic Code Generation by Large Language Models'. Together they form a unique fingerprint.

Cite this