The Emergence of Large Language Models in Static Analysis: A First Look Through Micro-Benchmarks

Ashwin Prasad Shivarpatna Venkatesh, Samkutty Sabu, Amir M. Mir, Sofia Reis, Eric Bodden

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

The application of Large Language Models (LLMs) in software engineering, particularly in static analysis tasks, represents a paradigm shift in the field. In this paper, we investigate the role that current LLMs can play in improving callgraph analysis and type inference for Python programs. Using the PyCG, HeaderGen, and TypeEvalPy micro-benchmarks, we evaluate 26 LLMs, including OpenAI's GPT series and open-source models such as LLaMA. Our study reveals that LLMs show promising results in type inference, demonstrating higher accuracy than traditional methods, yet they exhibit limitations in callgraph analysis. This contrast emphasizes the need for specialized fine-tuning of LLMs to better suit specific static analysis tasks. Our findings provide a foundation for further research towards integrating LLMs for static analysis tasks.
Original language: English
Title of host publication: FORGE '24
Subtitle of host publication: Proceedings of the 2024 IEEE/ACM First International Conference on AI Foundation Models and Software Engineering
Place of Publication: New York, NY
Publisher: Association for Computing Machinery (ACM)
Pages: 35-39
Number of pages: 5
ISBN (Electronic): 979-8-4007-0609-7
Publication status: Published - 2024
Event: 1st IEEE/ACM International Conference on AI Foundation Models and Software Engineering, FORGE 2024, co-located with the 46th ACM/IEEE International Conference on Software Engineering, ICSE 2024 - Lisbon, Portugal
Duration: 14 Apr 2024 to 14 Apr 2024
https://conf.researchr.org/home/forge-2024
