Assessing Robustness of ML-Based Program Analysis Tools using Metamorphic Program Transformations

L.H. Applis; A. Panichella; A. van Deursen

Research output: Contribution to conference › Paper › peer-review

Abstract

Metamorphic testing is a well-established testing technique that has been successfully applied in various domains, including testing deep learning models to assess their robustness against data noise or malicious input. Currently, metamorphic testing approaches for machine learning (ML) models have focused on image processing and object recognition tasks. Hence, these approaches cannot be applied to ML targeting program analysis tasks. In this paper, we extend metamorphic testing approaches for ML models targeting software programs. We present Lampion, a novel testing framework that applies (semantics-preserving) metamorphic transformations on the test datasets. Lampion produces new code snippets equivalent to the original test set but different in their identifiers or syntactic structure. We evaluate Lampion against CodeBERT, a state-of-the-art ML model for Code-To-Text tasks that creates Javadoc summaries for given Java methods. Our results show that simple transformations significantly impact the target model behavior, providing additional information on the model's reasoning apart from the classic performance metric.

Original language	English
Number of pages	6
Publication status	Published - 2021
Event	IEEE/ACM International Conference on Automated Software Engineering - virtual event
Duration	14 Nov 2021 → 20 Nov 2021

Conference

Conference	IEEE/ACM International Conference on Automated Software Engineering
Abbreviated title	ASE 2021
Period	14/11/21 → 20/11/21

Keywords

Metamorphic Testing
Machine Learning
Documentation Generation
Code-To-Text
Deep learning

Access to Document

Lampion (accepted author manuscript, 265 KB). Licence: CC BY

Cite this

@conference{250720e734f84718adf47191d3c4b48d,

title = "Assessing Robustness of ML-Based Program Analysis Tools using Metamorphic Program Transformations",

abstract = "Metamorphic testing is a well-established testing technique that has been successfully applied in various domains, including testing deep learning models to assess their robustness against data noise or malicious input. Currently, metamorphic testing approaches for machine learning (ML) models have focused on image processing and object recognition tasks. Hence, these approaches cannot be applied to ML targeting program analysis tasks. In this paper, we extend metamorphic testing approaches for ML models targeting software programs. We present Lampion, a novel testing framework that applies (semantics-preserving) metamorphic transformations on the test datasets. Lampion produces new code snippets equivalent to the original test set but different in their identifiers or syntactic structure. We evaluate Lampion against CodeBERT, a state-of-the-art ML model for Code-To-Text tasks that creates Javadoc summaries for given Java methods. Our results show that simple transformations significantly impact the target model behavior, providing additional information on the model's reasoning apart from the classic performance metric.",

keywords = "Metamorphic Testing, Machine Learning, Documentation Generation, Code-To-Text, Deep learning",

author = "Applis, L.H. and Panichella, A. and {van Deursen}, A.",

year = "2021",

language = "English",

note = "IEEE/ACM International Conference on Automated Software Engineering, ASE 2021 ; Conference date: 14-11-2021 Through 20-11-2021",

}

TY - CONF

T1 - Assessing Robustness of ML-Based Program Analysis Tools using Metamorphic Program Transformations

AU - Applis, L.H.

AU - Panichella, A.

AU - van Deursen, A.

PY - 2021

Y1 - 2021

N2 - Metamorphic testing is a well-established testing technique that has been successfully applied in various domains, including testing deep learning models to assess their robustness against data noise or malicious input. Currently, metamorphic testing approaches for machine learning (ML) models have focused on image processing and object recognition tasks. Hence, these approaches cannot be applied to ML targeting program analysis tasks. In this paper, we extend metamorphic testing approaches for ML models targeting software programs. We present Lampion, a novel testing framework that applies (semantics-preserving) metamorphic transformations on the test datasets. Lampion produces new code snippets equivalent to the original test set but different in their identifiers or syntactic structure. We evaluate Lampion against CodeBERT, a state-of-the-art ML model for Code-To-Text tasks that creates Javadoc summaries for given Java methods. Our results show that simple transformations significantly impact the target model behavior, providing additional information on the model's reasoning apart from the classic performance metric.

AB - Metamorphic testing is a well-established testing technique that has been successfully applied in various domains, including testing deep learning models to assess their robustness against data noise or malicious input. Currently, metamorphic testing approaches for machine learning (ML) models have focused on image processing and object recognition tasks. Hence, these approaches cannot be applied to ML targeting program analysis tasks. In this paper, we extend metamorphic testing approaches for ML models targeting software programs. We present Lampion, a novel testing framework that applies (semantics-preserving) metamorphic transformations on the test datasets. Lampion produces new code snippets equivalent to the original test set but different in their identifiers or syntactic structure. We evaluate Lampion against CodeBERT, a state-of-the-art ML model for Code-To-Text tasks that creates Javadoc summaries for given Java methods. Our results show that simple transformations significantly impact the target model behavior, providing additional information on the model's reasoning apart from the classic performance metric.

KW - Metamorphic Testing

KW - Machine Learning

KW - Documentation Generation

KW - Code-To-Text

KW - Deep learning

UR - https://conf.researchr.org/track/ase-2021/ase-2021-nier-track?#event-overview

M3 - Paper

T2 - IEEE/ACM International Conference on Automated Software Engineering

Y2 - 14 November 2021 through 20 November 2021

ER -
