TY - GEN
T1 - An Attacker's Dream? Exploring the Capabilities of ChatGPT for Developing Malware
AU - Pa Pa, Yin Minn
AU - Tanizaki, Shunsuke
AU - Kou, Tetsui
AU - Van Eeten, Michel
AU - Yoshioka, Katsunari
AU - Matsumoto, Tsutomu
N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' (Taverne project, https://www.openaccess.nl/en/you-share-we-take-care). Otherwise, as indicated in the copyright section: the publisher is the copyright holder of this work, and the author uses Dutch legislation to make this work public.
PY - 2023
Y1 - 2023
AB - We investigate the potential for abuse of recent AI advances by developing seven malware programs and two attack tools using ChatGPT, OpenAI Playground's "text-davinci-003" model, and Auto-GPT, an open-source AI agent capable of generating automated prompts to accomplish user-defined goals. We confirm that: 1) Under the safety and moderation controls of recent AI systems, it is possible to generate functional malware and attack tools (up to about 400 lines of code) within 90 minutes, including debugging time. 2) Auto-GPT does not lower the hurdle of crafting the right prompts for malware generation, but it evades OpenAI's safety controls with its automatically generated prompts; when given goals with sufficient detail, it wrote the code for all nine of the malware programs and attack tools we tested. 3) There is still room to improve the moderation and safety controls of ChatGPT and the text-davinci-003 model, especially against the growing body of jailbreak prompts. Overall, we find that recent AI advances, including ChatGPT, Auto-GPT, and text-davinci-003, can generate malware and attack tools despite safety and moderation controls, highlighting the need for stronger safety controls in AI systems.
KW - AI generated malware
KW - Auto-GPT abuses
KW - ChatGPT abuses
UR - http://www.scopus.com/inward/record.url?scp=85171441483&partnerID=8YFLogxK
U2 - 10.1145/3607505.3607513
DO - 10.1145/3607505.3607513
M3 - Conference contribution
AN - SCOPUS:85171441483
T3 - ACM International Conference Proceeding Series
SP - 10
EP - 18
BT - Proceedings of CSET 2023 - 16th Cyber Security Experimentation and Test Workshop
PB - Association for Computing Machinery (ACM)
T2 - 16th Cyber Security Experimentation and Test Workshop, CSET 2023
Y2 - 7 August 2023
ER -