TY - GEN
T1 - SpacePhish: The Evasion-space of Adversarial Attacks against Phishing Website Detectors using Machine Learning
T2 - 38th Annual Computer Security Applications Conference, ACSAC 2022
AU - Apruzzese, Giovanni
AU - Conti, Mauro
AU - Yuan, Ying
PY - 2022
Y1 - 2022
AB - Existing literature on adversarial Machine Learning (ML) focuses either on showing attacks that break every ML model, or on defenses that withstand most attacks. Unfortunately, little consideration is given to the actual cost of the attack or of the defense. Moreover, adversarial samples are often crafted in the "feature-space", making the corresponding evaluations of questionable value. Simply put, the current situation does not allow one to estimate the actual threat posed by adversarial attacks, leading to a lack of secure ML systems. We aim to clarify such confusion in this paper. By considering the application of ML for Phishing Website Detection (PWD), we formalize the "evasion-space" in which an adversarial perturbation can be introduced to fool an ML-PWD, demonstrating that even perturbations in the "feature-space" are useful. Then, we propose a realistic threat model describing evasion attacks against ML-PWD that are cheap to stage, and hence intrinsically more attractive for real phishers. Finally, we perform the first statistically validated assessment of state-of-the-art ML-PWD against 12 evasion attacks. Our evaluation shows (i) the true efficacy of evasion attempts that are more likely to occur; and (ii) the impact of perturbations crafted in different evasion-spaces. Our realistic evasion attempts induce a statistically significant degradation (3-10% at p < 0.05), and their cheap cost makes them a subtle threat. Notably, however, some ML-PWD are immune to our most realistic attacks (p = 0.22). Our contribution paves the way for a much-needed re-assessment of adversarial attacks against ML systems for cybersecurity.
KW - Adversarial Attacks
KW - Machine Learning
KW - Phishing
KW - Website
UR - http://www.scopus.com/inward/record.url?scp=85144043755&partnerID=8YFLogxK
U2 - 10.1145/3564625.3567980
DO - 10.1145/3564625.3567980
M3 - Conference contribution
AN - SCOPUS:85144043755
T3 - ACM International Conference Proceeding Series
SP - 171
EP - 185
BT - Proceedings - 38th Annual Computer Security Applications Conference, ACSAC 2022
PB - Association for Computing Machinery (ACM)
Y2 - 5 December 2022 through 9 December 2022
ER -