Federated learning for cyber security

Ekaterina Khramtsova; Christian Hammerschmidt; Sofian Lagraa; Radu State

doi:10.1109/ICDCS47774.2020.00171

Algorithmics

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

8 Citations (Scopus)

Abstract

Managed security service providers increasingly rely on machine-learning methods to outperform traditional, signature-based threat detection and classification methods. As machine learning often improves with more data available, smaller organizations and clients find themselves at a disadvantage: without the ability to share their data and without others willing to collaborate, their machine-learned threat detection will perform worse than the same model in a larger organization. We show that Federated Learning, i.e. collaborative learning without data sharing, successfully helps to overcome this problem. Our experiments focus on a common task in cyber security, the detection of unwanted URLs in network traffic seen by security-as-a-service providers. Our experiments show that (i) smaller participants benefit from larger participants; (ii) participants seeing different types of malicious traffic can generalize better to unseen types of attacks, increasing performance by 8% to 15% on average, and up to 27% in the extreme case; and (iii) participating in Federated training never harms the performance of the locally trained model. In our experiment modeling a security-as-a-service setting, Federated Learning increased detection by up to 30% for some participants in the scheme. This shows that Federated Learning is a viable approach to address issues of data sharing in common cyber security settings.

Original language	English
Title of host publication	Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020
Publisher	Institute of Electrical and Electronics Engineers (IEEE)
Pages	1316-1321
Number of pages	6
ISBN (Electronic)	9781728170022
DOIs	https://doi.org/10.1109/ICDCS47774.2020.00171
Publication status	Published - 2020
Event	40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020 - Singapore, Singapore
Duration	29 Nov 2020 → 1 Dec 2020

Publication series

Name	Proceedings - International Conference on Distributed Computing Systems
Volume	2020-November

Conference

Conference	40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020
Country/Territory	Singapore
City	Singapore
Period	29/11/20 → 1/12/20

Keywords

cyber-security
Federated-learning
Machine-learning

Access to Document

10.1109/ICDCS47774.2020.00171

Cite this

Khramtsova, E., Hammerschmidt, C., Lagraa, S., & State, R. (2020). Federated learning for cyber security. In Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020 (pp. 1316-1321). Article 09355811 (Proceedings - International Conference on Distributed Computing Systems; Vol. 2020-November). Institute of Electrical and Electronics Engineers (IEEE). https://doi.org/10.1109/ICDCS47774.2020.00171

@inproceedings{41900ed9d2174c6699787a8740f76466,

title = "Federated learning for cyber security",

abstract = "Managed security service providers increasingly rely on machine-learning methods to outperform traditional, signature-based threat detection and classification methods. As machine learning often improves with more data available, smaller organizations and clients find themselves at a disadvantage: without the ability to share their data and without others willing to collaborate, their machine-learned threat detection will perform worse than the same model in a larger organization. We show that Federated Learning, i.e. collaborative learning without data sharing, successfully helps to overcome this problem. Our experiments focus on a common task in cyber security, the detection of unwanted URLs in network traffic seen by security-as-a-service providers. Our experiments show that (i) smaller participants benefit from larger participants; (ii) participants seeing different types of malicious traffic can generalize better to unseen types of attacks, increasing performance by 8\% to 15\% on average, and up to 27\% in the extreme case; and (iii) participating in Federated training never harms the performance of the locally trained model. In our experiment modeling a security-as-a-service setting, Federated Learning increased detection by up to 30\% for some participants in the scheme. This shows that Federated Learning is a viable approach to address issues of data sharing in common cyber security settings.",

keywords = "cyber-security, Federated-learning, Machine-learning",

author = "Ekaterina Khramtsova and Christian Hammerschmidt and Sofian Lagraa and Radu State",

year = "2020",

doi = "10.1109/ICDCS47774.2020.00171",

language = "English",

series = "Proceedings - International Conference on Distributed Computing Systems",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

pages = "1316--1321",

booktitle = "Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020",

address = "United States",

note = "40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020 ; Conference date: 29-11-2020 Through 01-12-2020",

}

Khramtsova, E, Hammerschmidt, C, Lagraa, S & State, R 2020, Federated learning for cyber security. in Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020, 09355811, Proceedings - International Conference on Distributed Computing Systems, vol. 2020-November, Institute of Electrical and Electronics Engineers (IEEE), pp. 1316-1321, 40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020, Singapore, Singapore, 29/11/20. https://doi.org/10.1109/ICDCS47774.2020.00171

Federated learning for cyber security. / Khramtsova, Ekaterina; Hammerschmidt, Christian; Lagraa, Sofian et al.
Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020. Institute of Electrical and Electronics Engineers (IEEE), 2020. p. 1316-1321 09355811 (Proceedings - International Conference on Distributed Computing Systems; Vol. 2020-November).

TY - GEN

T1 - Federated learning for cyber security

AU - Khramtsova, Ekaterina

AU - Hammerschmidt, Christian

AU - Lagraa, Sofian

AU - State, Radu

PY - 2020

Y1 - 2020

N2 - Managed security service providers increasingly rely on machine-learning methods to outperform traditional, signature-based threat detection and classification methods. As machine learning often improves with more data available, smaller organizations and clients find themselves at a disadvantage: without the ability to share their data and without others willing to collaborate, their machine-learned threat detection will perform worse than the same model in a larger organization. We show that Federated Learning, i.e. collaborative learning without data sharing, successfully helps to overcome this problem. Our experiments focus on a common task in cyber security, the detection of unwanted URLs in network traffic seen by security-as-a-service providers. Our experiments show that (i) smaller participants benefit from larger participants; (ii) participants seeing different types of malicious traffic can generalize better to unseen types of attacks, increasing performance by 8% to 15% on average, and up to 27% in the extreme case; and (iii) participating in Federated training never harms the performance of the locally trained model. In our experiment modeling a security-as-a-service setting, Federated Learning increased detection by up to 30% for some participants in the scheme. This shows that Federated Learning is a viable approach to address issues of data sharing in common cyber security settings.

AB - Managed security service providers increasingly rely on machine-learning methods to outperform traditional, signature-based threat detection and classification methods. As machine learning often improves with more data available, smaller organizations and clients find themselves at a disadvantage: without the ability to share their data and without others willing to collaborate, their machine-learned threat detection will perform worse than the same model in a larger organization. We show that Federated Learning, i.e. collaborative learning without data sharing, successfully helps to overcome this problem. Our experiments focus on a common task in cyber security, the detection of unwanted URLs in network traffic seen by security-as-a-service providers. Our experiments show that (i) smaller participants benefit from larger participants; (ii) participants seeing different types of malicious traffic can generalize better to unseen types of attacks, increasing performance by 8% to 15% on average, and up to 27% in the extreme case; and (iii) participating in Federated training never harms the performance of the locally trained model. In our experiment modeling a security-as-a-service setting, Federated Learning increased detection by up to 30% for some participants in the scheme. This shows that Federated Learning is a viable approach to address issues of data sharing in common cyber security settings.

KW - cyber-security

KW - Federated-learning

KW - Machine-learning

UR - http://www.scopus.com/inward/record.url?scp=85101996393&partnerID=8YFLogxK

U2 - 10.1109/ICDCS47774.2020.00171

DO - 10.1109/ICDCS47774.2020.00171

M3 - Conference contribution

AN - SCOPUS:85101996393

T3 - Proceedings - International Conference on Distributed Computing Systems

SP - 1316

EP - 1321

BT - Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020

PB - Institute of Electrical and Electronics Engineers (IEEE)

T2 - 40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020

Y2 - 29 November 2020 through 1 December 2020

ER -

Khramtsova E, Hammerschmidt C, Lagraa S, State R. Federated learning for cyber security. In Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020. Institute of Electrical and Electronics Engineers (IEEE). 2020. p. 1316-1321. 09355811. (Proceedings - International Conference on Distributed Computing Systems). doi: 10.1109/ICDCS47774.2020.00171
