BERT Rankers are Brittle: A Study using Adversarial Document Perturbations

Yumeng Wang; Lijun Lyu; Avishek Anand

doi:10.1145/3539813.3545122

Web Information Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Contextual ranking models based on BERT are now well established for a wide range of passage and document ranking tasks. However, the robustness of BERT-based ranking models under adversarial inputs is under-explored. In this paper, we argue that BERT-rankers are not immune to adversarial attacks targeting retrieved documents given a query. Firstly, we propose algorithms for adversarial perturbation of both highly relevant and non-relevant documents using gradient-based optimization methods. The aim of our algorithms is to add/replace a small number of tokens to a highly relevant or non-relevant document to cause a large rank demotion or promotion. Our experiments show that a small number of tokens can already result in a large change in the rank of a document. Moreover, we find that BERT-rankers heavily rely on the document start/head for relevance prediction, making the initial part of the document more susceptible to adversarial attacks. More interestingly, we find a small set of recurring adversarial words that when added to documents result in successful rank demotion/promotion of any relevant/non-relevant document respectively. Finally, our adversarial tokens also show particular topic preferences within and across datasets, exposing potential biases from BERT pre-training or downstream datasets.
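The gradient-based token replacement described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a HotFlip-style first-order swap, a toy linear scorer in place of a BERT ranker, and hypothetical names throughout (`embeddings`, `query_vec`, `score`, `score_grad`, `demote_by_one_swap`). The position weights that favour early tokens are also an assumption made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB, DIM = 50, 8
embeddings = rng.normal(size=(VOCAB, DIM))  # hypothetical token embeddings
query_vec = rng.normal(size=DIM)            # hypothetical query representation


def position_weights(n):
    """Assumed weighting: earlier document positions contribute more."""
    return 1.0 / (1.0 + np.arange(n))


def score(doc_ids):
    """Toy relevance score: position-weighted sum of token-query dot products."""
    return float(position_weights(len(doc_ids)) @ (embeddings[doc_ids] @ query_vec))


def score_grad(doc_ids):
    """Gradient of the toy score w.r.t. each token embedding, shape (len, DIM)."""
    return position_weights(len(doc_ids))[:, None] * query_vec[None, :]


def demote_by_one_swap(doc_ids):
    """Replace the one token whose swap is estimated to lower the score most."""
    grads = score_grad(doc_ids)
    best_delta, best_pos, best_tok = 0.0, None, None
    for pos, tok in enumerate(doc_ids):
        # First-order estimate: score(e') - score(e) ~ (e' - e) . grad
        delta = (embeddings - embeddings[tok]) @ grads[pos]
        cand = int(np.argmin(delta))
        if delta[cand] < best_delta:
            best_delta, best_pos, best_tok = float(delta[cand]), pos, cand
    new_ids = list(doc_ids)
    if best_pos is not None:
        new_ids[best_pos] = best_tok
    return new_ids


doc = [0, 1, 2, 3, 4, 5]            # a toy "relevant" document of token ids
attacked = demote_by_one_swap(doc)  # same document with one token replaced
print(score(doc), score(attacked))
```

Because the toy scorer is linear, the first-order estimate is exact here. With a real BERT ranker the estimate is only approximate, so each candidate swap would need to be re-scored by the model before it is accepted.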

Original language	English
Title of host publication	ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval
Publisher	Association for Computing Machinery (ACM)
Pages	115-120
Number of pages	6
ISBN (Electronic)	978-1-4503-9412-3
DOIs	https://doi.org/10.1145/3539813.3545122
Publication status	Published - 2022
Event	8th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2022 - Virtual, Online, Spain
Duration	11 Jul 2022 → 12 Jul 2022

Publication series

Name	ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval

Conference

Conference	8th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2022
Country/Territory	Spain
City	Virtual, Online
Period	11/07/22 → 12/07/22

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

adversarial attack
bert
biases
neural networks
ranking

Cite this

Wang, Y., Lyu, L., & Anand, A. (2022). BERT Rankers are Brittle: A Study using Adversarial Document Perturbations. In ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval (pp. 115-120). (ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval). Association for Computing Machinery (ACM). https://doi.org/10.1145/3539813.3545122

Wang, Yumeng ; Lyu, Lijun ; Anand, Avishek. / BERT Rankers are Brittle : A Study using Adversarial Document Perturbations. ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval. Association for Computing Machinery (ACM), 2022. pp. 115-120 (ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval).

@inproceedings{4c6da9fa794e4ad3b561be2115000def,

title = "BERT Rankers are Brittle: A Study using Adversarial Document Perturbations",

abstract = "Contextual ranking models based on BERT are now well established for a wide range of passage and document ranking tasks. However, the robustness of BERT-based ranking models under adversarial inputs is under-explored. In this paper, we argue that BERT-rankers are not immune to adversarial attacks targeting retrieved documents given a query. Firstly, we propose algorithms for adversarial perturbation of both highly relevant and non-relevant documents using gradient-based optimization methods. The aim of our algorithms is to add/replace a small number of tokens to a highly relevant or non-relevant document to cause a large rank demotion or promotion. Our experiments show that a small number of tokens can already result in a large change in the rank of a document. Moreover, we find that BERT-rankers heavily rely on the document start/head for relevance prediction, making the initial part of the document more susceptible to adversarial attacks. More interestingly, we find a small set of recurring adversarial words that when added to documents result in successful rank demotion/promotion of any relevant/non-relevant document respectively. Finally, our adversarial tokens also show particular topic preferences within and across datasets, exposing potential biases from BERT pre-training or downstream datasets. ",

keywords = "adversarial attack, bert, biases, neural networks, ranking",

author = "Yumeng Wang and Lijun Lyu and Avishek Anand",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.; 8th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2022 ; Conference date: 11-07-2022 Through 12-07-2022",

year = "2022",

doi = "10.1145/3539813.3545122",

language = "English",

series = "ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval",

publisher = "Association for Computing Machinery (ACM)",

pages = "115--120",

booktitle = "ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval",

address = "United States",

}

Wang, Y, Lyu, L & Anand, A 2022, BERT Rankers are Brittle: A Study using Adversarial Document Perturbations. in ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval. ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Association for Computing Machinery (ACM), pp. 115-120, 8th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2022, Virtual, Online, Spain, 11/07/22. https://doi.org/10.1145/3539813.3545122

BERT Rankers are Brittle: A Study using Adversarial Document Perturbations. / Wang, Yumeng; Lyu, Lijun; Anand, Avishek.
ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval. Association for Computing Machinery (ACM), 2022. p. 115-120 (ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval).

TY - GEN

T1 - BERT Rankers are Brittle

T2 - 8th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2022

AU - Wang, Yumeng

AU - Lyu, Lijun

AU - Anand, Avishek

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2022

Y1 - 2022

N2 - Contextual ranking models based on BERT are now well established for a wide range of passage and document ranking tasks. However, the robustness of BERT-based ranking models under adversarial inputs is under-explored. In this paper, we argue that BERT-rankers are not immune to adversarial attacks targeting retrieved documents given a query. Firstly, we propose algorithms for adversarial perturbation of both highly relevant and non-relevant documents using gradient-based optimization methods. The aim of our algorithms is to add/replace a small number of tokens to a highly relevant or non-relevant document to cause a large rank demotion or promotion. Our experiments show that a small number of tokens can already result in a large change in the rank of a document. Moreover, we find that BERT-rankers heavily rely on the document start/head for relevance prediction, making the initial part of the document more susceptible to adversarial attacks. More interestingly, we find a small set of recurring adversarial words that when added to documents result in successful rank demotion/promotion of any relevant/non-relevant document respectively. Finally, our adversarial tokens also show particular topic preferences within and across datasets, exposing potential biases from BERT pre-training or downstream datasets.

AB - Contextual ranking models based on BERT are now well established for a wide range of passage and document ranking tasks. However, the robustness of BERT-based ranking models under adversarial inputs is under-explored. In this paper, we argue that BERT-rankers are not immune to adversarial attacks targeting retrieved documents given a query. Firstly, we propose algorithms for adversarial perturbation of both highly relevant and non-relevant documents using gradient-based optimization methods. The aim of our algorithms is to add/replace a small number of tokens to a highly relevant or non-relevant document to cause a large rank demotion or promotion. Our experiments show that a small number of tokens can already result in a large change in the rank of a document. Moreover, we find that BERT-rankers heavily rely on the document start/head for relevance prediction, making the initial part of the document more susceptible to adversarial attacks. More interestingly, we find a small set of recurring adversarial words that when added to documents result in successful rank demotion/promotion of any relevant/non-relevant document respectively. Finally, our adversarial tokens also show particular topic preferences within and across datasets, exposing potential biases from BERT pre-training or downstream datasets.

KW - adversarial attack

KW - bert

KW - biases

KW - neural networks

KW - ranking

UR - http://www.scopus.com/inward/record.url?scp=85138329521&partnerID=8YFLogxK

U2 - 10.1145/3539813.3545122

DO - 10.1145/3539813.3545122

M3 - Conference contribution

AN - SCOPUS:85138329521

T3 - ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval

SP - 115

EP - 120

BT - ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval

PB - Association for Computing Machinery (ACM)

Y2 - 11 July 2022 through 12 July 2022

ER -

Wang Y, Lyu L, Anand A. BERT Rankers are Brittle: A Study using Adversarial Document Perturbations. In ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval. Association for Computing Machinery (ACM). 2022. p. 115-120. (ICTIR 2022 - Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval). doi: 10.1145/3539813.3545122
