Is cross-lingual readability assessment possible?

Ion Madrazo Azpiazu; Maria Soledad Pera

doi:10.1002/asi.24293

Is cross-lingual readability assessment possible?

Ion Madrazo Azpiazu^*, Maria Soledad Pera

^*Corresponding author for this work

Research output: Contribution to journal › Article › Scientific › peer-review

14 Citations (Scopus)

Abstract

Most research efforts related to automatic readability assessment focus on the design of strategies that apply to a specific language. These state-of-the-art strategies are highly dependent on linguistic features that best suit the language for which they were intended, constraining their adaptability and making it difficult to determine whether they would remain effective if they were applied to estimate the level of difficulty of texts in other languages. In this article, we present the results of a study designed to determine the feasibility of a cross-lingual readability assessment strategy. For doing so, we first analyzed the most common features used for readability assessment and determined their influence on the readability prediction process of 6 different languages: English, Spanish, Basque, Italian, French, and Catalan. In addition, we developed a cross-lingual readability assessment strategy that serves as a means to empirically explore the potential advantages of employing a single strategy (and set of features) for readability assessment in different languages, including interlanguage prediction agreement and prediction accuracy improvement for low-resource languages.

Original language	English
Pages (from-to)	644-656
Number of pages	13
Journal	Journal of the Association for Information Science and Technology
Volume	71
Issue number	6
DOIs	https://doi.org/10.1002/asi.24293
Publication status	Published - 1 Jun 2020
Externally published	Yes

Access to Document

10.1002/asi.24293

Cite this

@article{f2db13b1273d45f6897f33e58a1e775f,

title = "Is cross-lingual readability assessment possible?",

abstract = "Most research efforts related to automatic readability assessment focus on the design of strategies that apply to a specific language. These state-of-the-art strategies are highly dependent on linguistic features that best suit the language for which they were intended, constraining their adaptability and making it difficult to determine whether they would remain effective if they were applied to estimate the level of difficulty of texts in other languages. In this article, we present the results of a study designed to determine the feasibility of a cross-lingual readability assessment strategy. For doing so, we first analyzed the most common features used for readability assessment and determined their influence on the readability prediction process of 6 different languages: English, Spanish, Basque, Italian, French, and Catalan. In addition, we developed a cross-lingual readability assessment strategy that serves as a means to empirically explore the potential advantages of employing a single strategy (and set of features) for readability assessment in different languages, including interlanguage prediction agreement and prediction accuracy improvement for low-resource languages.",

author = "{Madrazo Azpiazu}, Ion and Pera, {Maria Soledad}",

year = "2020",

month = jun,

day = "1",

doi = "10.1002/asi.24293",

language = "English",

volume = "71",

pages = "644--656",

journal = "Journal of the Association for Information Science and Technology",

issn = "2330-1635",