Abstract
Tasks related to Natural Language Processing (NLP) have recently been the focus of a large research endeavor by the machine learning community. The increased interest in this area is mainly due to the success of deep learning methods. Genetic Programming (GP), however, was not under the spotlight with respect to NLP tasks. Here, we propose a first proof-of-concept that combines GP with the well established NLP tool word2vec for the next word prediction task. The main idea is that, once words have been moved into a vector space, traditional GP operators can successfully work on vectors, thus producing meaningful words as the output. To assess the suitability of this approach, we perform an experimental evaluation on a set of existing newspaper headlines. Individuals resulting from this (pre-)training phase can be employed as the initial population in other NLP tasks, like sentence generation, which will be the focus of future investigations, possibly employing adversarial co-evolutionary approaches.
| Original language | English |
|---|---|
| Title of host publication | GECCO 2020 |
| Subtitle of host publication | Proceedings of the 2020 Genetic and Evolutionary Computation Conference |
| Place of Publication | New York |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 985-993 |
| Number of pages | 9 |
| ISBN (Print) | 978-1-4503-7128-5 |
| DOIs | |
| Publication status | Published - 2020 |
| Event | 2020 Genetic and Evolutionary Computation Conference, GECCO 2020 - Cancun, Mexico Duration: 8 Jul 2020 → 12 Jul 2020 |
Conference
| Conference | 2020 Genetic and Evolutionary Computation Conference, GECCO 2020 |
|---|---|
| Country/Territory | Mexico |
| City | Cancun |
| Period | 8/07/20 → 12/07/20 |
Bibliographical note
Accepted author manuscriptKeywords
- Genetic programming
- Natural language processing
- Next word prediction
Fingerprint
Dive into the research topics of 'Towards an evolutionary-based approach for natural language processing'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver