Abstract
In this work, we address the information overload issue that learners in Massive Open Online Courses (MOOCs) face when attempting to close their knowledge gaps via the use of MOOC discussion forums. To this end, we investigate the recommendation of one-minute-resolution video clips given the textual similarity between the clips’ transcripts and MOOC discussion forum entries. We first create a large-scale dataset from Khan Academy video transcripts and their forum discussions. We then investigate the effectiveness of applying pre-trained transformers-based neural retrieval models to rank video clips in response to a forum discussion. The retrieval models are trained with supervised learning and distant supervision to effectively leverage the unlabeled data—which accounts for more than 80% of all available data. Our experimental results demonstrate that the proposed method is effective for this task, by outperforming a standard baseline by 0.208 on the absolute change in terms of precision.
Original language | English |
---|---|
Pages | 705-709 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 2022 |
Event | 15th International Conference on Educational Data Mining - Durham University, Durham, United Kingdom Duration: 24 Jul 2022 → 27 Jul 2022 https://educationaldatamining.org/edm2022/ |
Conference
Conference | 15th International Conference on Educational Data Mining |
---|---|
Country/Territory | United Kingdom |
City | Durham |
Period | 24/07/22 → 27/07/22 |
Internet address |
Keywords
- MOOC
- Discussion Forum
- Video Clip Transcripts
- Clip Recommendation