Improving Quality of a Post's Set of Answers in Stack Overflow

Mohammadreza Tavakoli, Maliheh Izadi, Abbas Heydarnoori

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

3 Citations (Scopus)

Abstract

Community Question Answering platforms such as Stack Overflow help a wide range of users solve their challenges on-line. As the popularity of these communities has grown over the years, both the number of members and posts have escalated. Also, due to the diverse backgrounds, skills, expertise, and viewpoints of users, each question may obtain more than one answer. Therefore, the focus has changed toward producing posts that have a set of answers more valuable for the community as a whole, not just one accepted-answer aimed at satisfying only the question-asker. Same as every universal community, a large number of low-quality posts on Stack Overflow require improvement. We call these posts "deficient", and define them as posts with questions that either have no answer yet or can be improved by other ones. In this paper, we propose an approach to automate the identification process of such posts and boost their set of answers, utilizing the help of related experts. With the help of 60 participants, we trained a classification model to identify deficient posts by investigating the relationship between characteristics of 3075 questions posted on Stack Overflow and their need for better answers set. Then, we developed an Eclipse plugin named SOPI and integrated the prediction model in the plugin to link these deficient posts to related developers (in terms of their development context and expertise area) and help them improve the answer set. We evaluated both the functionality of our plugin and the impact of answers submitted to Stack Overflow with the help of 10 and 15 expert industrial developers, respectively. Our results indicate that decision trees, specifically the J48 algorithm, predicts a deficient question better than the other methods with 94.5% precision and 90.3% recall. We conclude that not only our plugin helps programmers contribute more easily to Stack Overflow, but also it improves the quality of existing answers.

Original languageEnglish
Title of host publicationProceedings - 46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020
EditorsAntonio Martini, Manuel Wimmer, Amund Skavhaug
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages504-512
Number of pages9
ISBN (Electronic)9781728195322
DOIs
Publication statusPublished - Aug 2020
Externally publishedYes
Event46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020 - Kranj, Slovenia
Duration: 26 Aug 202028 Aug 2020

Publication series

NameProceedings - 46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020

Conference

Conference46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020
Country/TerritorySlovenia
CityKranj
Period26/08/2028/08/20

Keywords

  • Machine Learning
  • Prediction Models
  • Question Answering
  • Recommender systems

Fingerprint

Dive into the research topics of 'Improving Quality of a Post's Set of Answers in Stack Overflow'. Together they form a unique fingerprint.

Cite this