TY - GEN
T1 - A dataset for pull-based development research
AU - Gousios, Georgios
AU - Zaidman, Andy
PY - 2014/5/31
Y1 - 2014/5/31
N2 - Pull requests form a new method for collaborating in distributed software development. To study the pull request distributed development model, we constructed a dataset of almost 900 projects and 350,000 pull requests, including some of the largest users of pull requests on Github. In this paper, we describe how the project selection was done, we analyze the selected features and present a machine learning tool set for the R statistics environment.
AB - Pull requests form a new method for collaborating in distributed software development. To study the pull request distributed development model, we constructed a dataset of almost 900 projects and 350,000 pull requests, including some of the largest users of pull requests on Github. In this paper, we describe how the project selection was done, we analyze the selected features and present a machine learning tool set for the R statistics environment.
KW - Distributed software development
KW - Empirical software engineering
KW - Pull request
KW - Pull-based development
UR - http://www.scopus.com/inward/record.url?scp=84938775992&partnerID=8YFLogxK
U2 - 10.1145/2597073.2597122
DO - 10.1145/2597073.2597122
M3 - Conference contribution
AN - SCOPUS:84938775992
T3 - 11th Working Conference on Mining Software Repositories, MSR 2014 - Proceedings
SP - 368
EP - 371
BT - 11th Working Conference on Mining Software Repositories, MSR 2014 - Proceedings
PB - ACM
T2 - 11th International Working Conference on Mining Software Repositories, MSR 2014
Y2 - 31 May 2014 through 1 June 2014
ER -