Abstract
Pull-based development is a widely adopted paradigm for collaboration in distributed software development, attracting eyeballs from both academic and industry. To better study pull-based development model, this paper presents a new dataset containing 96 features collected from 11,230 projects and 3,347,937 pull re- quests. We describe the creation process and explain the features in details. To the best of our knowledge, our dataset is the most comprehensive and largest one toward a complete picture for pull-based development research.
Original language | English |
---|---|
Title of host publication | MSR 2020 data showcase |
Subtitle of host publication | 2020 IEEE/ACM 17th International Conference on Mining Software Repositories (MSR) |
Pages | 543-547 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-4503-7517-7 |
DOIs | |
Publication status | Published - 29 Jun 2020 |
Event | 17th International Conference on Mining Software Repositories - Seoul, Korea, Republic of Duration: 5 Oct 2020 → 6 Oct 2020 Conference number: 17 |
Conference
Conference | 17th International Conference on Mining Software Repositories |
---|---|
Abbreviated title | MSR 20 |
Country/Territory | Korea, Republic of |
City | Seoul |
Period | 5/10/20 → 6/10/20 |
Other | Virtual/online event due to COVID-19 |
Bibliographical note
Virtual/online event due to COVID-19Keywords
- distributed software development
- pull request
- pull-based development