On the Shoulders of Giants: A New Dataset for Pull-based Development Research

Xunhui Zhang, Ayushi Rastogi, Yue Yu

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientific

10 Citations (Scopus)

Abstract

Pull-based development is a widely adopted paradigm for collaboration in distributed software development, attracting eyeballs from both academic and industry. To better study pull-based development model, this paper presents a new dataset containing 96 features collected from 11,230 projects and 3,347,937 pull re- quests. We describe the creation process and explain the features in details. To the best of our knowledge, our dataset is the most comprehensive and largest one toward a complete picture for pull-based development research.
Original languageEnglish
Title of host publicationMSR 2020 data showcase
Subtitle of host publication2020 IEEE/ACM 17th International Conference on Mining Software Repositories (MSR)
Pages543-547
Number of pages5
ISBN (Electronic)978-1-4503-7517-7
DOIs
Publication statusPublished - 29 Jun 2020
Event17th International Conference on Mining Software Repositories - Seoul, Korea, Republic of
Duration: 5 Oct 20206 Oct 2020
Conference number: 17

Conference

Conference17th International Conference on Mining Software Repositories
Abbreviated titleMSR 20
Country/TerritoryKorea, Republic of
CitySeoul
Period5/10/206/10/20
OtherVirtual/online event due to COVID-19

Bibliographical note

Virtual/online event due to COVID-19

Keywords

  • distributed software development
  • pull request
  • pull-based development

Fingerprint

Dive into the research topics of 'On the Shoulders of Giants: A New Dataset for Pull-based Development Research'. Together they form a unique fingerprint.

Cite this