One-off events? An empirical study of hackathon code creation and reuse.

Code reuse Empirical study Hackathon Mining software repositories Survey World of code

Journal

Empirical software engineering
ISSN: 1573-7616
Titre abrégé: Empir Softw Eng
Pays: United States
ID NLM: 101769304

Informations de publication

Date de publication:
2022
Historique:
accepted: 02 07 2022
entrez: 26 9 2022
pubmed: 27 9 2022
medline: 27 9 2022
Statut: ppublish

Résumé

Hackathons have become popular events for teams to collaborate on projects and develop software prototypes. Most existing research focuses on activities during an event with limited attention to the evolution of the hackathon code. We aim to understand the evolution of code used in and created during hackathon events, with a particular focus on the code blobs, specifically, how frequently hackathon teams reuse pre-existing code, how much new code they develop, if that code gets reused afterwards, and what factors affect reuse. We collected information about 22,183 hackathon projects from Devpost and obtained related code blobs, authors, project characteristics, original author, code creation time, language, and size information from World of Code. We tracked the reuse of code blobs by identifying all commits containing blobs created during hackathons and identifying all projects that contain those commits. We also conducted a series of surveys in order to gain a deeper understanding of hackathon code evolution that we sent out to hackathon participants whose code was reused, whose code was not reused, and developers who reused some hackathon code. 9.14% of the code blobs in hackathon repositories and 8% of the lines of code (LOC) are created during hackathons and around a third of the hackathon code gets reused in other projects by both blob count and LOC. The number of associated technologies and the number of participants in hackathons increase reuse probability. The results of our study demonstrates hackathons are not always "one-off" events as the common knowledge dictates and it can serve as a starting point for further studies in this area.

Identifiants

pubmed: 36159898
doi: 10.1007/s10664-022-10201-x
pii: 10201
pmc: PMC9489595
doi:

Types de publication

Journal Article

Langues

eng

Pagination

167

Informations de copyright

© The Author(s) 2022.

Références

Int J Med Educ. 2011 Jun 27;2:53-55
pubmed: 28029643
Gigascience. 2016 Mar 31;5:16
pubmed: 27042293
F1000Res. 2016 Apr 13;5:672
pubmed: 27134733
Proc Natl Acad Sci U S A. 2018 Sep 4;115(36):8872-8877
pubmed: 30127025
BMC Bioinformatics. 2014;15 Suppl 14:S7
pubmed: 25472764

Auteurs

Tapajit Dey (T)

Lero-the Irish Software Research Centre, University of Limerick, Limerick, Ireland.

Alexander Nolte (A)

University of Tartu, Tartu, Estonia.
Carnegie Mellon University, Pittsburgh, PA USA.

Audris Mockus (A)

University of Tennessee, Knoxville, TN USA.

James D Herbsleb (JD)

Carnegie Mellon University, Pittsburgh, PA USA.

Classifications MeSH