A fine-grained data set and analysis of tangling in bug fixing commits

dc.contributor.authorHerbold, Steffen
dc.contributor.authorTrautsch, Alexander
dc.contributor.authorLedel, Benjamin
dc.contributor.authorAghamohammadi, Alireza
dc.contributor.authorGhaleb, Taher A.
dc.contributor.authorChahal, Kuljit Kaur
dc.contributor.authorBossenmaier, Tim
dc.contributor.authorNagaria, Bhaveet
dc.contributor.authorMakedonski, Philip
dc.contributor.authorAhmadabadi, Matin Nili
dc.contributor.authorSzabados, Kristof
dc.contributor.authorSpieker, Helge
dc.contributor.authorMadeja, Matej
dc.contributor.authorHoy, Nathaniel
dc.contributor.authorLenarduzzi, Valentina
dc.contributor.authorWang, Shangwen
dc.contributor.authorRodríguez-Pérez, Gema
dc.contributor.authorColomo-Palacios, Ricardo
dc.contributor.authorVerdecchia, Roberto
dc.contributor.authorSingh, Paramvir
dc.contributor.authorQin, Yihao
dc.contributor.authorChakroborti, Debasish
dc.contributor.authorDavis, Willard
dc.contributor.authorWalunj, Vijay
dc.contributor.authorWu, Hongjun
dc.contributor.authorMarcilio, Diego
dc.contributor.authorAlam, Omar
dc.contributor.authorAldaeej, Abdullah
dc.contributor.authorAmit, Idan
dc.contributor.authorTurhan, Burak
dc.contributor.authorEismann, Simon
dc.contributor.authorWickert, Anna-Katharina
dc.contributor.authorMalavolta, Ivano
dc.contributor.authorSulír, Matúš
dc.contributor.authorFard, Fatemeh
dc.contributor.authorHenley, Austin Z.
dc.contributor.authorKourtzanidis, Stratos
dc.contributor.authorTuzun, Eray
dc.contributor.authorTreude, Christoph
dc.contributor.authorShamasbi, Simin Maleki
dc.contributor.authorPashchenko, Ivan
dc.contributor.authorWyrich, Marvin
dc.contributor.authorDavis, James
dc.contributor.authorSerebrenik, Alexander
dc.contributor.authorAlbrecht, Ella
dc.contributor.authorAktas, Ethem Utku
dc.contributor.authorStrüber, Daniel
dc.contributor.authorErbel, Johannes
dc.date.accessioned2024-11-06T15:50:39Z
dc.date.available2024-11-06T15:50:39Z
dc.date.issued2022de
dc.date.updated2024-10-18T08:10:50Z
dc.description.abstractContext: Tangled commits are changes to software that address multiple concerns at once. For researchers interested in bugs, tangled commits mean that they actually study not only bugs, but also other concerns irrelevant for the study of bugs. Objective: We want to improve our understanding of the prevalence of tangling and the types of changes that are tangled within bug fixing commits. Methods: We use a crowd sourcing approach for manual labeling to validate which changes contribute to bug fixes for each line in bug fixing commits. Each line is labeled by four participants. If at least three participants agree on the same label, we have consensus. Results: We estimate that between 17% and 32% of all changes in bug fixing commits modify the source code to fix the underlying problem. However, when we only consider changes to the production code files this ratio increases to 66% to 87%. We find that about 11% of lines are hard to label leading to active disagreements between participants. Due to confirmed tangling and the uncertainty in our data, we estimate that 3% to 47% of data is noisy without manual untangling, depending on the use case.en
dc.description.sponsorshipTechnische Universität Clausthalde
dc.identifier.issn1573-7616
dc.identifier.issn1382-3256
dc.identifier.other1909570680
dc.identifier.urihttp://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-152303de
dc.identifier.urihttp://elib.uni-stuttgart.de/handle/11682/15230
dc.identifier.urihttp://dx.doi.org/10.18419/opus-15211
dc.language.isoende
dc.relation.uridoi:10.1007/s10664-021-10083-5de
dc.rightsinfo:eu-repo/semantics/openAccessde
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/de
dc.subject.ddc620de
dc.titleA fine-grained data set and analysis of tangling in bug fixing commitsen
dc.typearticlede
ubs.fakultaetInformatik, Elektrotechnik und Informationstechnikde
ubs.fakultaetFakultätsübergreifend / Sonstige Einrichtungde
ubs.institutInstitut für Software Engineeringde
ubs.institutFakultätsübergreifend / Sonstige Einrichtungde
ubs.publikation.seiten49de
ubs.publikation.sourceEmpirical software engineering 27 (2022), No. 125de
ubs.publikation.typZeitschriftenartikelde

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
s10664-021-10083-5.pdf
Size:
2.22 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.3 KB
Format:
Item-specific license agreed upon to submission
Description: