Please use this identifier to cite or link to this item: http://dx.doi.org/10.18419/opus-12130
Authors: Senger, Tobias
Title: A unified open- and closed-source software requirements dataset
Issue Date: 2022
metadata.ubs.publikation.typ: Abschlussarbeit (Bachelor)
metadata.ubs.publikation.seiten: 85
URI: http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-121478
http://elib.uni-stuttgart.de/handle/11682/12147
http://dx.doi.org/10.18419/opus-12130
Abstract: Requirements Engineering (RE) has proven to be an important factor for the success of a software project. The common use of natural language for writing requirements often results in problems that should be detected and avoided early. For this reason, we want to build automatic tools to support the process of specifying requirements using Deep Learning (DL). However, training robust DL models is very data-intensive and the RE community still suffers from a lack of large-scale requirement datasets that are easy to use. Therefore, the goal of this study is to create such a dataset that can be used for various tasks in the RE domain. To do this, we collect functional and non-functional requirements from a large number of both open and closed source software projects and combine them into a unified dataset using a simple data format. We then train a DL model for automatically classifying functional and non-functional requirements to show the potential of our dataset for training efficient DL models. We compare its performance with a state-of-the-art model and students at the University of Stuttgart. We also examine the differences between the open and closed source requirements in our dataset and compare the textual corpus of our dataset with common English datasets and corpora. Our studies showed that our model outperforms both the state-of-the-art model and most of the students. Further, we observed remarkable differences between the open and closed source requirements and found that our requirements use a unique vocabulary compared to common English texts.
Appears in Collections:05 Fakultät Informatik, Elektrotechnik und Informationstechnik

Files in This Item:
File Description SizeFormat 
Bachelorarbeit_BSc_Informatik_Tobias_Senger_2022.pdf1,86 MBAdobe PDFView/Open


Items in OPUS are protected by copyright, with all rights reserved, unless otherwise indicated.