MetricHunter: A software metric dataset generator utilizing SourceMonitor upon public GitHub repositories

dc.contributor.authorÖzçevik Y.
dc.contributor.authorAltay O.
dc.date.accessioned2024-07-22T08:02:37Z
dc.date.available2024-07-22T08:02:37Z
dc.date.issued2023
dc.description.abstractVersion control systems are pervasively consulted nowadays to obtain software metric datasets. Accordingly, machine learning is applied to predict different aspects of a software including quality monitoring, influence analysis, etc. However, construction of a metric dataset is challenging and the dataset content may affect the success of the learning-based models. In this study, we propose a dataset construction tool, MetricHunter, which is able to produce platform/language specific datasets that can be used for predicting the features of newly created software. The proposed tool is developed by C# programming language utilizing a known metric gathering tool, i.e. SourceMonitor, and the GitHub REST API for public repositories. Thus, one can construct a proper dataset from a graphical user interface by simply specifying the programming language or target platform. The outputs of the tool on a set of repositories are validated by investigating automatically generated attribute values and comparing them with the measurements of metric gathering tools as well as the GitHub metric values. © 2023 The Author(s)
dc.identifier.DOI-ID10.1016/j.softx.2023.101499
dc.identifier.issn23527110
dc.identifier.urihttp://akademikarsiv.cbu.edu.tr:4000/handle/123456789/11938
dc.language.isoEnglish
dc.publisherElsevier B.V.
dc.rightsAll Open Access; Gold Open Access
dc.subjectApplication programming interfaces (API)
dc.subjectComputer software selection and evaluation
dc.subjectControl systems
dc.subjectGraphical user interfaces
dc.subjectInformation management
dc.subjectLearning systems
dc.subjectC++ programming
dc.subjectDataset construction
dc.subjectGit version control system
dc.subjectInfluence analysis
dc.subjectLearning Based Models
dc.subjectMachine-learning
dc.subjectQuality monitoring
dc.subjectSoftware metrics
dc.subjectSoftware Quality
dc.subjectVersion control system
dc.subjectQuality control
dc.titleMetricHunter: A software metric dataset generator utilizing SourceMonitor upon public GitHub repositories
dc.typeArticle

Files