MetricHunter: A software metric dataset generator utilizing SourceMonitor upon public GitHub repositories
dc.contributor.author | Özçevik Y. | |
dc.contributor.author | Altay O. | |
dc.date.accessioned | 2024-07-22T08:02:37Z | |
dc.date.available | 2024-07-22T08:02:37Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Version control systems are now widely used as a source of software metric datasets. Machine learning is then applied to these datasets to predict different aspects of software, including quality monitoring and influence analysis. However, constructing a metric dataset is challenging, and the dataset content may affect the success of learning-based models. In this study, we propose a dataset construction tool, MetricHunter, which produces platform- or language-specific datasets that can be used to predict the features of newly created software. The tool is developed in the C# programming language and uses an established metric-gathering tool, SourceMonitor, together with the GitHub REST API for public repositories. Thus, one can construct a suitable dataset from a graphical user interface by simply specifying the programming language or target platform. The outputs of the tool on a set of repositories are validated by inspecting the automatically generated attribute values and comparing them with the measurements of metric-gathering tools as well as the GitHub metric values. © 2023 The Author(s) | |
dc.identifier.DOI-ID | 10.1016/j.softx.2023.101499 | |
dc.identifier.issn | 23527110 | |
dc.identifier.uri | http://akademikarsiv.cbu.edu.tr:4000/handle/123456789/11938 | |
dc.language.iso | English | |
dc.publisher | Elsevier B.V. | |
dc.rights | All Open Access; Gold Open Access | |
dc.subject | Application programming interfaces (API) | |
dc.subject | Computer software selection and evaluation | |
dc.subject | Control systems | |
dc.subject | Graphical user interfaces | |
dc.subject | Information management | |
dc.subject | Learning systems | |
dc.subject | C++ programming | |
dc.subject | Dataset construction | |
dc.subject | Git version control system | |
dc.subject | Influence analysis | |
dc.subject | Learning Based Models | |
dc.subject | Machine-learning | |
dc.subject | Quality monitoring | |
dc.subject | Software metrics | |
dc.subject | Software Quality | |
dc.subject | Version control system | |
dc.subject | Quality control | |
dc.title | MetricHunter: A software metric dataset generator utilizing SourceMonitor upon public GitHub repositories | |
dc.type | Article |