Browsing by Author "Birant, D"
Now showing 1 - 5 of 5
Results Per Page
Sort Options
Item An ant colony optimisation approach for optimising SPARQL queries by reordering triple patternsKalayci, EG; Kalayci, TE; Birant, DProcessing the excessive volumes of information on the Web is an important issue. The Semantic Web paradigm has been proposed as the solution. However, this approach generates several challenges, such as query processing and optimisation. This paper proposes a novel approach for optimising SPARQL queries with different graph shapes. This new method reorders the triple patterns using Ant Colony Optimisation (ACO) algorithms. Reordering the triple patterns is a way of decreasing the execution times of the SPARQL queries. The proposed approach is focused on in-memory models of RDF data, and it optimises the SPARQL queries by means of Ant System, Elitist Ant System and MAX-MIN Ant System algorithms. The approach is implemented in the Apache Jena ARQ query engine, which is used for the experimentation, and the new method is compared with Normal Execution, Jena Reorder Algorithms, and the Stocker et al. Algorithms. All of the experiments are performed using the LUBM dataset for various shapes of queries, such as chain, star, cyclic, and chain-star. The first contribution is the real-time optimisation of SPARQL query triple pattern orders using ACO algorithms, and the second contribution is the concrete implementation for the ARQ query engine, which is a component of the widely used Semantic Web framework Apache Jena. The experiments demonstrate that the proposed method reduces the execution time of the queries significantly. (C) 2015 Elsevier Ltd. All rights reserved.Item Naive Bayes Classifier for Continuous Variables using Novel Method (NBC4D) and DistributionsYildirim, P; Birant, DIn data mining, when using Naive Bayes classification technique, it is necessary to overcome the problem of how to deal with continuous attributes. Most previous work has solved the problem either by using discretization, normal method or kernel method. This study proposes the usage of different continuous probability distribution techniques for Naive Bayes classification. It explores various probability density functions of distributions. The experimental results show that the proposed probability distributions also classify continuous data with potentially high accuracy. In addition, this paper introduces a novel method, named NBC4D, which offers a new approach for classification by applying different distribution types on different attributes. The results (obtained classification accuracy rates) show that our proposed method (the usage of more than one distribution types) has success on real-world datasets when compared with the usage of only one well known distribution type.Item Development of an Interactive Game-Based Learning Environment to Teach Data MiningCengiz, M; Birant, KU; Yildirim, P; Birant, DGame-based learning has become a popular topic in all levels of education. A number of computer games have been developed to teach different subjects such as mathematics, English language, medicine, and music. This paper presents the first study that proposes the development of edutainment games to teach data mining techniques with the scope of gamebased learning. The aim of this study is to provide an environment that is both fun and enables the achievement of learning goals in data mining training in computer engineering. An escape game called Mine4Escape, which consists of different rooms to teach different data mining techniques (classification and association rule mining), has been developed for individuals at the undergraduate and post-graduate levels. The advantages of the proposed approach are discussed in comparison with traditional data mining training. In addition, this paper describes a dynamic scoring system designed for game-based learning. Finally, an experimental study was carried out to evaluate the performance of our learning environment by analyzing feedback received from a test group consisting of 39 undergraduate and graduate students in computer engineering. The findings from the questionnaire show that it is possible to enhance knowledge acquisition about data mining via the game-based approach. However, the degree of learning interest and information acceptance changes according to students' age, gender, educational level, and game habits.Item EBOC: Ensemble-Based Ordinal Classification in TransportationYildirim, P; Birant, UK; Birant, DLearning the latent patterns of historical data in an efficient way to model the behaviour of a system is a major need for making right decisions. For this purpose, machine learning solution has already begun its promising marks in transportation as well as in many areas such as marketing, finance, education, and health. However, many classification algorithms in the literature assume that the target attribute values in the datasets are unordered, so they lose inherent order between the class values. To overcome the problem, this study proposes a novel ensemble-based ordinal classification (EBOC) approach which suggests bagging and boosting (AdaBoost algorithm) methods as a solution for ordinal classification problem in transportation sector. This article also compares the proposed EBOC approach with ordinal class classifier and traditional tree-based classification algorithms (i.e., C4.5 decision tree, RandomTree, and REPTree) in terms of accuracy. The results indicate that the proposed EBOC approach achieves better classification performance than the conventional solutions.Item Comparative Analysis of Ensemble Learning Methods for Signal ClassificationYildirim, P; Birant, KU; Radevski, V; Kut, A; Birant, DIn recent years, the machine learning algorithms commenced to be used widely in signal classification area as well as many other areas. Ensemble learning has become one of the most popular Machine Learning approaches due to the high classification performance it provides. In this study, the application of four fundamental ensemble learning methods (Bagging, Boosting, Stacking, and Voting) with five different classification algorithms (Neural Network, Support Vector Machines, k-Nearest Neighbor, Naive Bayes, and C4.5) with the most optimal parameter values on signal datasets is presented. In the experimental studies, ensemble learning methods were applied on 14 different signal datasets and the results were compared in terms of classification accuracy rates. According to the results, the best classification performance was obtained with the Random Forest algorithm which is a Bagging based method.