Browsing by Subject "K nearest neighbor algorithm"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
Item Review spam detection based on psychological and linguistic features; [Psikolojik ve Dilbilimsel Özniteliklere Dayali Istenmeyen Inceleme Metni Belirleme](Institute of Electrical and Electronics Engineers Inc., 2018) Onan A.With the advances in information and communication technologies, the immense quantity of review texts have become available on the Web. Review text can serve as an essential source of information for individual decision makers and business organizations. Some of the reviews shared on the Web may contain deceptive information to mislead the existing decision making process. In this study, we have presented a supervised learning based scheme for review spam detection. In the presented study, psychological and linguistic feature sets and their combinations are taken into consideration. In the study, the predictive performances of four conventional supervised learning methods (namely, Naive Bayes classifier, K-nearest neighbor algorithm, support vector machines and C4.5 algorithm) are evaluated on the different feature sets. © 2018 IEEE.Item Satire identification in Turkish news articles based on ensemble of classifiers(Turkiye Klinikleri, 2020) Onan A.; Tocoglu M.A.Social media and microblogging platforms generally contain elements of figurative and nonliteral language, including satire. The identification of figurative language is a fundamental task for sentiment analysis. It will not be possible to obtain sentiment analysis methods with high classification accuracy if elements of figurative language have not been properly identified. Satirical text is a kind of figurative language, in which irony and humor have been utilized to ridicule or criticize an event or entity. Satirical news is a pervasive issue on social media platforms, which can be deceptive and harmful. This paper presents an ensemble scheme for satirical news identification in Turkish news articles. In the presented scheme, linguistic and psychological feature sets have been utilized to extract the feature sets (i.e. linguistic, psychological, personal, spoken categories, and punctuation). In the classification phase, accuracy rates of five supervised learning algorithms (i.e. naive Bayes algorithm, logistic regression, support vector machines, random forest, and k-nearest neighbor algorithm) with three widely utilized ensemble methods (i.e. AdaBoost, bagging, and random subspace) have been considered. Based on the results, we concluded that the random forest algorithm yielded the highest performance, with a classification accuracy of 96.92% for satire detection in Turkish. For deep learning-based architectures, we have achieved classification accuracy of 97.72% with the recurrent neural network architecture with attention mechanism. © 2020 Turkiye Klinikleri. All rights reserved.