Exploring performance of instance selection methods in text sentiment classification

dc.contributor.authorOnan A.
dc.contributor.authorKorukoğlu S.
dc.date.accessioned2024-07-22T08:12:23Z
dc.date.available2024-07-22T08:12:23Z
dc.date.issued2016
dc.description.abstractSentiment analysis is the process of extracting subjective information in source materials. Sentiment analysis is a subfield of web and text mining. One major problem encountered in these areas is overwhelming amount of data available. Hence, instance selection and feature selection become two essential tasks for achieving scalability in machine learning based sentiment classification. Instance selection is a data reduction technique which aims to eliminate redundant, noisy data from the training dataset so that training time can be reduced, scalability and generalization ability can be enhanced. This paper examines the predictive performance of fifteen benchmark instance selection methods for text classification domain. The instance selection methods are evaluated by decision tree classifier (C4.5 algorithm) and radial basis function networks in terms of classification accuracy and data reduction rates. The experimental results indicate that the highest classification accuracies on C4.5 algorithm are generally obtained by model class selection method, while the highest classification accuracies on radial basis function networks are obtained by nearest centroid neighbor edition. © Springer International Publishing Switzerland 2016.
dc.identifier.DOI-ID10.1007/978-3-319-33625-1_16
dc.identifier.issn21945357
dc.identifier.urihttp://akademikarsiv.cbu.edu.tr:4000/handle/123456789/16036
dc.language.isoEnglish
dc.publisherSpringer Verlag
dc.subjectArtificial intelligence
dc.subjectBenchmarking
dc.subjectClassification (of information)
dc.subjectData mining
dc.subjectData reduction
dc.subjectDecision trees
dc.subjectFunctions
dc.subjectIntelligent systems
dc.subjectLearning algorithms
dc.subjectLearning systems
dc.subjectNatural language processing systems
dc.subjectRadial basis function networks
dc.subjectScalability
dc.subjectTrees (mathematics)
dc.subjectClassification accuracy
dc.subjectDecision tree classifiers
dc.subjectGeneralization ability
dc.subjectInstance selection
dc.subjectPredictive performance
dc.subjectSentiment classification
dc.subjectSubjective information
dc.subjectText mining
dc.subjectText processing
dc.titleExploring performance of instance selection methods in text sentiment classification
dc.typeConference paper

Files