Ensemble Learning Based Feature Selection with an Application to Text Classification

dc.contributor.authorOnan, A
dc.date.accessioned2024-07-18T11:46:47Z
dc.date.available2024-07-18T11:46:47Z
dc.description.abstractAn important problem of text classification is high dimensionality. The performance of different feature selection methods can change based on the characteristics of different datasets. In this study, a feature selection method is developed, which integrates different filter-based feature selection methods by an ensemble learning approach. In the presented method, feature rankings obtained by five filter-based feature selection methods (mutual information measure, chi-square statistics, odds ratio, information gain and weighted log likelihood ratio) are aggregated by enhanced Borda count rank aggregation. In the experimental analysis, Reuters-21578 and 20 Newsgroups datasets are employed on support vector machines and C4.5 classifier. The experimental results indicate that the presented method outperforms conventional filter-based feature selection schemes.
dc.identifier.issn2165-0608
dc.identifier.urihttp://akademikarsiv.cbu.edu.tr:4000/handle/123456789/3012
dc.language.isoTurkish
dc.publisherIEEE
dc.subjectSCHEME
dc.titleEnsemble Learning Based Feature Selection with an Application to Text Classification
dc.typeProceedings Paper

Files