A machine learning based approach to identify geo-location of Twitter users

dc.contributor.authorOnan A.
dc.date.accessioned2024-07-22T08:10:49Z
dc.date.available2024-07-22T08:10:49Z
dc.date.issued2017
dc.description.abstractTwitter, a popular microblogging platform, has attracted great attention. Twitter enables people from all over the world to interact in an extremely personal way. The immense quantity of user-generated text messages become available on Twitter that could potentially serve as an important source of information for researchers and practitioners. The information available on Twitter may be utilized for many purposes, such as event detection, public health and crisis management. In order to effectively coordinate such activities, the identification of Twitter users' geo-locations is extremely important. Though online social networks can provide some sort of geo-location information based on GPS coordinates, Twitter suffers from geo-location sparseness problem. The identification of Twitter users' geo-location based on the content of send out messages, becomes extremely important. In this regard, this paper presents a machine learning based approach to the problem. In this study, our corpora is represented as a word vector. To obtain a classification scheme with high predictive performance, the performance of five classification algorithms, three ensemble methods and two feature selection methods are evaluated. Among the compared algorithms, the highest results (84.85%) is achieved by AdaBoost ensemble of Random Forest, when the feature set is selected with the use of consistency-based feature selection method in conjunction with best first search. © 2017 ACM.
dc.identifier.DOI-ID10.1145/3018896.3018969
dc.identifier.urihttp://akademikarsiv.cbu.edu.tr:4000/handle/123456789/15377
dc.language.isoEnglish
dc.publisherAssociation for Computing Machinery
dc.subjectAdaptive boosting
dc.subjectArtificial intelligence
dc.subjectCloud computing
dc.subjectData mining
dc.subjectDecision trees
dc.subjectFeature extraction
dc.subjectInternet of things
dc.subjectLearning systems
dc.subjectLocation
dc.subjectNatural language processing systems
dc.subjectClassification algorithm
dc.subjectFeature selection methods
dc.subjectGeolocations
dc.subjectLocation based
dc.subjectMicro-blogging platforms
dc.subjectOn-line social networks
dc.subjectPredictive performance
dc.subjectText mining
dc.subjectSocial networking (online)
dc.titleA machine learning based approach to identify geo-location of Twitter users
dc.typeConference paper

Files