A framework for data mining and knowledge discovery in cloud computing
No Thumbnail Available
Date
2016
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The massive amounts of data being generated in the current world of information technology have increased from terabytes to petabytes in volume. The fact that extracting knowledge from large-scale data is a challenging issue creates a great demand for cloud computing because of its potential benefits such as scalable storage and processing services. Considering this motivation, this chapter introduces a novel framework, data mining in cloud computing (DMCC), that allows users to apply classification, clustering, and association rule mining methods on huge amounts of data efficiently by combining data mining, cloud computing, and parallel computing technologies. The chapter discusses the main architectural components, interfaces, features, and advantages of the proposed DMCC framework. This study also compares the running times when data mining algorithms are executed in serial and parallel in a cloud environment through DMCC framework. Experimental results show that DMCC greatly decreases the execution times of data mining algorithms. © Springer International Publishing Switzerland 2016. All rights reserved.
Description
Keywords
Association rules , Classification (of information) , Cloud computing , Data reduction , Digital storage , Architectural components , Association rule mining methods , Clustering , Data mining algorithm , Data mining and knowledge discovery , DMCC , Parallel com- puting , Potential benefits , Data mining