Efficiently mining frequent patterns from dense datasets using a cluster of computers

Yudho Giri Sucahyo, Raj P. Gopalan, Amit Rudra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

Efficient mining of frequent patterns from large databases has been an active area of research since it is the most expensive step in association rules mining. In this paper, we present an algorithm for finding complete frequent patterns from very large dense datasets in a cluster environment. The data needs to be distributed to the nodes of the cluster only once and the mining can be performed in parallel many times with different parameter settings for minimum support. The algorithm is based on a master-slave scheme where a coordinator controls the data parallel programs running on a number of nodes of the cluster. The parallel program was executed on a cluster of Alpha SMPs. The performance of the algorithm was studied on small and large dense datasets. We report the results of the experiments that show both speed up and scale up of our algorithm along with our conclusions and pointers for further work.

Original languageEnglish
Title of host publicationAI 2003
Subtitle of host publicationAdvances in Artificial Intelligence - 16th Australian Conference on AI, Proceedings
EditorsTamas D. Gedeon, Lance Chun Che Fung, Tamas D. Gedeon
PublisherSpringer Verlag
Pages233-244
Number of pages12
ISBN (Print)9783540206460
DOIs
Publication statusPublished - 2003
Event16th Australian Conference on Artificial Intelligence, AI 2003 - Perth, Australia
Duration: 3 Dec 20035 Dec 2003

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2903
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th Australian Conference on Artificial Intelligence, AI 2003
Country/TerritoryAustralia
CityPerth
Period3/12/035/12/03

Fingerprint

Dive into the research topics of 'Efficiently mining frequent patterns from dense datasets using a cluster of computers'. Together they form a unique fingerprint.

Cite this