Sequential pattern mining using personalized minimum support threshold with minimum items

One of the challenges of Sequential Pattern Mining is finding frequent sequential patterns in a huge click stream data (web logs) since the data has the issue of a very low support distribution.By applying a Frequent Pattern Discovery technique, a sequence is considered as frequent if it occurs more...

全面介绍

Saved in:
书目详细资料
Main Authors: Alias, Suraya, Razali, Mohd Norhisham, Tan, Soo Fun, Sainin, Mohd Shamrie
格式: Conference or Workshop Item
语言:English
出版: 2011
主题:
在线阅读:http://repo.uum.edu.my/12320/1/06.pdf
http://repo.uum.edu.my/12320/
http://dx.doi.org/10.1109/ICRIIS.2011.6125688
标签: 添加标签
没有标签, 成为第一个标记此记录!
id my.uum.repo.12320
record_format eprints
spelling my.uum.repo.123202014-10-21T01:05:34Z http://repo.uum.edu.my/12320/ Sequential pattern mining using personalized minimum support threshold with minimum items Alias, Suraya Razali, Mohd Norhisham Tan, Soo Fun Sainin, Mohd Shamrie QA76 Computer software One of the challenges of Sequential Pattern Mining is finding frequent sequential patterns in a huge click stream data (web logs) since the data has the issue of a very low support distribution.By applying a Frequent Pattern Discovery technique, a sequence is considered as frequent if it occurs more than the minimum support (min sup) threshold value.The conventional method of assuming one min sup value is valid for all levels of k-sequence, may have an impact on the overall results or pattern generation. In this paper, a personalized minimum support (P_minsup) threshold with user specified minimum items or min_i is introduced. The P_minsup is generated for each k-sequence by analyzing the overall support pattern distribution of the click stream data; while the min_i value gives the user the flexibility to gain control on the number of patterns to be generated on the next k-sequence by using the top min_i items. This approach is then applied in the SPADE Algorithm using vector array as an extension from the previous method of using relational database and pre-defined threshold.The result from this experiment demonstrates that P_minsup with the complement of min_i value approach is applicable in assisting the process of determining the suitable threshold value to be used in detecting users' frequent k-sequential topics in navigating the World Wide Web (WWW). 2011 Conference or Workshop Item PeerReviewed application/pdf en http://repo.uum.edu.my/12320/1/06.pdf Alias, Suraya and Razali, Mohd Norhisham and Tan, Soo Fun and Sainin, Mohd Shamrie (2011) Sequential pattern mining using personalized minimum support threshold with minimum items. In: International Conference on Research and Innovation in Information Systems (ICRIIS), 23-24 Nov. 2011, Kuala Lumpur. http://dx.doi.org/10.1109/ICRIIS.2011.6125688 doi:10.1109/ICRIIS.2011.6125688
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutionali Repository
url_provider http://repo.uum.edu.my/
language English
topic QA76 Computer software
spellingShingle QA76 Computer software
Alias, Suraya
Razali, Mohd Norhisham
Tan, Soo Fun
Sainin, Mohd Shamrie
Sequential pattern mining using personalized minimum support threshold with minimum items
description One of the challenges of Sequential Pattern Mining is finding frequent sequential patterns in a huge click stream data (web logs) since the data has the issue of a very low support distribution.By applying a Frequent Pattern Discovery technique, a sequence is considered as frequent if it occurs more than the minimum support (min sup) threshold value.The conventional method of assuming one min sup value is valid for all levels of k-sequence, may have an impact on the overall results or pattern generation. In this paper, a personalized minimum support (P_minsup) threshold with user specified minimum items or min_i is introduced. The P_minsup is generated for each k-sequence by analyzing the overall support pattern distribution of the click stream data; while the min_i value gives the user the flexibility to gain control on the number of patterns to be generated on the next k-sequence by using the top min_i items. This approach is then applied in the SPADE Algorithm using vector array as an extension from the previous method of using relational database and pre-defined threshold.The result from this experiment demonstrates that P_minsup with the complement of min_i value approach is applicable in assisting the process of determining the suitable threshold value to be used in detecting users' frequent k-sequential topics in navigating the World Wide Web (WWW).
format Conference or Workshop Item
author Alias, Suraya
Razali, Mohd Norhisham
Tan, Soo Fun
Sainin, Mohd Shamrie
author_facet Alias, Suraya
Razali, Mohd Norhisham
Tan, Soo Fun
Sainin, Mohd Shamrie
author_sort Alias, Suraya
title Sequential pattern mining using personalized minimum support threshold with minimum items
title_short Sequential pattern mining using personalized minimum support threshold with minimum items
title_full Sequential pattern mining using personalized minimum support threshold with minimum items
title_fullStr Sequential pattern mining using personalized minimum support threshold with minimum items
title_full_unstemmed Sequential pattern mining using personalized minimum support threshold with minimum items
title_sort sequential pattern mining using personalized minimum support threshold with minimum items
publishDate 2011
url http://repo.uum.edu.my/12320/1/06.pdf
http://repo.uum.edu.my/12320/
http://dx.doi.org/10.1109/ICRIIS.2011.6125688
_version_ 1644280881903304704
score 13.252575