Openly accessible

Automatic pattern-taxonomy extraction for web mining

Wu, Sheng-Tang, Li, Yuefeng, Xu, Yue, Pham, Binh and Chen, Yi-Ping Phoebe 2004, Automatic pattern-taxonomy extraction for web mining, in IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings, IEEE Xplore, Piscataway, N.J., pp. 242-248.

Attached Files
Name chen-automaticpatterntaxonomy-2004.pdf
Description Published version
MIME type application/pdf
Size 159.04KB
Downloads 1341

Title Automatic pattern-taxonomy extraction for web mining
Author(s) Wu, Sheng-Tang
Li, Yuefeng
Xu, Yue
Pham, Binh
Chen, Yi-Ping Phoebe
Conference name IEEE/WIC/ACM International Conference on Intelligent Agent Technology (2004 : Beijing, China)
Conference location Beijing, China
Conference dates 20-24 September 2004
Title of proceedings IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings
Editor(s) Zhong, Ning
Tirri, Henry
Yao, Yiyu
Zhou, Lizhu
Liu, Jiming
Cercone, Nick
Publication date 2004
Start page 242
End page 248
Publisher IEEE Xplore
Place of publication Piscataway, N.J.
Summary In this paper, we propose a model for discovering frequent sequential patterns, phrases, which can be used as profile descriptors of documents. It is indubitable that we can obtain numerous phrases using data mining algorithms. However, it is difficult to use these phrases effectively for answering what users want. Therefore, we present a pattern taxonomy extraction model which performs the task of extracting descriptive frequent sequential patterns by pruning the meaningless ones. The model then is extended and tested by applying it to the information filtering system. The results of the experiment show that pattern-based methods outperform the keyword-based methods. The results also indicate that removal of meaningless patterns not only reduces the cost of computation but also improves the effectiveness of the system.
ISBN 0769521002
9780769521008
Language eng
Field of Research 080609 Information Systems Management
HERDC Research category E1 Full written paper - refereed
Copyright notice ©2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Persistent URL http://hdl.handle.net/10536/DRO/DU:30009625

Document type: Conference Paper
Collections: School of Information Technology
Open Access Collection
 
Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au.

Access Statistics: 506 Abstract Views, 1341 File Downloads
Created: Tue, 14 Oct 2008, 07:00:18 EST