Utilizing hyperlink transitivity to improve web page clustering

Hou, Jingyu and Zhang, Yanchun 2003, Utilizing hyperlink transitivity to improve web page clustering, in Database technologies 2003 : proceedings of the fourteenth Australasian Database Conference, Australian Computer Society, Sydney, N.S.W., pp. 49-57.

Title Utilizing hyperlink transitivity to improve web page clustering
Author(s) Hou, Jingyu (ORCID iD: orcid.org/0000-0002-6403-9786)
Zhang, Yanchun
Conference name Australasian Database Conference (14th : 2003 : Adelaide, S. Aust.)
Conference location Adelaide, S. Aust.
Conference dates 4-7 February 2003
Title of proceedings Database technologies 2003 : proceedings of the fourteenth Australasian Database Conference
Editor(s) Schewe, Klaus-Dieter
Publication date 2003
Series Australian computer science communications ; v. 25, no. 2
Conference series Australasian Database Conference
Start page 49
End page 57
Publisher Australian Computer Society
Place of publication Sydney, N.S.W.
Keyword(s) World Wide Web
hyperlink analysis
web page similarity
web clustering
Summary The rapid growth in the size and complexity of the web means that search results are often unsatisfactory, because search engines return a huge amount of information. Finding intrinsic relationships among web pages at a higher level, so that retrieved information can be managed and searched efficiently, is becoming a challenging problem. In this paper, we propose an approach to measuring web page similarity that takes hyperlink transitivity and page importance into consideration. Based on this new similarity measure, an effective hierarchical web page clustering algorithm is proposed. Preliminary evaluations show the effectiveness of the new similarity measure and the resulting improvement in web page clustering. The proposed page similarity, as well as the matrix-based hyperlink analysis methods, could be applied to other web-based research areas.
ISBN 090992595X
Language eng
Field of Research 080505 Web Technologies (excl. Web Search)
HERDC Research category E1 Full written paper - refereed
Copyright notice ©2003, Australian Computer Society
Persistent URL http://hdl.handle.net/10536/DRO/DU:30005062

Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Citation counts: TR Web of Science: cited 0 times
Scopus: cited 0 times
Access Statistics: 766 Abstract Views, 41 File Downloads
Created: Mon, 07 Jul 2008, 09:44:56 EST

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au.