Deakin University
Browse

File(s) under permanent embargo

Web page clustering : a hyperlink-based similarity and matrix-based hierarchical algorithms

conference contribution
posted on 2003-01-01, 00:00 authored by Jingyu HouJingyu Hou, Y Zhang, J Cao
This paper proposes a hyperlink-based web page similarity measurement and two matrix-based hierarchical web page clustering algorithms. The web page similarity measurement incorporates hyperlink transitivity and page importance within the concerned web page space. One clustering algorithm takes cluster overlapping into account, another one does not. These algorithxms do not require predefined similarity thresholds for clustering, and are independent of the page order. The primary evaluations show the effectiveness of the proposed algorithms in clustering improvement.

History

Title of proceedings

APWeb 2003 : web technologies and applications : 5th Asia-Pacific Web Conference proceedings

Event

Asia-Pacific Web Conference (5th : 2003 : Xi'an, Shaanxi Sheng, China)

Series

Lecture notes in computer science ; 2642

Pagination

201 - 212

Publisher

Springer

Location

Xi'an, Shaanxi Sheng, China

Place of publication

New York N.Y.

Start date

2003-04-23

End date

2003-04-25

ISSN

0302-9743

eISSN

1611-3349

ISBN-13

9783540023548

ISBN-10

3540023542

Language

eng

Notes

The original publication can be found at www.springerlink.com

Publication classification

E1 Full written paper - refereed

Copyright notice

2003, Springer-Verlag Berlin Heidelberg

Editor/Contributor(s)

X Zhou, Y Zhang, M Orlowska

Usage metrics

    Research Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC