A matrix approach for hierarchical web page clustering based on hyperlinks
This paper proposes a matrix approach to hierarchical web page clustering, with two algorithms that use the hyperlink information among pages. The first algorithm clusters web pages without considering cluster overlap; the second takes overlap into account. Both algorithms exploit the intrinsic relationships among the pages and are independent of the order in which the pages are presented. Furthermore, they do not require a predefined similarity threshold for clustering, and they are easy to implement for web applications. Preliminary evaluations show the effectiveness of the proposed algorithms, as well as a promising application.
Event: IEEE Computer Society. Conference (3rd : 2002 : Singapore)
Series: IEEE Computer Society Conference
Pagination: 207-216
Publisher: Institute of Electrical and Electronics Engineers
Location: Singapore
Place of publication: Piscataway, N.J.
Start date: 2002-12-11
End date: 2002-12-11
ISBN-13: 9780769518138
ISBN-10: 0769518133
Language: eng
Publication classification: E1.1 Full written paper - refereed
Copyright notice: 2002, IEEE
Editor/Contributor(s): Bo Huang, Tok Ling, Mukesh Mohania, Wee Ng, Ji-Rong Wen, S Gupta
Title of proceedings: WISE 2002 : Proceedings of the 3rd International Conference on Web Information Systems Engineering Workshops 2002