File(s) under permanent embargo
Web page clustering : a hyperlink-based similarity and matrix-based hierarchical algorithms
conference contribution
posted on 2003-01-01, 00:00 authored by Jingyu HouJingyu Hou, Y Zhang, J CaoThis paper proposes a hyperlink-based web page similarity measurement and two matrix-based hierarchical web page clustering algorithms. The web page similarity measurement incorporates hyperlink transitivity and page importance within the concerned web page space. One clustering algorithm takes cluster overlapping into account, another one does not. These algorithxms do not require predefined similarity thresholds for clustering, and are independent of the page order. The primary evaluations show the effectiveness of the proposed algorithms in clustering improvement.
History
Title of proceedings
APWeb 2003 : web technologies and applications : 5th Asia-Pacific Web Conference proceedingsEvent
Asia-Pacific Web Conference (5th : 2003 : Xi'an, Shaanxi Sheng, China)Series
Lecture notes in computer science ; 2642Pagination
201 - 212Publisher
SpringerLocation
Xi'an, Shaanxi Sheng, ChinaPlace of publication
New York N.Y.Publisher DOI
Start date
2003-04-23End date
2003-04-25ISSN
0302-9743eISSN
1611-3349ISBN-13
9783540023548ISBN-10
3540023542Language
engNotes
The original publication can be found at www.springerlink.comPublication classification
E1 Full written paper - refereedCopyright notice
2003, Springer-Verlag Berlin HeidelbergEditor/Contributor(s)
X Zhou, Y Zhang, M OrlowskaUsage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC