posted on 2003-01-01, 00:00authored byX Hang, Honghua Dai
This paper proposes an optimal strategy for extracting probabilistic rules from databases. Two inductive learning-based statistic measures and their rough set-based definitions: accuracy and coverage are introduced. The simplicity of a rule emphasized in this paper has previously been ignored in the discovery of probabilistic rules. To avoid the high computational complexity of rough-set approach, some rough-set terminologies rather than the approach itself are applied to represent the probabilistic rules. The genetic algorithm is exploited to find the optimal probabilistic rules that have the highest accuracy and coverage, and shortest length. Some heuristic genetic operators are also utilized in order to make the global searching and evolution of rules more efficiently. Experimental results have revealed that it run more efficiently and generate probabilistic classification rules of the same integrity when compared with traditional classification methods.<br><br>
History
Location
Sapporo, Hokkaido, Japan
Language
eng
Publication classification
E1 Full written paper - refereed
Copyright notice
2003 Springer-Verlag Berlin Heidelberg
Editor/Contributor(s)
G Grieser, Y Tanaka, A Yamamoto
Pagination
153 - 165
Start date
2003-10-17
End date
2003-10-19
ISSN
1611-3349
eISSN
0302-9743
ISBN-13
9783540202936
ISBN-10
3540202935
Title of proceedings
Discovery science : 6th international conference, DS 2003, Sapporo, Japan, October 17-19, 2003 : proceedings
Event
International Conference on Discovery Science (6th : 2003 : Sapporo-shi, Japan)