Proceedings of PKAW '02 : the 2002 Pacific Rim Knowledge Aquisition Workshop
Place of publication
During knowledge acquisition multiple alternative potential rules all appear equally credible. This paper addresses the dearth of formal analysis about how to select between such alternatives. It presents two hypotheses about the expected impact of selecting between classification rules of differing levels of generality in the absence of other evidence about their likely relative performance on unseen data. It is argued that the accuracy on unseen data of the more general rule will tend to be closer to that of a default rule for the class than will that of the more specific rule. It is also argued that in comparison to the more general rule, the accuracy of the more specific rule on unseen cases will tend to be closer to the accuracy obtained on training data. Experimental evidence is provided in support of these hypotheses. We argue that these hypotheses can be of use in selecting between rules in order to achieve specific knowledge acquisition objectives.
Field of Research
080110 Simulation and Modelling
Socio Economic Objective
970108 Expanding Knowledge in the Information and Computing Sciences