Deakin University
Browse

Finding coverage using incremental attribute combinations

Download (874.71 kB)
journal contribution
posted on 2009-05-01, 00:00 authored by Jiyuan An, Yi-Ping Phoebe Chen
Coverage is the range that covers only positive samples in attribute (or feature) space. Finding coverage is the kernel problem in induction algorithms because of the fact that coverage can be used as rules to describe positive samples. To reflect the characteristic of training samples, it is desirable that the large coverage that cover more positive samples. However, it is difficult to find large coverage, because the attribute space is usually very high dimensionality. Many heuristic methods such as ID3, AQ and CN2 have been proposed to find large coverage. A robust algorithm also has been proposed to find the largest coverage, but the complexities of time and space are costly when the dimensionality becomes high. To overcome this drawback, this paper proposes an algorithm that adopts incremental feature combinations to effectively find the largest coverage. In this algorithm, the irrelevant coverage can be pruned away at early stages because potentially large coverage can be found earlier. Experiments show that the space and time needed to find the largest coverage has been significantly reduced.

History

Journal

International journal of innovative computing, information and control

Volume

5

Pagination

1419 - 1428

Location

Kumamoto, Japan

Open access

  • Yes

ISSN

1349-4198

Language

eng

Notes

Every reasonable effort has been made to ensure that permission has been obtained for items included in Deakin Research Online. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au

Publication classification

C1 Refereed article in a scholarly journal

Copyright notice

2009, ICIC International

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC