Data pre-processing for more effective gene clustering
Hou, Jingyu and Chen, Yi-Ping Phoebe 2009, Data pre-processing for more effective gene clustering, in Proceedings of the 2nd International Joint Conference on Computational Sciences and Optimization, CSO 2009, IEEE Computer Society, Los Alamitos, Calif., pp. 710-713.
(Some files may be inaccessible until you login with your Deakin Research Online credentials)
The high-throughput experimental data from the new gene microarray technology has spurred numerous efforts to find effective ways of processing microarray data for revealing real biological relationships among genes. This work proposes an innovative data pre-processing approach to identify noise data in the data sets and eliminate or reduce the impact of the noise data on gene clustering, With the proposed algorithm, the pre-processed data sets make the clustering results stable across clustering algorithms with different similarity metrics, the important information of genes and features is kept, and the clustering quality is improved. The primary evaluation on real microarray data sets has shown the effectiveness of the proposed algorithm.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Field of Research
060102 Bioinformatics 080109 Pattern Recognition and Data Mining
Unless expressly stated otherwise, the copyright for items in Deakin Research Online is owned by the author, with all rights reserved.
Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO.
If you believe that your rights have been infringed by this repository, please contact firstname.lastname@example.org.