The high-throughput experimental data from the new gene microarray technology has spurred numerous efforts to find effective ways of processing microarray data for revealing real biological relationships among genes. This work proposes an innovative data pre-processing approach to identify noise data in the data sets and eliminate or reduce the impact of the noise data on gene clustering, With the proposed algorithm, the pre-processed data sets make the clustering results stable across clustering algorithms with different similarity metrics, the important information of genes and features is kept, and the clustering quality is improved. The primary evaluation on real microarray data sets has shown the effectiveness of the proposed algorithm.
History
Event
International Joint Conference on Computational Sciences and Optimization (2nd : 2009 : Sanya, Hainan, China)
Pagination
710 - 713
Publisher
IEEE Computer Society
Location
Sanya, Hainan, China
Place of publication
Los Alamitos, Calif.
Start date
2009-04-24
End date
2009-04-26
ISBN-13
9780769536057
Language
eng
Notes
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Publication classification
E1 Full written paper - refereed; E Conference publication
Copyright notice
2009, IEEE
Editor/Contributor(s)
L Yu, K Lai, S Mishra
Title of proceedings
Proceedings of the 2nd International Joint Conference on Computational Sciences and Optimization, CSO 2009