File(s) under permanent embargo
Improved random forest algorithm to classify methicillin-resistant and methicillin-susceptible staphylococcus aureus on mass spectra
conference contribution
posted on 2017-05-14, 00:00 authored by Y L Dai, Z C Fan, L P Zhang, X Y Xu, Zili ZhangZili Zhang© 2017 Association for Computing Machinery. Mass spectrometry (MS) method has been one of the most popular subjects in the field of microbial identification by the reason of its rapid identification and variety application. However, the biomarker of Methicillin-resistant and methicillin-susceptible Staphylococcus aureus is out of measuring range by mass spectrometer, which leads to hard to classify for the molecules by those instruments. In this paper, to classify the molecules based on the MS in the small measurable range, we propose a reducing dimensions algorithm to deal with the heterogeneity of variables. With the preprocessed data, a random forest (RF) is used to identify the methicillin-susceptible S. aureus (MSSA) and methicillin-resistant S. aureus (MRSA). Experiments verify the accuracy of proposed methods. The result shows that the accuracy, recall, false positive rate and precision of proposed method are more than 90 percent. For medical institutions, the method which we proposed could identify MRSA from MSSA rapidly and savecosting on mass spectrometry data-set.