puri-analysisandpredict-2013.pdf (355.81 kB)
Analysis and prediction of major blood proteins based on their amino acid and dipeptide composition
journal contributionposted on 2013-01-01, 00:00 authored by S Muthukrishnan, Munish PuriMunish Puri, C Lefevre
A method has been developed for predicting blood proteins using the SVM based machine learning approach. In this prediction method a two-step strategy was deployed to predict blood proteins and their subclasses. We have developed models of blood proteins and achieved the maximum accuracies of 90.57% and 91.39% with Matthews correlation coefficient (MCC) of 0.89 and 0.90 using single amino acid and dipeptide composition respectively. Furthermore, the method is able to predict major subclasses of blood proteins; developed based on amino acid (AC) and dipeptide composition (DC) with a maximum accuracy 90.38%, 92.83%, 87.41%, 92.52% and 85.27%, 89.07%, 94.82%, 86.31 for albumin, globulin, fibrinogen, and regulatory proteins respectively. All modules were trained, tested, and evaluated using the five-fold cross-validation technique.