Tuesday 23 April 2013

Protein Local Structure Prediction Using Support Vector Machine


Abstract

          Structural bioinformatics is getting essential these days. Secondary structure segments are the building blocks of protein local structure. Protein local structure prediction leads to a better three dimensional protein structure and function prediction. Predicting protein local structure is capable to predict protein global structure at ease based on amino acid sequence. However, most related works are conducted using fixed segment length which possibly leads to bias results. Therefore this experiment is conducted to determine the optimal segment length. In this research, residue score is quantified using the hybrid of conservation score and propensity score named CP. Each amino acid is assigned a secondary structure alphabet using DSSP and STRIDE. The residue scores with their respective structural class are sliced into segments using sliding window technique and then modified to feature vector and feature class. Next, Support Vector Machine (SVM) performs classification based on feature vector and feature class to predict protein local structure. This experiment utilizes RS126 dataset (Rost and Sander, 1994) and two fold cross validation. The optimal length of protein local structure for helix is 9, strand is 19 and coil is 5 consecutive residues based on the performance of accuracy and proportion of support vectors. The optimal Secondary Structure Assignment Method (SSAM) that gives the best performance in accuracy, specificity and sensitivity is determined.

Website : http://ir.fsksm.utm.my/59/

No comments:

Post a Comment