PDF电子书：PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPP - 经典PDF电子书下载区干细胞之家



免疫细胞治疗专区	欢迎关注干细胞微信公众号

12 3 4 5 6 7 8 9 10 ... 27 下一页

返回列表

查看: 821468\|回复: 269	go [实验技术类] PDF电子书：PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPP [复制链接]

hilib

新手上路

Rank: 1

积分: 0
威望: 0
包包: 21

楼主

发表于 2012-8-27 13:25 |只看该作者 |倒序浏览 |打印

PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINES$ e& t( t" W; R$ v
Abstract  K. \6 D0 A( k% t
Predicting the secondary structure of proteins is important in biochemistry because the 3D
structure can be determined from the local folds that are found in secondary structures.: ^. `- \+ i8 e3 D" D+ [# ?
Moreover, knowing the tertiary structure of proteins can assist in determining their functions.2 M# \! q& q* S7 T6 X3 x( E  M
The objective of this thesis is to compare the performance of Neural Networks (NN) and
Support Vector Machines (SVM) in predicting the secondary structure of 62 globular proteins
from their primary sequence. For each NN and SVM, we created six binary classifiers to9 _" M% h+ k9 E2 h
distinguish between the classes’ helices (H) strand (E), and coil (C). For NN we use Resilient% I7 D/ Q; Q& Y" O" m
Backpropagation training with and without early stopping. We use NN with either no hidden
layer or with one hidden layer with 1,2,...,40 hidden neurons. For SVM we use a Gaussian
kernel with parameter fixed at
= 0.1 and varying cost parameters C in the range [0.1,5]. 10-
fold cross-validation is used to obtain overall estimates for the probability of making a correct. S- v" N  D; P3 N% t: C' B) Z
prediction. Our experiments indicate for NN and SVM that the different binary classifiers
have varying accuracies: from 69% correct predictions for coils vs. non-coil up to 80% correct
predictions for stand vs. non-strand. It is further demonstrated that NN with no hidden layer) N" V7 R1 [7 \
or not more than 2 hidden neurons in the hidden layer are sufficient for better predictions. For
SVM we show that the estimated accuracies do not depend on the value of the cost parameter.: V! W' {* y- g  I3 f: u' h6 Z. b
As a major result, we will demonstrate that the accuracy estimates of NN and SVM binary
classifiers cannot distinguish. This contradicts a modern belief in bioinformatics that SVM
outperforms other predictors.
keywords: Neural Networks, Support Vector Machines, Protein Secondary Structure Prediction

收藏0 分享0 顶0 踩0