- 3
- 0
- 约7.31万字
- 约 89页
- 2019-03-23 发布于上海
- 举报
布的评分函数。并通过多次实验确定计算距离分布时的离散区间数目为20。
布的评分函数。并通过多次实验确定计算距离分布时的离散区间数目为
20。
(2)在蛋白质结构中,主链二面角(≯,妒)的分布就可用拉氏构象 图来描画。本文构建了一个基于二面角的评分函数,通过计算确定把 (≯,伊)空间离散为60的网格是最好的选择。
(3)进一步组合上述从距离和角度两个方面建立的评分函数,所得 的评分函数性能比前两者有大幅提高。通过正确识别蛋白质天然结构总 数和Z score这两个性能指标,确定了性能最好的一组组合能量,此组 合能量函数能识别出150条天然结构的测试集中的109条。
(4)由于20种氨基酸在蛋白质中出现的频率不一样,因此存在着 数据稀疏性。本文采用了一种稀疏数据校正策略,通过计算确定了另一 组识别性能最优的组合能量,能识别1 14条天然结构,识别率为76%,
Z_score值也同时得到改善。
关键词:基于知识的评分函数,能量函数,依赖距离分布的能量函数,
二面角能量函数,组合能量函数,稀疏数据校正
A
A KNOWLEDGE—BASED SCOIUNG FUNCTION FOR
PREDICTING PROTEIN STRUCTURES
ABSTRACT
It has recently been a challenging research topic in bioinformatics to predict the tertiary structure of a protein from its amino acid sequence.It is critical to design a good scoring function in recognizing the native structure
of a protein.
Scoring function,which is also called as energy function or potential function,can be classified into two categories:Physics—based scoring function and knowledge-based scoring function.The former is all experiential formula resulting from analyzing the forces between the particles SO it really
reflects the forces between the particles inside the protein or between the
particles of the protein and solvent.It is complicated and very
time—consuming to calculate physics—based scoring function.The latter is
derived from the known protein structures data in protein database(PDB)as training data and is statistically effective.The knowledge-based scoring function implicitly represents the physical and chemical forces in the native
protein structures and can be computed more easily than the former.The
performance of knowledge—based scoring function largely depends on the
quantity and quality of known protein structures that were selected as the
training data.
In this paper,we employed proteins listed in pdb select 25 as learning data.As the selection ofa representative set ofPDB chains,the pdb select 25 list has been continually updated by the member of European Molecular
Biology
Biology Laboratory.The main p
您可能关注的文档
- 基于质量功能配置(QFD)的质量特征并行优化模型研究-机械制造及自动化专业论文.docx
- 基于小生境技术改进遗传算法在供电网规划中的应用-电力系统及其自动化专业论文.docx
- 基于蚁群算法的网格计算资源调度策略仿真研究-计算机应用技术专业论文.docx
- 基于双端口RAM的数据Cache的研究与实现-计算机科学与技术专业论文.docx
- 基于万米同轴电缆的图像信号传输方案设计与分析-信号与信息处理专业论文.docx
- 基于移动平台的个性化搜索系统研究-计算机应用技术专业论文.docx
- 基于引文的信息检索可视化系统研究-情报学·网络信息技术专业论文.docx
- 基于小波分析理论的GPS动态监测数据处理及分析-大地测量学与测量工程专业论文.docx
- 基于双目立体视觉的三维人脸重构及其识别-电路与系统专业论文.docx
- 基于图像处理技术的药用玻璃瓶检测系统-电子与通信工程专业论文.docx
最近下载
- 九年义务教育控辍保学工作方案.doc VIP
- 2025年安徽中考语文试卷及答案出炉 .pdf VIP
- KA 25-2025 煤矿井下机电设备完好性要求.docx VIP
- 劳动合同中止期间的工资支付与社保缴纳义务.docx VIP
- T BALI 003—2023 节律照明灯具性能要求.pdf VIP
- 2012年江苏高考数学试卷真题及答案.doc VIP
- 高中地理野外实践活动与乡土文化传承的结合研究教学研究课题报告.docx
- 2025光伏电站光伏组件并网验收测试标准光伏组件安装质量检查标准.docx VIP
- 上汽通用五菱宝骏610_汽车使用手册用户操作图解驾驶车主车辆说明书pdf电子版下载.pdf VIP
- 运筹学题库及答案.doc VIP
原创力文档

文档评论(0)