序列比对算法详解.ppt

  1. 1、本文档共66页,可阅读全部内容。
  2. 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
  3. 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载
  4. 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
PsiBlast介绍 Psi (Position Specific Iterated) 是一个自动的 profile-like搜索 程序首先执行一个有空位的对数据库的Blast搜索。然后,重要的比对信息被用于构建一个“特定位置”打分矩阵。 该矩阵在下一轮数据库搜索中替换了查询序列 程序将迭代不断运行,直到发现新的有意义的信息 * 思考题 问题1:为什么说如果在一个比对中获得的P-值小(0.001) ,则比对可能具有生物学意义? P值概念:一个变量取值严格说碰巧大于等于观察值的概率记为P(zzo) 问题2 :如何理解“成对序列的比对是窃窃私语… 多序列比对是大声嚷嚷” ——Hubbard et al 1996 * * * What you want to learn when evaluating an alignment is whether it is random or meaningful. If it is meaningful, then how much? * * * * All pairs of sequences are aligned separately in order to calculate a distance matrix indicating the divergence of each pair of sequences. ? A guide tree is calculated from the distance matrix. * ? Once a gap, always a gap. The positions of the gaps that were introduced during the early alignments of closely related sequences are not changed as new sequences are added. (The placements of gaps in alignments between closely related sequences are much more accurate than those between distantly related ones.) * ? The final multiple alignment is performed by a series of progressive, pairwise alignments between sequences and clusters of sequences, according to the branching order in the guide tree. ? Since the alignment is calculated on a progressive basis, the choice of sequences in the initial stage of alignment can affect the final alignment. In addition, anything that affects the calculation of the guide tree such as different similarity matrices or gap weights can also affect the multiple alignment. * Averaged scores used at each aligned position A A C . . . 0 -1 -2 -3 . . . A -1 T -2 C -3 . . . . . . 1 0 -1 0 1 0 -1 0 2 C C AC TC AAC ATC 追朔 : 比对的构建 (5) * 数学的表达 0 0 2 0 1 0 £ - = £ - = l j j S l i i S j i 序列 1长度 序列 2长度 0 f o r 1 ) , ( w i t h 0 , 0 ) , ( m a x 2 1 1 , 1 , 1 , 1 ? í ì = = £ £ ? ? ? í ì - + - = - - - - e l s e b a b a s l j l i 1 S b a s S 1 S S j i j i j i j i j i j i i j 初始化 打分 * 动态规划计算的复杂度 计算时间: O(nm),这里 n 和 m 是序列的长度 检索时间: O(Max (n,m)) [worst case: n+m; best case: Min(n,m)] 需要的内存: O(nm

文档评论(0)

妈妈王子 + 关注
实名认证
内容提供者

该用户很懒,什么也没介绍

1亿VIP精品文档

相关文档