


arXiv:q-bio/0501012v1 [q-bio.GN] 10 Jan 2005

A New Simulated Annealing Algorithm for the Multiple Sequence Alignment Problem: The approach of Polymers in a Random Media

M. Hernández-Guía
Henri-Poincaré Group of Complex Systems, Physics Faculty, University of Havana, La Habana, CP 10400, Cuba and National Bioinformatics Center, Industria y San José, Habana Vieja, Capitolio Nacional, CP 10200, Cuba

R. Mulet
Henri-Poincaré Group of Complex Systems, Physics Faculty, University of Havana, La Habana, CP 10400, Cuba and Department of Theoretical Physics, Physics Faculty, University of Havana, La Habana, CP 10400, Cuba

S. Rodríguez-Pérez
Henri-Poincaré Group of Complex Systems, Physics Faculty, University of Havana, La Habana, CP 10400, Cuba and Department of Informatics, University Center of Las Tunas Vladimir Illich Lenin, Las Tunas, CP 75200, Cuba

(Dated: February 9, 2008)

We propose a probabilistic algorithm to solve the Multiple Sequence Alignment problem. The algorithm is a Simulated Annealing (SA) procedure that exploits the representation of the multiple alignment of D sequences as a directed polymer in D dimensions. Within this representation we can easily track the evolution of the alignment in configuration space through local moves of low computational cost. Unlike other probabilistic algorithms proposed for this problem, our approach allows gaps to be created and deleted without extra computational cost. The algorithm was tested by aligning proteins from the kinase family. When D = 3 the results are consistent with those obtained using a complete algorithm. For D > 3, where the complete algorithm fails, we show that our algorithm still converges to reasonable alignments. Moreover, we study the space of solutions obtained and show that, depending on the number of sequences aligned, the solutions are organized in different ways, suggesting a possible source of errors for progressive algorithms.
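The directed-polymer picture in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the abstract does not specify the moves, the scoring function, or the cooling schedule, so everything below is an assumption made for illustration. An alignment of D sequences is stored as a lattice path whose steps are 0/1 tuples of length D (1 means "this sequence advances by one residue in this column", 0 means "gap"). The hypothetical `propose` function applies one of two local moves, a shift or a split, which create and delete gaps. The toy sum-of-pairs scores (+1 match, -1 mismatch, -1 gap) and the geometric cooling are placeholders. For brevity the sketch rescores the whole path after every move, which gives up the low per-move cost the abstract emphasizes, and the proposal is not symmetric, so it is a heuristic rather than an exact Metropolis sampler.

```python
import math
import random


def initial_path(seqs):
    """Left-justified start: column k holds residue k of every long-enough sequence."""
    width = max(len(s) for s in seqs)
    return [tuple(1 if k < len(s) else 0 for s in seqs) for k in range(width)]


def score(path, seqs, match=1.0, mismatch=-1.0, gap=-1.0):
    """Toy sum-of-pairs score; the numeric values are assumptions, not from the paper."""
    pos = [0] * len(seqs)
    total = 0.0
    for col in path:
        residues = [seqs[d][pos[d]] if bit else None for d, bit in enumerate(col)]
        for a in range(len(seqs)):
            for b in range(a + 1, len(seqs)):
                ra, rb = residues[a], residues[b]
                if ra is None and rb is None:
                    continue  # gap against gap contributes nothing
                if ra is None or rb is None:
                    total += gap
                else:
                    total += match if ra == rb else mismatch
        for d, bit in enumerate(col):
            pos[d] += bit
    return total


def propose(path, rng):
    """Apply one local move (shift or split) and drop columns that became empty."""
    cols = [list(c) for c in path]
    i = rng.randrange(len(cols))
    d = rng.randrange(len(cols[i]))
    side = rng.choice((-1, 1))
    if rng.random() < 0.5:
        # Shift: swap sequence d's occupancy between column i and a neighbour.
        j = i + side
        if 0 <= j < len(cols):
            cols[i][d], cols[j][d] = cols[j][d], cols[i][d]
    elif cols[i][d] == 1 and sum(cols[i]) > 1:
        # Split: move sequence d's residue into a new column of its own.
        cols[i][d] = 0
        new_col = [0] * len(cols[i])
        new_col[d] = 1
        cols.insert(i if side < 0 else i + 1, new_col)
    return [tuple(c) for c in cols if any(c)]


def anneal(seqs, steps=5000, t_start=2.0, t_end=0.05, seed=0):
    """Simulated annealing with geometric cooling; returns the best path seen."""
    rng = random.Random(seed)
    path = initial_path(seqs)
    current = score(path, seqs)
    best_path, best = path, current
    for k in range(steps):
        t = t_start * (t_end / t_start) ** (k / max(1, steps - 1))
        cand = propose(path, rng)
        cand_score = score(cand, seqs)  # full rescoring, kept simple on purpose
        delta = cand_score - current
        if delta >= 0 or rng.random() < math.exp(delta / t):
            path, current = cand, cand_score
            if current > best:
                best_path, best = path, current
    return best_path, best


def render(path, seqs):
    """Turn a path back into gapped strings, one row per sequence."""
    pos = [0] * len(seqs)
    rows = [[] for _ in seqs]
    for col in path:
        for d, bit in enumerate(col):
            rows[d].append(seqs[d][pos[d]] if bit else "-")
            pos[d] += bit
    return ["".join(r) for r in rows]
```

Calling `anneal(["GATTACA", "GATACA", "GTTACA"])` and passing the returned path to `render` yields three gapped rows of equal length. Because both moves only redistribute residues between neighbouring columns, every proposed path still spells out the original sequences, which is what makes the lattice-path representation convenient for local updates.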
