pattern matching through chaos game representation bridging numerical and discrete data structures for biological sequence analysis通过衔接混乱游戏表示数值模式匹配和离散数据结构生物序列分析.pdfVIP
- 1、本文档共12页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
- 5、该文档为VIP文档,如果想要下载,成为VIP会员后,下载免费。
- 6、成为VIP后,下载本文档将扣除1次下载权益。下载后,不支持退款、换文档。如有疑问请联系我们。
- 7、成为VIP后,您将拥有八大权益,权益包括:VIP文档下载权益、阅读免打扰、文档格式转换、高级专利检索、专属身份标志、高级客服、多端互通、版权登记。
- 8、VIP文档为合作方或网友上传,每下载1次, 网站将根据用户上传文档的质量评分、类型等,对文档贡献者给予高额补贴、流量扶持。如果你也想贡献VIP文档。上传文档
查看更多
pattern matching through chaos game representation bridging numerical and discrete data structures for biological sequence analysis通过衔接混乱游戏表示数值模式匹配和离散数据结构生物序列分析
Vinga et al. Algorithms for Molecular Biology 2012, 7:10
/content/7/1/10
RESEARCH Open Access
Pattern matching through Chaos Game
Representation: bridging numerical and discrete
data structures for biological sequence analysis
Susana Vinga1,2*, Alexandra M Carvalho3,4, Alexandre P Francisco1,4, Luís MS Russo1,4 and Jonas S Almeida5
Abstract
Background: Chaos Game Representation (CGR) is an iterated function that bijectively maps discrete sequences
into a continuous domain. As a result, discrete sequences can be object of statistical and topological analyses
otherwise reserved to numerical systems. Characteristically, CGR coordinates of substrings sharing an L-long suffix
will be located within 2-L distance of each other. In the two decades since its original proposal, CGR has been
generalized beyond its original focus on genomic sequences and has been successfully applied to a wide range of
problems in bioinformatics. This report explores the possibility that it can be further extended to approach
algorithms that rely on discrete, graph-based representations.
Results: The exploratory analysis described here consisted of selecting foundational string problems and
refactoring them using CGR-based algorithms. We found that CGR can take the role of suffix trees and emulate
sophisticated string algorithms, efficiently solving exact and approximate string matching problems such as finding
all palindromes and tandem repeats, and matching with mismatches. The common feature of these problems is
that they use longest common extension (LCE) queries as subtasks of their procedures, which we show to have a
constant time solution with CGR. Additionally, we show that CGR can be used as a rolling hash function within the
Rabin-Karp algorithm.
Conclusions: The analysis of biological sequences relies on algorithmic fou
您可能关注的文档
- overcoming the barriers to organic adoption in the united states a look at pragmatic conventional producers in texas克服障碍有机采用在美国一看务实传统生产者在德克萨斯州.pdf
- ovariectomized rats as a model of postmenopausal osteoarthritis validation and application切除卵巢的老鼠绝经后骨关节炎模型的验证和应用.pdf
- overdistension in ventilated childrenoverdistension通风的孩子.pdf
- overcoming the slow recovery of mox gas sensors through a system modeling approach克服缓慢复苏的金属氧化物气体传感器通过系统建模方法.pdf
- ovarian carcinoma associated with pregnancy a clinicopathologic analysis of 23 cases and review of the literature与怀孕有关卵巢癌23例的临床病理的分析和文献之回顾.pdf
- overcoming phase 1 delays the critical component of obstetric fistula prevention programs in resource-poor countries克服第一阶段延迟产科瘘的预防方案的关键组件在资源贫乏的国家.pdf
- ovariectomy and overeating palatable, energy-dense food increase subcutaneous adipose tissue more than intra-abdominal adipose tissue in rats卵巢切除术和暴饮暴食美味,高能量食物增加皮下脂肪组织多在大鼠腹腔脂肪组织.pdf
- overdose beliefs and management practices among ethnic vietnamese heroin users in sydney, australia越南吸食海洛因过量民族信仰和管理实践在悉尼,澳大利亚.pdf
- over-expression of atpap2 in camelina sativa leads to faster plant growth and higher seed yield表达的atpap2亚麻荠马唐导致更快的植物生长和种子产量高.pdf
- overdose prevention for injection drug users lessons learned from naloxone training and distribution programs in new york city过量预防注射吸毒者教训纳洛酮在纽约培训和分配方案.pdf
- pattern recognition for selective odor detection with gas sensor arrays模式识别与气体传感器选择性气味检测数组.pdf
- pattern of fractures across pediatric age groups analysis of individual and lifestyle factors模式在小儿骨折年龄组分析个人和生活方式因素.pdf
- pattern of neural responses to verbal fluency shows diagnostic specificity for schizophrenia and bipolar disorder模式语言流畅的神经反应显示了精神分裂症和双相情感障碍的诊断特异性.pdf
- pattern statistics on markov chains and sensitivity to parameter estimation模式统计马尔可夫链和敏感参数估计.pdf
- patterns of active and passive smoking, and associated factors, in the south-east anatolian project (seap) region in turkey主动和被动吸烟、模式和相关因素,在东南部安纳托利亚工程(seap)地区在土耳其.pdf
- patterns and predictors of place of cancer death for the oldest old模式和预测的癌症死亡之地最古老的历史.pdf
- patterns of drug use among a sample of drug users and injecting drug users attending a general practice in iran模式使用毒品的吸毒者的样本和注射吸毒者在伊朗参加一个惯例.pdf
- patterns and rates of intron divergence between humans and chimpanzees模式和基因内区人类和黑猩猩之间的分歧.pdf
- pattern recognition via pcnn and tsallis entropy模式识别;再利用和tsallis熵.pdf
- patterns of expansion and expression divergence in the plant polygalacturonase gene family的扩张模式和表达差异的聚半乳糖醛酸酶基因家族.pdf
最近下载
- 13.5 道路运输法律制度(政策与法律法规 第五版).pptx VIP
- RB_T 089-2022 绿色供应链管理体系 要求及使用指南.docx VIP
- 13.4 铁路运输法律制度(政策与法律法规 第五版).pptx VIP
- NBT47025-2012缠绕垫片-标准图集.docx VIP
- 派出所矛盾纠纷排查 化解调研.pdf VIP
- 2025年中国人工智能计算力发展评估报告.pdf VIP
- 三峡郦道元的文言文.ppt VIP
- 高中英语与语文课程融合的实践与反思教学研究课题报告.docx
- 医院优质服务基层行创建资料(优质服务基层行建设工作汇报).pptx VIP
- 打叶复烤机械修理工职业技能竞赛培训综合试题五(答案).docx VIP
文档评论(0)