- 1、本文档共14页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Using RepeatMasker to Identify UNIT 4.10
Repetitive Elements in Genomic
Sequences
Maja Tarailo-Graovac1 and Nansheng Chen1
1Simon Fraser University, Burnaby, British Columbia, Canada
ABSTRACT
RepeatMasker is a popular software tool widely used in computational genomics to
identify, classify, and mask repetitive elements, including low-complexity sequences and
interspersed repeats. RepeatMasker searches for repetitive sequence by aligning the input
genome sequence against a library of known repeats, such as Repbase. Here, we describe
two Basic Protocols that provide detailed guidelines on how to use RepeatMasker, either
via the Web interface or command-line Unix/Linux system, to analyze repetitive elements
in genomic sequences. Sequence comparisons in RepeatMasker are usually performed by
the alignment program cross match, which requires significant processing time for larger
sequences. An Alternate Protocol describes how to reduce the processing time using an
alternative alignment program, such as WU-BLAST. Further, the advantages, limitations,
and known bugs of the software are discussed. Finally, guidelines for understanding the
C
results are provided. Curr. Protoc. Bioinform. 25:4.10.1-4.10.14. 2009 by John Wiley
Sons, Inc.
Keywords: RepeatMasker genome annotation repetitive elements repeat library
cross match WU-BLAST RECON
INTRODUCTION
RepeatMasker (developed by A.F.A. Smit, R. Hubley, and P. Green; see
/) was designed to identify and annotate repetitive elements
in nucleotide sequences and mask them for further analysis. The repetitive elements,
including low-complexity DNA sequences and interspersed repeats, are annotated and
replaced by Ns, Xs, or lowercase letters (see below for options) in the correspond-
ing pos
文档评论(0)