基于BLAST 的数据清洗与质量控制方案.pdfVIP

  • 4
  • 0
  • 约1.13万字
  • 约 3页
  • 2017-08-13 发布于北京
  • 举报

基于BLAST 的数据清洗与质量控制方案.pdf

37 4 2011 2 Vol.37 No.4 Computer Engineering February 2011 ·· 2011 A TP393 BLAST 1 1 1 2 1 1 1 1 (1. 1001902. 518004) (BLAST) BLAST blastn blastp Data Cleaning and Quality Control Scheme Based on BLAST 1 1 1 2 1 1 1 1 LIU Qi , MENG Zhen , LIU Yong , DONG Hui , LIN Xiao-guang , GAO Yan-ping , ZHOU Yuan-chun , LI Jian-hui (1. Scientific Data Center, Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China; 2. Fairylake Botanical Garden, Chinese Academy of Sciences, Shenzhen 518004, China) AbstractThis paper researches the application of Basic Local Alignment Search Tool(BLAST) in the Platform for Phylogenetic Analysis of Land Plant Platform(PALPP). In data cleaning, it uses the data extraction based on gene annotation and extraction based on BLAST similarity matching to filter the related sequence information, control the sequence quality and remove the original gene sequence annotation errors. In the quality control of self-sequence data, it uses the way of alignment scoring based on blastn and template matching based on blastp to report the overall quality of sequence, control the storage of the pollution sequences and pseudo genes. Key wordssequence alignment; data cleaning; Basic Local Alignment Search Tool(BLAST); Phylogenetic Analysis of Land Plant Platform (PALPP) DOI: 10.3969/j.issn.1000-3428.2011.04.026 1 PALPP MapReduce BLAST-cloud [4] (Phylogenetic Analysis of Land Plant Platform, PALPP)

文档评论(0)

1亿VIP精品文档

相关文档