Essentials of Database Formats (.ppt)

  • Published 2017-04-19 in Hubei

Introduction to secondary databases

Form of secondary databases: most are built on a web interface and present database content as text, tables, graphics, and charts.

There is no sharp boundary between primary and secondary databases. For example, GDB, AceDB, SCOP, and CATH already show the characteristics of secondary databases.

1. Secondary databases of genomic information
2. Secondary databases of protein sequences
3. Secondary databases of protein structures

Introduction to database formats

Sequence formats of different databases

1. DNA sequence format in GenBank

LOCUS          name of locus, length and type of sequence, classification of organism, date of entry
DEFINITION     description of entry
ACCESSION      accession number of original source
KEYWORDS       key words for cross referencing this entry
SOURCE         source organism of DNA
  ORGANISM     description of organism
REFERENCE
COMMENT        biological function of database information
FEATURES       information about sequence by base position or range of positions
  source       range of sequence, source organism
  misc_signal  range of sequence, type of function or signal
  mRNA         range of sequence, mRNA
  CDS          range of sequence, protein coding region
  intron       range of sequence, position of intron
  mutation     sequence position, change in sequence for mutation
BASE COUNT     count of A, C, G, T and other symbols
ORIGIN         text indicating start of sequence
        1 gaattcgata aatctctggt ttattgtgca gtttatggtt ccaaaatcgc
       51 atatactcac agcataactg tatatacacc cagggggcgg aatgaaagcg
//             database symbol for end of sequence

ACCESSION no | Organism         | Reference        | Name         | Keywords                                              | Sequence
..123        | Escherichia coli | Medline1, ...... | LexA protein | SOS regulon, repressor, transcriptional regulator, .. | ATG..
..124        | Escherichia coli | Medline2,        | UmuD         | SOS regulon, ..                                       | GTA..
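The GenBank layout above can be read mechanically: header keywords start in column 1, the sequence follows ORIGIN, and // closes the record. The following is a minimal sketch of that idea, not a complete GenBank parser. The names parse_record, base_count, and SAMPLE are hypothetical, and every value in the sample record (locus name, accession, feature locations) is invented for illustration, except the 100-base sequence, which is copied from the slide. It assumes header keywords occupy the first 12 columns of a line, and it keeps only each feature's key and location.

```python
import re
from collections import Counter

KEY_WIDTH = 12  # assumption: header keywords sit in the first 12 columns

# Invented sample record; only the sequence lines come from the slide.
SAMPLE = """\
LOCUS       EXAMPLE1     100 bp    DNA
DEFINITION  Made-up entry for illustration only.
ACCESSION   EXAMPLE1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
FEATURES             Location/Qualifiers
     source          1..100
     CDS             41..100
ORIGIN
        1 gaattcgata aatctctggt ttattgtgca gtttatggtt ccaaaatcgc
       51 atatactcac agcataactg tatatacacc cagggggcgg aatgaaagcg
//
"""


def parse_record(text):
    """Parse one GenBank-style record into header fields, features, sequence."""
    fields, features, seq_parts = {}, [], []
    section = "header"
    last_key = None
    for line in text.splitlines():
        if not line.strip():
            continue
        if line.startswith("//"):           # end-of-record symbol
            break
        if section == "origin":
            # Drop position numbers and the spaces between 10-base blocks.
            seq_parts.append(re.sub(r"[^A-Za-z]", "", line))
            continue
        if not line[0].isspace():           # top-level keyword in column 1
            key = line[:KEY_WIDTH].strip()
            if key == "ORIGIN":
                section = "origin"
            elif key == "FEATURES":
                section = "features"
            else:
                section = "header"
                fields[key] = line[KEY_WIDTH:].strip()
                last_key = key
            continue
        if section == "features":
            parts = line.split(None, 1)
            # Keep "key location" lines; skip /qualifier lines.
            if len(parts) == 2 and not parts[0].startswith("/"):
                features.append((parts[0], parts[1].strip()))
            continue
        # Indented header line: sub-keyword (e.g. ORGANISM) or continuation.
        key = line[:KEY_WIDTH].strip()
        value = line[KEY_WIDTH:].strip()
        if key:
            fields[key] = value
            last_key = key
        elif last_key:
            fields[last_key] += " " + value
    return {"fields": fields, "features": features,
            "sequence": "".join(seq_parts)}


def base_count(sequence):
    """Recompute the BASE COUNT line: how often each symbol occurs."""
    return Counter(sequence.lower())


if __name__ == "__main__":
    record = parse_record(SAMPLE)
    print(record["fields"]["ACCESSION"], len(record["sequence"]))
    print(dict(base_count(record["sequence"])))
```

For real work, an established parser such as Biopython's SeqIO module is the safer choice, since the feature table allows multi-line locations and qualifiers that this sketch ignores.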
