- 3
- 0
- 约3.15万字
- 约 58页
- 2016-11-23 发布于广东
- 举报
HotData动抽取模块的分析与设计3
HotData自动抽取模块的分析与设计
[摘要] 本文探讨了如何对生物医学学术期刊网站的附加数据库进行半自动化抽取。文章以17本国际知名生物医学期刊作为分析对象,确认了学术期刊网站附加数据抽取的必要性和可行性。并提出了这些期刊网站附加数据的关键字段及组合规律,逐步讨论如何将网站附加数据抽取到本地的过程。
[关键词] HotData、ETL、生物医药文献、附加数据、自动抽取
HotData automatic extraction module Analysis and Design
[Abstract] This paper discussed how to semi-automatically sample additional data from professional academic periodical websites. The paper analyzed 17 international well-known biomedical periodicals. The necessity and possibility of sampling additional data from academic periodical websites was confirmed. And proposed the keywords and comb
原创力文档

文档评论(0)