- 1、本文档共4页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
- 5、该文档为VIP文档,如果想要下载,成为VIP会员后,下载免费。
- 6、成为VIP后,下载本文档将扣除1次下载权益。下载后,不支持退款、换文档。如有疑问请联系我们。
- 7、成为VIP后,您将拥有八大权益,权益包括:VIP文档下载权益、阅读免打扰、文档格式转换、高级专利检索、专属身份标志、高级客服、多端互通、版权登记。
- 8、VIP文档为合作方或网友上传,每下载1次, 网站将根据用户上传文档的质量评分、类型等,对文档贡献者给予高额补贴、流量扶持。如果你也想贡献VIP文档。上传文档
查看更多
Facts from Text—Is Text Mining Ready to Deliver 英文参考文献
Open access, freely available online
Essay
Facts from Text—Is Text Mining Ready
to Deliver?
Dietrich Rebholz-Schuhmann*, Harald Kirsch, Francisco Couto
B
iological databases offer access
to formalized facts about many
aspects of biology—genes and
gene products, protein structure,
metabolic pathways, diseases,
organisms, and so on. These databases
are becoming increasingly important
to researchers. The information that
populates databases is generated by
research teams and is usually published
in peer-reviewed journals. As part of
the publication process, some authors
deposit data into a database but,
more often, it is extracted from the
published literature and deposited into
the databases by human curators, a
painstaking process.
Research literature and scienti?c
databases ful?l different needs.
Literature provides ideas and new
hypotheses, but is not constrained to
provide facts in formats suitable for
use in databases. By contrast, databases
ef?ciently provide large quantities of
data and information in a standardised
schema representing a prede?ned
interpretation of the data. While the
acceptance of a paper can enforce the
submission of data to a central data
repository, such as EMBL (http:??www.
ebi.ac.uk/embl/) or ArrayExpress
(http:??www.ebi.ac.uk/arrayexpress/),
nobody receives credit for the
submission of a fact to a database
without an associated publication.
As long as this practice continues,
curation will be necessary to add the
(re)formalised facts to biological
databases.
Given that publications are not about
to be replaced with routine deposition
of data into databases, is it possible
to develop software tools to support
the work of the curator? Could we
automatically analyse new scienti?c
publications routinely to extract facts,
which could then be inserted into
scienti?c databases? Could we tag gene
and protein names, as well as other
DOI: 10.1371/journal.pbio.0030065.g001
Figure 1. Medline Article Deluge
This ?gure shows the exploding number
您可能关注的文档
- Expression of Human Beta-Defensins in Children with Chronic Inflammatory Bowel Disease 英文参考文献.doc
- Expression of HA of HPAI H5N1 Virus at US2 Gene Insertion Site of Turkey Herpesvirus Induced Better Protection than That at US10 Gene Insertion Site 英文参考文献.doc
- Expression of Ixodes scapularis Antifreeze Glycoprotein Enhances Cold Tolerance in Drosophila melanogaster 英文参考文献.doc
- Expression of Eukaryotic Initiation Factor 5A and Hypusine Forming Enzymes in Glioblastoma Patient Samples Implications for New Targeted Therapies 英文参考文献.doc
- Expression of HLA Class I and HLA Class II by Tumor Cells in Chinese Classical Hodgkin Lymphoma Patients 英文参考文献.doc
- Expression of Kruppel-Like Factor KLF4 in Mouse Hair Follicle Stem Cells Contributes to Cutaneous Wound Healing 英文参考文献.doc
- Expression of Measles Virus Nucleoprotein Induces Apoptosis and Modulates Diverse Functional Proteins in Cultured Mammalian Cells 英文参考文献.doc
- Expression of Genes Encoding Multi-Transmembrane Proteins in Specific Primate Taste Cell Populations 英文参考文献.doc
- Expression of NADPH Oxidase (NOX) 5 in Rabbit Corneal Stromal Cells 英文参考文献.doc
- Expression of Osteoprotegerin in Placenta and Its Association with Preeclampsia 英文参考文献.doc
最近下载
- 2025年中国猪肉脯市场调查研究报告.docx
- 部编版二年级语文课文填空汇总.doc VIP
- 国家工商行政管理总局通达商标服务中心招聘模拟备考预测(共1000题)综合模拟试卷+答案解析.docx
- 教科版小学科学知识点汇总.docx VIP
- 部编版二年级语文下册课文填空练习.pdf VIP
- 2025届THUSSAT北京市清华大学中学高考生物二模试卷含解析.doc VIP
- 《数学课程标准》义务教育2022年修订版(原版).pdf VIP
- 半中半理论_del35论数字心理.pdf VIP
- THUSSAT北京市清华大学中学2025届高三第二次调研化学试卷含解析.doc
- 浪荡子美学与跨文化现代性-中国文哲研究所.PDF
文档评论(0)