- 1、本文档共9页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Discovery Systems Laboratory,
Algorithms for Large-Scale Local Causal Discovery and Feature
Selection In the Presence Of Limited Sample Or Large Causal
Neighbourhoods
C.F. Aliferis and I. Tsamardinos
Department of Biomedical Informatics,
Discovery Systems Laboratory,
Vanderbilt University
Technical Report DSL-02-08
October 8th 2002
1
Introduction
When the Causal Neighbourhood of a target node T is so large relative to the available
sample then one cannot obtain statistically reliable tests of independence and association
metrics when conditioning on all members of the causal neighbourhood. Thus both the
growing and the shrinking phase of the algorithms in the IAMB family [Aliferis and
Tsamardinos 2002a] will fail to give reliable results. Such situations arise often for
example in text categorization, in studies with limited sampling (e.g., gene expression
experiments), or when one wishes to learn the causal neighbourhood of a constellation of
variables (e.g., in epidemiology when learning the causal neighbourhood of a “disease”
that in effect summarizes the states of several variables some or all of which are also
measured).
We give here algorithms that circumvent the problem by trading off less sample
(equivalently large causal neighbourhood when the sample is fixed) with computational
complexity and with the possibility that false positives may be introduced in the
estimated causal neighbourhood. We collectively call these algorithms S/LCN (Small
Sample/Large Causal Neighbourhood). The basic structure for all such algorithms is
identical, so we will first outline the main concept and see how different members of this
family can be created by changes in the basic structure. We will also discuss how the
same techniques we introduced for the IAMB family for chunking and parallelization can
be directly applied in the S/LCN family to provide distributed processing/storage
versions.
The Main
您可能关注的文档
- CRI for wooden toy.pdf
- Crime and punishment in scientific research.pdf
- Crime things aren’t what they seem to be.ppt
- crime watching week 5 bb.pdf
- CRISPR实验流程.pdf
- CRISPR_Cas9基因组编辑技术在农业动物中的应用.pdf
- Critical success factors for public-private partnerships in infrastructure development.pdf
- Crime Data Mining An Overview and Case Studies.pdf
- Cross comparison of US, EU, JP and Korean companies patenting activity in Japan and in the Peoples R.pdf
- CRM的IE配置 网上报销打印报错设置 - 13.1wise.pdf
- Discovery4100 开发套件.pdf
- Discovery_V12_中文操作说明.pdf
- DISCOVERY心得_GEOALTLAS等值线图直接导入Geomap之一.pdf
- Distribution of bacteria involved in nitrogen and sulphur cycles in shrimp culture systems of Wes.pdf
- Distribution of the extensive Doppler redshift of quasars.pdf
- Division and Modulus for Computer Scientists.pdf
- DM00039084【Discovery kit for STM32F407417 lines】.pdf
- DM-Intro_14.5_L01_Introduction_to_ANSYS.pdf
- DMN3009SFG替代型号AON7426.docx
- DNA编码化合物库应用与新药发现的效率分析-roche.pdf
文档评论(0)