- Published 2017-06-18 (Hubei)
Target Information Equivalent Dataset (TIED)
Data introduction:
TIED stands for Target Information Equivalent Dataset. It is an
artificial simulated dataset constructed to illustrate that there may be
many minimal sets of features with optimal predictivity (i.e., Markov
boundaries) and likewise many sets of features that are statistically
indistinguishable from the set of direct causes and direct effects of the
target.
Keywords:
Bayesian network, Markov boundary, target information, equivalence,
optimal predictivity
Data format:
TEXT
Detailed data description:
TIED: Target Information Equivalent Dataset
Contact: Alexander Statnikov - Submitted: 2008-09-12 20:24 - Views: 1320
Authors: Causality Workbench Team
Key facts: Number of variables: 999 + the target,
Number of entries: 750 (train) + 3000 (test),
Variable types: categorical (up to 4 values),
Missing data: no.
Keywords: bayesian.network, markov.boundary
Abstract:
TIED dataset
© 2008 Alexander Statnikov and Constantin Aliferis
Introduction
TIED stands for Target Information Equivalent Dataset. It is an artificial
simulated dataset constructed to illustrate that there may be many minimal
sets of features with optimal predictivity (i.e., Markov boundaries) and likewise
many sets of features that are statistically indistinguishable from the set of
direct causes and direct effects of the target.
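
The multiplicity described above can be illustrated with a minimal sketch. This is not the TIED generating network; it is a hypothetical three-variable toy, assuming a four-valued variable A, a binary variable B that is a deterministic function of A, and a binary target T that depends on A only through B. The names simulate and mutual_information are illustrative helpers, not part of any TIED tooling. Under these assumptions, {A} and {B} each carry all available information about T, so each is a Markov boundary of T.

```python
import math
import random
from collections import Counter


def mutual_information(xs, ys, zs=None):
    """Plug-in estimate, in bits, of I(X; Y | Z) from discrete samples.

    With zs=None the conditioning set is empty and this is plain I(X; Y).
    """
    n = len(xs)
    if zs is None:
        zs = [0] * n  # constant Z is equivalent to no conditioning
    c_xyz = Counter(zip(xs, ys, zs))
    c_xz = Counter(zip(xs, zs))
    c_yz = Counter(zip(ys, zs))
    c_z = Counter(zs)
    total = 0.0
    for (x, y, z), n_xyz in c_xyz.items():
        # p(x,y,z) * log2[ p(x,y,z) p(z) / (p(x,z) p(y,z)) ], written with counts
        ratio = n_xyz * c_z[z] / (c_xz[(x, z)] * c_yz[(y, z)])
        total += (n_xyz / n) * math.log2(ratio)
    return total


def simulate(n, seed=0):
    """Toy data (an assumption for illustration, not the TIED network)."""
    rng = random.Random(seed)
    a = [rng.randrange(4) for _ in range(n)]   # A uniform on {0, 1, 2, 3}
    b = [v // 2 for v in a]                    # B is a coarsening of A
    # T is a noisy copy of B: P(T=1 | B=1) = 0.9, P(T=1 | B=0) = 0.1
    t = [int(rng.random() < (0.9 if v == 1 else 0.1)) for v in b]
    return a, b, t


a, b, t = simulate(20000)
mi_ta = mutual_information(t, a)             # A is informative about T
mi_tb = mutual_information(t, b)             # so is B
cmi_ta_given_b = mutual_information(t, a, b)  # near 0: A adds nothing given B
cmi_tb_given_a = mutual_information(t, b, a)  # 0: B adds nothing given A

print(f"I(T;A)   = {mi_ta:.4f} bits")
print(f"I(T;B)   = {mi_tb:.4f} bits")
print(f"I(T;A|B) = {cmi_ta_given_b:.4f} bits")
print(f"I(T;B|A) = {cmi_tb_given_a:.4f} bits")
```

Both marginal estimates are clearly positive while both conditional estimates are essentially zero, so neither variable can be ruled out in favour of the other from data alone. The deterministic link between A and B is what breaks the uniqueness of the Markov boundary in this toy; an algorithm that assumes a single boundary would return only one of the two sets.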
Data-analysis tasks
It is recommended that participants complete all 3 tasks given below; however,
submitted results will be evaluated as long as a participant has completed at
least task 1 or task 2.
1. Using training data, find all sets of variables that are statistically
indistinguishable from the set of direct causes and direct effects (DCE) of the
target variable.
2. Using training data, find all Markov boundaries (defined as in Pearl,
Probabilistic Reasoning in Intelligent Systems, 1988).
3. For each of the Markov boundari