bayesian inference for genomic data integration reduces misclassification rate in predicting protein-protein interactions贝叶斯推理的基因组数据集成减少误分类率预测蛋白质相互作用.pdfVIP
- 10
- 0
- 约9.07万字
- 约 10页
- 2017-08-31 发布于上海
- 举报
bayesian inference for genomic data integration reduces misclassification rate in predicting protein-protein interactions贝叶斯推理的基因组数据集成减少误分类率预测蛋白质相互作用
Bayesian Inference for Genomic Data Integration
Reduces Misclassification Rate in Predicting Protein-
Protein Interactions
1 2
Chuanhua Xing *, David B. Dunson
1 Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, United States of America, 2 Department of Statistical Science, Duke University,
Durham, North Carolina, United States of America
Abstract
Protein-protein interactions (PPIs) are essential to most fundamental cellular processes. There has been increasing interest in
reconstructing PPIs networks. However, several critical difficulties exist in obtaining reliable predictions. Noticeably, false
positive rates can be as high as .80%. Error correction from each generating source can be both time-consuming and
inefficient due to the difficulty of covering the errors from multiple levels of data processing procedures within a single test.
We propose a novel Bayesian integration method, deemed nonparametric Bayes ensemble learning (NBEL), to lower the
misclassification rate (both false positives and negatives) through automatically up-weighting data sources that are most
informative, while down-weighting less informative and biased sources. Extensive studies indicate that NBEL is significantly
¨
more robust than the classic naıve Bayes to unreliable, error-prone and contaminated data. On a large human data set our
¨
NBEL approach predicts many more PPIs than naıve Bayes. This suggests that previous studies may have large numbers of
not only false positives but also false negatives. The validation on two human PPIs datasets having high quality supports our
observations. Our experiments demonstrate that it is feasible to predict high-throughput
您可能关注的文档
- associations between screen time and physical activity among spanish adolescents屏幕时间之间的联系和体育活动在西班牙的青少年.pdf
- associations between total cerebral blood flow and age related changes of the brain总脑血流量之间的关联和年龄有关的大脑变化.pdf
- associations of different phenotypes of wheezing illness in early childhood with environmental variables implicated in the aetiology of asthma关联不同表型的气喘病在儿童早期环境变量与哮喘的病因学.pdf
- associations of amylin with inflammatory markers and metabolic syndrome in apparently healthy chinese糊精协会与炎症标记物和代谢综合征在看似健康的中国人.pdf
- associations of fatty acids in cerebrospinal fluid with peripheral glucose concentrations and energy metabolism协会的脂肪酸与外围脑脊液能量代谢和血糖浓度.pdf
- associations between the expression of epigenetically regulated genes and the expression of dnmts and mbds in systemic lupus erythematosus之间的关联epigenetically调节基因的表达和表达dnmts mbds系统性红斑狼疮.pdf
- association study of the β2-adrenergic receptor gene polymorphisms and hypertension in the northern han chinese协会的研究β2-adrenergic受体基因多态性与高血压北方汉族.pdf
- associations between variation in chrna5-chrna3-chrnb4, body mass index and blood pressure in the northern finland birth cohort 1966chrna5-chrna3-chrnb4变化之间的联系,身体质量指数和血压在芬兰北部1966年出生队列.pdf
- associations of hla-dp variants with hepatitis b virus infection in southern and northern han chinese populations a multicenter case-control study协会hla-dp变异与乙型肝炎病毒感染在南方和北方汉族人口多中心病例对照研究.pdf
- associations between gene expression variations and ovarian cancer risk alleles identified from genome wide association studies协会之间的基因表达变化和卵巢癌风险等位基因从全基因组关联研究确定.pdf
原创力文档

文档评论(0)