- 7
- 0
- 约8.6万字
- 约 12页
- 2017-09-09 发布于上海
- 举报
species tree inference by minimizing deep coalescences物种树推理通过最小化深聚结
Species Tree Inference by Minimizing Deep Coalescences
Cuong Than, Luay Nakhleh*
Department of Computer Science, Rice University, Houston, Texas, United States of America
Abstract
In a 1997 seminal paper, W. Maddison proposed minimizing deep coalescences, or MDC, as an optimization criterion for
inferring the species tree from a set of incongruent gene trees, assuming the incongruence is exclusively due to lineage
sorting. In a subsequent paper, Maddison and Knowles provided and implemented a search heuristic for optimizing the
MDC criterion, given a set of gene trees. However, the heuristic is not guaranteed to compute optimal solutions, and its hill-
climbing search makes it slow in practice. In this paper, we provide two exact solutions to the problem of inferring the
species tree from a set of gene trees under the MDC criterion. In other words, our solutions are guaranteed to find the tree
that minimizes the total number of deep coalescences from a set of gene trees. One solution is based on a novel integer
linear programming (ILP) formulation, and another is based on a simple dynamic programming (DP) approach. Powerful ILP
solvers, such as CPLEX, make the first solution appealing, particularly for very large-scale instances of the problem, whereas
the DP-based solution eliminates dependence on proprietary tools, and its simplicity makes it easy to integrate with other
genomic events that may cause gene tree incongruence. Using the exact solutions, we analyze a data set of 106 loci from
eight yeast species, a data set of 268 loci from eight Apicomplexan species, and several simulated data sets. We show that
the MDC criterion provides very accurate estimates of the species tree topologies, and that our solutions are very fast, thus
allowing for the accurate analysis of genome-scale data sets. Further, the efficie
您可能关注的文档
- simvastatin re-couples dysfunctional endothelial nitric oxide synthase in experimental subarachnoid hemorrhage辛伐他汀减少失调内皮一氧化氮合酶在实验性蛛网膜下腔出血.pdf
- single-cell analysis of ploidy and centrosomes underscores the peculiarity of normal hepatocytes单细胞分析倍数性和中心体凸显了正常肝细胞的特点.pdf
- single-cell stat5 signal transduction profiling in normal and leukemic stem and progenitor cell populations reveals highly distinct cytokine responses单细胞stat5信号转导分析在正常和白血病干细胞和祖细胞数量显示高度不同的细胞因子的反应.pdf
- single-molecule dynamics reveals cooperative binding-folding in protein recognition单分子动力学显示合作binding-folding蛋白质识别.pdf
- single-molecule late-pcr analysis of human mitochondrial genomic sequence variations单分子late-pcr人类线粒体基因组序列变化的分析.pdf
- single-species microarrays and comparative transcriptomics单一物种微阵列和比较转录组.pdf
- single-cell analysis of murine long-term hematopoietic stem cells reveals distinct patterns of gene expression during fetal migration单细胞分析小鼠长期造血干细胞揭示胎儿迁移期间不同的基因表达模式.pdf
- single-tissue and cross-tissue heritability of gene expression via identity-by-descent in related or unrelated individuals通过identity-by-descent single-tissue和cross-tissue遗传基因表达相关或无关的个人.pdf
- sir2-independent life span extension by calorie restriction in yeast卡路里限制sir2-independent寿命延长的酵母.pdf
- single-cell profiling reveals the origin of phenotypic variability in adipogenesis单细胞分析揭示表型多样性的起源在脂肪生成.pdf
- 2025-2026学年天津市和平区高三(上)期末数学试卷(含解析).pdf
- 2025-2026学年云南省楚雄州高三(上)期末数学试卷(含答案).pdf
- 2025-2026学年甘肃省天水市张家川实验中学高三(上)期末数学试卷(含答案).docx
- 2025-2026学年福建省厦门市松柏中学高二(上)期末数学试卷(含答案).docx
- 2025-2026学年广西钦州市高一(上)期末物理试卷(含答案).docx
- 2025-2026学年河北省邯郸市临漳县九年级(上)期末化学试卷(含答案).docx
- 2025-2026学年河北省石家庄二十三中七年级(上)期末历史试卷(含答案).docx
- 2025-2026学年海南省五指山市九年级(上)期末化学试卷(含答案).docx
- 2025-2026学年河北省唐山市玉田县九年级(上)期末化学试卷(含答案).docx
- 2025-2026学年河北省邢台市市区九年级(上)期末化学试卷(含答案).docx
原创力文档

文档评论(0)