Ryu D, Jang JI, Baik J. A hybrid instance selection using nearest-neighbor for cross-project defect prediction. JOURNAL
OF COMPUTER SCIENCE AND TECHNOLOGY 30(5): 969–980 Sept. 2015. DOI 10.1007/s11390-015-1575-5
A Hybrid Instance Selection Using Nearest-Neighbor for
Cross-Project Defect Prediction
Duksan Ryu, Jong-In Jang, and Jongmoon Baik, Member, ACM, IEEE
School of Computing, Korea Advanced Institute of Science and Technology, Yuseong-gu, Daejeon 305-701, Korea
E-mail: {dsryu, forestar0719, jbaik}@kaist.ac.kr
Received March 20, 2015; revised July 7, 2015.
Abstract Software defect prediction (SDP) is an active research field in software engineering that aims to identify defect-prone
modules. Thanks to SDP, limited testing resources can be effectively allocated to defect-prone modules. Although SDP
requires sufficient local data within a company, there are cases where local data are not available, e.g., pilot projects.
Companies without local data can employ cross-project defect prediction (CPDP) using external data to build classifiers.
The major challenge of CPDP is the different distributions of training and test data. To tackle this, instances of source
data similar to target data are selected to build classifiers. Software datasets also have a class imbalance problem, meaning
the ratio of defective class to clean class is very low, which usually lowers the performance of classifiers. We propose a Hybrid
Instance Selection Using Nearest-Neighbor (HISNN) method that performs a hybrid classification selectively learning local
knowledge (via k-nearest neighbor) and global knowledge (via naïve Bayes). Instances having strong local knowledge are
identified via nearest-neighbors with the same class label. Previous studies showed low PD (probability of detection) or
high PF (p
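
The hybrid idea described in the abstract, local knowledge from nearest neighbors combined with global knowledge from naïve Bayes, can be illustrated with a minimal sketch. This is not the authors' HISNN procedure, whose details are not given in this excerpt. The sketch assumes Euclidean distance, a Gaussian naïve Bayes model, and a simple rule: if the k nearest training instances all share one class label, use that label; otherwise fall back to naïve Bayes. The function names, the value of k, and the toy data are all hypothetical.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two feature vectors (an assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def k_nearest_labels(X_train, y_train, x, k):
    """Return the class labels of the k training instances closest to x."""
    order = sorted(range(len(X_train)), key=lambda i: euclidean(X_train[i], x))
    return [y_train[i] for i in order[:k]]

def fit_gaussian_nb(X_train, y_train):
    """Estimate a class prior and per-feature (mean, variance) for each class."""
    model = {}
    n = len(y_train)
    for label in set(y_train):
        rows = [x for x, y in zip(X_train, y_train) if y == label]
        stats = []
        for col in zip(*rows):
            mean = sum(col) / len(col)
            # Small constant keeps the variance strictly positive.
            var = sum((v - mean) ** 2 for v in col) / len(col) + 1e-9
            stats.append((mean, var))
        model[label] = (len(rows) / n, stats)
    return model

def predict_nb(model, x):
    """Pick the class with the highest Gaussian naive Bayes log-posterior."""
    def log_posterior(label):
        prior, stats = model[label]
        total = math.log(prior)
        for v, (mean, var) in zip(x, stats):
            total += -0.5 * math.log(2 * math.pi * var) - (v - mean) ** 2 / (2 * var)
        return total
    return max(model, key=log_posterior)

def hybrid_predict(model, X_train, y_train, x, k=3):
    """Assumed hybrid rule: unanimous neighbors decide, else naive Bayes."""
    labels = k_nearest_labels(X_train, y_train, x, k)
    if len(set(labels)) == 1:
        return labels[0]          # strong local knowledge
    return predict_nb(model, x)   # global knowledge

# Toy data: label 0 = clean module, label 1 = defective module (made up).
X_train = [[1.0, 1.0], [1.2, 0.9], [0.9, 1.1],
           [5.0, 5.0], [5.2, 4.9], [4.8, 5.1]]
y_train = [0, 0, 0, 1, 1, 1]
model = fit_gaussian_nb(X_train, y_train)

local_result = hybrid_predict(model, X_train, y_train, [1.0, 1.0], k=3)
global_result = hybrid_predict(model, X_train, y_train, [5.1, 5.0], k=4)
```

With k=3, the query near the first cluster is decided by its unanimous neighbors. With k=4, every query sees mixed labels in this toy set, so the decision falls through to naïve Bayes.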