Abstract
To deal with missing data in historical project data sets is an important issue for constructing effort estimation models. Past researches have showed that the similarity-based imputation showed high estimation performance. However, it is unclear if it is still effective for small data sets. In this paper, using multiple data sets with different project cases each extracted from ISBSG data set, we present an experimental evaluation among four methods: mean imputation, similarity-based imputation, row-column deletion and pairwise deletion. The result showed that the row-column deletion showed better performance than similarity-based imputation for data sets not exceeding 220 cases.
Original language | English |
---|---|
Pages (from-to) | 100-105 |
Number of pages | 6 |
Journal | Computer Software |
Volume | 27 |
Issue number | 2 |
Publication status | Published - May 1 2010 |
Externally published | Yes |
ASJC Scopus subject areas
- Software