Improving analogy-based software cost estimation through probabilistic-based similarity measures

Passakorn Phannachitta, Jacky Keung, Akito Monden, Ken Ichi Matsumoto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

The performance of software cost estimation based on analogy reasoning depends upon the measures that specifying the similarity between software projects. This paper empirically investigates the use of probabilistic-based distance functions to improve the similarity measurement. The probabilistic-based distance functions are considerably more robust, because they collect the implicit correlation between the occurrences of project feature attributes. This information gain enables the constructed estimation model to be more concise and comprehensible. The study compares 6 probabilistic-based distance functions against the commonlyused Euclidian distance. We empirically evaluate the implemented cost estimation model using 5 real-world datasets collected from the PROMISE repository. The result shows a significant improvement in terms of error reduction, that implies an estimation based on probabilistic-based distance functions achieve higher accuracy on average, and the peak performance significantly outperforms the Euclidian distance based on Wilcoxon signed-rank test.

Original languageEnglish
Title of host publicationAPSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference
PublisherIEEE Computer Society
Pages541-546
Number of pages6
Volume1
ISBN (Print)9781479921430, 9780769549224
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event20th Asia-Pacific Software Engineering Conference, APSEC 2013 - Bangkok, Thailand
Duration: Dec 2 2013Dec 5 2013

Other

Other20th Asia-Pacific Software Engineering Conference, APSEC 2013
CountryThailand
CityBangkok
Period12/2/1312/5/13

Fingerprint

Costs

Keywords

  • Analogy
  • Heterogeneous distance function
  • Machine learning
  • Software cost estimation

ASJC Scopus subject areas

  • Software

Cite this

Phannachitta, P., Keung, J., Monden, A., & Matsumoto, K. I. (2013). Improving analogy-based software cost estimation through probabilistic-based similarity measures. In APSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference (Vol. 1, pp. 541-546). [6805449] IEEE Computer Society. https://doi.org/10.1109/APSEC.2013.78

Improving analogy-based software cost estimation through probabilistic-based similarity measures. / Phannachitta, Passakorn; Keung, Jacky; Monden, Akito; Matsumoto, Ken Ichi.

APSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference. Vol. 1 IEEE Computer Society, 2013. p. 541-546 6805449.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Phannachitta, P, Keung, J, Monden, A & Matsumoto, KI 2013, Improving analogy-based software cost estimation through probabilistic-based similarity measures. in APSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference. vol. 1, 6805449, IEEE Computer Society, pp. 541-546, 20th Asia-Pacific Software Engineering Conference, APSEC 2013, Bangkok, Thailand, 12/2/13. https://doi.org/10.1109/APSEC.2013.78
Phannachitta P, Keung J, Monden A, Matsumoto KI. Improving analogy-based software cost estimation through probabilistic-based similarity measures. In APSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference. Vol. 1. IEEE Computer Society. 2013. p. 541-546. 6805449 https://doi.org/10.1109/APSEC.2013.78
Phannachitta, Passakorn ; Keung, Jacky ; Monden, Akito ; Matsumoto, Ken Ichi. / Improving analogy-based software cost estimation through probabilistic-based similarity measures. APSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference. Vol. 1 IEEE Computer Society, 2013. pp. 541-546
@inproceedings{3c52ece4a204420b914b7c78b7154b5c,
title = "Improving analogy-based software cost estimation through probabilistic-based similarity measures",
abstract = "The performance of software cost estimation based on analogy reasoning depends upon the measures that specifying the similarity between software projects. This paper empirically investigates the use of probabilistic-based distance functions to improve the similarity measurement. The probabilistic-based distance functions are considerably more robust, because they collect the implicit correlation between the occurrences of project feature attributes. This information gain enables the constructed estimation model to be more concise and comprehensible. The study compares 6 probabilistic-based distance functions against the commonlyused Euclidian distance. We empirically evaluate the implemented cost estimation model using 5 real-world datasets collected from the PROMISE repository. The result shows a significant improvement in terms of error reduction, that implies an estimation based on probabilistic-based distance functions achieve higher accuracy on average, and the peak performance significantly outperforms the Euclidian distance based on Wilcoxon signed-rank test.",
keywords = "Analogy, Heterogeneous distance function, Machine learning, Software cost estimation",
author = "Passakorn Phannachitta and Jacky Keung and Akito Monden and Matsumoto, {Ken Ichi}",
year = "2013",
doi = "10.1109/APSEC.2013.78",
language = "English",
isbn = "9781479921430",
volume = "1",
pages = "541--546",
booktitle = "APSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference",
publisher = "IEEE Computer Society",
address = "United States",

}

TY - GEN

T1 - Improving analogy-based software cost estimation through probabilistic-based similarity measures

AU - Phannachitta, Passakorn

AU - Keung, Jacky

AU - Monden, Akito

AU - Matsumoto, Ken Ichi

PY - 2013

Y1 - 2013

N2 - The performance of software cost estimation based on analogy reasoning depends upon the measures that specifying the similarity between software projects. This paper empirically investigates the use of probabilistic-based distance functions to improve the similarity measurement. The probabilistic-based distance functions are considerably more robust, because they collect the implicit correlation between the occurrences of project feature attributes. This information gain enables the constructed estimation model to be more concise and comprehensible. The study compares 6 probabilistic-based distance functions against the commonlyused Euclidian distance. We empirically evaluate the implemented cost estimation model using 5 real-world datasets collected from the PROMISE repository. The result shows a significant improvement in terms of error reduction, that implies an estimation based on probabilistic-based distance functions achieve higher accuracy on average, and the peak performance significantly outperforms the Euclidian distance based on Wilcoxon signed-rank test.

AB - The performance of software cost estimation based on analogy reasoning depends upon the measures that specifying the similarity between software projects. This paper empirically investigates the use of probabilistic-based distance functions to improve the similarity measurement. The probabilistic-based distance functions are considerably more robust, because they collect the implicit correlation between the occurrences of project feature attributes. This information gain enables the constructed estimation model to be more concise and comprehensible. The study compares 6 probabilistic-based distance functions against the commonlyused Euclidian distance. We empirically evaluate the implemented cost estimation model using 5 real-world datasets collected from the PROMISE repository. The result shows a significant improvement in terms of error reduction, that implies an estimation based on probabilistic-based distance functions achieve higher accuracy on average, and the peak performance significantly outperforms the Euclidian distance based on Wilcoxon signed-rank test.

KW - Analogy

KW - Heterogeneous distance function

KW - Machine learning

KW - Software cost estimation

UR - http://www.scopus.com/inward/record.url?scp=84936889065&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84936889065&partnerID=8YFLogxK

U2 - 10.1109/APSEC.2013.78

DO - 10.1109/APSEC.2013.78

M3 - Conference contribution

SN - 9781479921430

SN - 9780769549224

VL - 1

SP - 541

EP - 546

BT - APSEC 2013 - Proceedings of the 20th Asia-Pacific Software Engineering Conference

PB - IEEE Computer Society

ER -