Using software birthmarks to identify similar classes and major functionalities

Takeshi Kakimoto, Akito Monden, Yasutaka Kamei, Haruaki Tamada, Masateru Tsunoda, Ken Ichi Matsumoto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

Software birthmarks are unique and native characteristics of every software component. Two components having similar birthmarks indicate that they are similar in functionality, structure and im-plementation. Questions addressed in this paper include: Which are similar class files? Can they be gathered into one class file? What are major functionalities among class files? To answer to these questions, this paper analyzed the similarity of birthmarks for all pairs of classes in ArgoUML, and visualized them using Multi-Dimensional Scaling (MDS). As a result, three pairs of very similar class files, which seem to be made by copy-and-paste programming, were identified. Also, four major functionalities were identified in the MDS space.

Original languageEnglish
Title of host publicationProceedings - International Conference on Software Engineering
Pages171-172
Number of pages2
DOIs
Publication statusPublished - 2006
Externally publishedYes
Event2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006 - Shanghai, China
Duration: May 20 2006May 28 2006

Other

Other2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006
CountryChina
CityShanghai
Period5/20/065/28/06

Keywords

  • multi-dimensional scaling
  • software birthmark

ASJC Scopus subject areas

  • Software

Cite this

Kakimoto, T., Monden, A., Kamei, Y., Tamada, H., Tsunoda, M., & Matsumoto, K. I. (2006). Using software birthmarks to identify similar classes and major functionalities. In Proceedings - International Conference on Software Engineering (pp. 171-172) https://doi.org/10.1145/1137983.1138026

Using software birthmarks to identify similar classes and major functionalities. / Kakimoto, Takeshi; Monden, Akito; Kamei, Yasutaka; Tamada, Haruaki; Tsunoda, Masateru; Matsumoto, Ken Ichi.

Proceedings - International Conference on Software Engineering. 2006. p. 171-172.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kakimoto, T, Monden, A, Kamei, Y, Tamada, H, Tsunoda, M & Matsumoto, KI 2006, Using software birthmarks to identify similar classes and major functionalities. in Proceedings - International Conference on Software Engineering. pp. 171-172, 2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006, Shanghai, China, 5/20/06. https://doi.org/10.1145/1137983.1138026
Kakimoto T, Monden A, Kamei Y, Tamada H, Tsunoda M, Matsumoto KI. Using software birthmarks to identify similar classes and major functionalities. In Proceedings - International Conference on Software Engineering. 2006. p. 171-172 https://doi.org/10.1145/1137983.1138026
Kakimoto, Takeshi ; Monden, Akito ; Kamei, Yasutaka ; Tamada, Haruaki ; Tsunoda, Masateru ; Matsumoto, Ken Ichi. / Using software birthmarks to identify similar classes and major functionalities. Proceedings - International Conference on Software Engineering. 2006. pp. 171-172
@inproceedings{3d6692330cf74e67bd3239eb3ad2e0a9,
title = "Using software birthmarks to identify similar classes and major functionalities",
abstract = "Software birthmarks are unique and native characteristics of every software component. Two components having similar birthmarks indicate that they are similar in functionality, structure and im-plementation. Questions addressed in this paper include: Which are similar class files? Can they be gathered into one class file? What are major functionalities among class files? To answer to these questions, this paper analyzed the similarity of birthmarks for all pairs of classes in ArgoUML, and visualized them using Multi-Dimensional Scaling (MDS). As a result, three pairs of very similar class files, which seem to be made by copy-and-paste programming, were identified. Also, four major functionalities were identified in the MDS space.",
keywords = "multi-dimensional scaling, software birthmark",
author = "Takeshi Kakimoto and Akito Monden and Yasutaka Kamei and Haruaki Tamada and Masateru Tsunoda and Matsumoto, {Ken Ichi}",
year = "2006",
doi = "10.1145/1137983.1138026",
language = "English",
isbn = "1595933972",
pages = "171--172",
booktitle = "Proceedings - International Conference on Software Engineering",

}

TY - GEN

T1 - Using software birthmarks to identify similar classes and major functionalities

AU - Kakimoto, Takeshi

AU - Monden, Akito

AU - Kamei, Yasutaka

AU - Tamada, Haruaki

AU - Tsunoda, Masateru

AU - Matsumoto, Ken Ichi

PY - 2006

Y1 - 2006

N2 - Software birthmarks are unique and native characteristics of every software component. Two components having similar birthmarks indicate that they are similar in functionality, structure and im-plementation. Questions addressed in this paper include: Which are similar class files? Can they be gathered into one class file? What are major functionalities among class files? To answer to these questions, this paper analyzed the similarity of birthmarks for all pairs of classes in ArgoUML, and visualized them using Multi-Dimensional Scaling (MDS). As a result, three pairs of very similar class files, which seem to be made by copy-and-paste programming, were identified. Also, four major functionalities were identified in the MDS space.

AB - Software birthmarks are unique and native characteristics of every software component. Two components having similar birthmarks indicate that they are similar in functionality, structure and im-plementation. Questions addressed in this paper include: Which are similar class files? Can they be gathered into one class file? What are major functionalities among class files? To answer to these questions, this paper analyzed the similarity of birthmarks for all pairs of classes in ArgoUML, and visualized them using Multi-Dimensional Scaling (MDS). As a result, three pairs of very similar class files, which seem to be made by copy-and-paste programming, were identified. Also, four major functionalities were identified in the MDS space.

KW - multi-dimensional scaling

KW - software birthmark

UR - http://www.scopus.com/inward/record.url?scp=77954011222&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77954011222&partnerID=8YFLogxK

U2 - 10.1145/1137983.1138026

DO - 10.1145/1137983.1138026

M3 - Conference contribution

SN - 1595933972

SN - 9781595933973

SP - 171

EP - 172

BT - Proceedings - International Conference on Software Engineering

ER -