Using software birthmarks to identify similar classes and major functionalities

Takeshi Kakimoto, Akito Monden, Yasutaka Kamei, Haruaki Tamada, Masateru Tsunoda, Ken Ichi Matsumoto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

Software birthmarks are unique and native characteristics of every software component. Two components having similar birthmarks indicate that they are similar in functionality, structure and im-plementation. Questions addressed in this paper include: Which are similar class files? Can they be gathered into one class file? What are major functionalities among class files? To answer to these questions, this paper analyzed the similarity of birthmarks for all pairs of classes in ArgoUML, and visualized them using Multi-Dimensional Scaling (MDS). As a result, three pairs of very similar class files, which seem to be made by copy-and-paste programming, were identified. Also, four major functionalities were identified in the MDS space.

Original languageEnglish
Title of host publicationProceedings of the 2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006
Pages171-172
Number of pages2
DOIs
Publication statusPublished - 2006
Externally publishedYes
Event2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006 - Shanghai, China
Duration: May 20 2006May 28 2006

Publication series

NameProceedings - International Conference on Software Engineering
ISSN (Print)0270-5257

Other

Other2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006
Country/TerritoryChina
CityShanghai
Period5/20/065/28/06

Keywords

  • multi-dimensional scaling
  • software birthmark

ASJC Scopus subject areas

  • Software

Cite this