A fuzzy hashing technique for large scale software birthmarks

Takehiro Tsuzaki, Teruaki Yamamoto, Haruaki Tamada, Akito Monden

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Software birthmarks have been proposed as a method for enabling the detection of programs that may have been stolen by measuring the similarity between the two programs. A birthmark is created from each program by extracting its native characteristics. The birthmarks of the programs can then be compared. However, because the extracted birthmarks contain a large amount of information, a large amount of time is needed when using them to compare large programs. This paper describes our work to reduce this comparison time. Achieving faster comparisons will enable the evaluation of large programs and simplify the use of birthmarks. Specifically, our method creates hashes from conventional birthmark information using fuzzy hashing, and then measures the similarity of the programs using the obtained hash values. Using the proposed method, we achieved a major speed increase over the conventional birthmark method with distinction rates of over 90%. On the other hand, because preservation performance decreased substantially, the similarity threshold value needed to be lowered when using the proposed method.

Original languageEnglish
Title of host publication2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509008063
DOIs
Publication statusPublished - Aug 23 2016
Event15th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2016 - Okayama, Japan
Duration: Jun 26 2016Jun 29 2016

Other

Other15th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2016
CountryJapan
CityOkayama
Period6/26/166/29/16

Fingerprint

Hashing
Software
Fuzzy Information
Threshold Value
Preservation
Simplify
Evaluation
Similarity

ASJC Scopus subject areas

  • Computer Science(all)
  • Energy Engineering and Power Technology
  • Control and Optimization

Cite this

Tsuzaki, T., Yamamoto, T., Tamada, H., & Monden, A. (2016). A fuzzy hashing technique for large scale software birthmarks. In 2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings [7550868] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICIS.2016.7550868

A fuzzy hashing technique for large scale software birthmarks. / Tsuzaki, Takehiro; Yamamoto, Teruaki; Tamada, Haruaki; Monden, Akito.

2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2016. 7550868.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tsuzaki, T, Yamamoto, T, Tamada, H & Monden, A 2016, A fuzzy hashing technique for large scale software birthmarks. in 2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings., 7550868, Institute of Electrical and Electronics Engineers Inc., 15th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2016, Okayama, Japan, 6/26/16. https://doi.org/10.1109/ICIS.2016.7550868
Tsuzaki T, Yamamoto T, Tamada H, Monden A. A fuzzy hashing technique for large scale software birthmarks. In 2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2016. 7550868 https://doi.org/10.1109/ICIS.2016.7550868
Tsuzaki, Takehiro ; Yamamoto, Teruaki ; Tamada, Haruaki ; Monden, Akito. / A fuzzy hashing technique for large scale software birthmarks. 2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2016.
@inproceedings{f6d97133cf564d9e8962650116557343,
title = "A fuzzy hashing technique for large scale software birthmarks",
abstract = "Software birthmarks have been proposed as a method for enabling the detection of programs that may have been stolen by measuring the similarity between the two programs. A birthmark is created from each program by extracting its native characteristics. The birthmarks of the programs can then be compared. However, because the extracted birthmarks contain a large amount of information, a large amount of time is needed when using them to compare large programs. This paper describes our work to reduce this comparison time. Achieving faster comparisons will enable the evaluation of large programs and simplify the use of birthmarks. Specifically, our method creates hashes from conventional birthmark information using fuzzy hashing, and then measures the similarity of the programs using the obtained hash values. Using the proposed method, we achieved a major speed increase over the conventional birthmark method with distinction rates of over 90{\%}. On the other hand, because preservation performance decreased substantially, the similarity threshold value needed to be lowered when using the proposed method.",
author = "Takehiro Tsuzaki and Teruaki Yamamoto and Haruaki Tamada and Akito Monden",
year = "2016",
month = "8",
day = "23",
doi = "10.1109/ICIS.2016.7550868",
language = "English",
booktitle = "2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - A fuzzy hashing technique for large scale software birthmarks

AU - Tsuzaki, Takehiro

AU - Yamamoto, Teruaki

AU - Tamada, Haruaki

AU - Monden, Akito

PY - 2016/8/23

Y1 - 2016/8/23

N2 - Software birthmarks have been proposed as a method for enabling the detection of programs that may have been stolen by measuring the similarity between the two programs. A birthmark is created from each program by extracting its native characteristics. The birthmarks of the programs can then be compared. However, because the extracted birthmarks contain a large amount of information, a large amount of time is needed when using them to compare large programs. This paper describes our work to reduce this comparison time. Achieving faster comparisons will enable the evaluation of large programs and simplify the use of birthmarks. Specifically, our method creates hashes from conventional birthmark information using fuzzy hashing, and then measures the similarity of the programs using the obtained hash values. Using the proposed method, we achieved a major speed increase over the conventional birthmark method with distinction rates of over 90%. On the other hand, because preservation performance decreased substantially, the similarity threshold value needed to be lowered when using the proposed method.

AB - Software birthmarks have been proposed as a method for enabling the detection of programs that may have been stolen by measuring the similarity between the two programs. A birthmark is created from each program by extracting its native characteristics. The birthmarks of the programs can then be compared. However, because the extracted birthmarks contain a large amount of information, a large amount of time is needed when using them to compare large programs. This paper describes our work to reduce this comparison time. Achieving faster comparisons will enable the evaluation of large programs and simplify the use of birthmarks. Specifically, our method creates hashes from conventional birthmark information using fuzzy hashing, and then measures the similarity of the programs using the obtained hash values. Using the proposed method, we achieved a major speed increase over the conventional birthmark method with distinction rates of over 90%. On the other hand, because preservation performance decreased substantially, the similarity threshold value needed to be lowered when using the proposed method.

UR - http://www.scopus.com/inward/record.url?scp=84988028193&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84988028193&partnerID=8YFLogxK

U2 - 10.1109/ICIS.2016.7550868

DO - 10.1109/ICIS.2016.7550868

M3 - Conference contribution

AN - SCOPUS:84988028193

BT - 2016 IEEE/ACIS 15th International Conference on Computer and Information Science, ICIS 2016 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

ER -