Mechanical Detection of Similarities in Documents: The Source Code Files Case

Postgraduate Thesis uoadl:1316459 562 Read counter

Unit:
ΠΜΣ Πληροφορικής και Τηλεπικοινωνιών με ειδίκευση Υπολογιστική Επιστήμη
Library of the School of Science
Deposit date:
2016-03-07
Year:
2016
Author:
Χρονόπουλος Διονύσιος
Supervisors info:
Παναγιώτης Σταματόπουλος
Original Title:
Αυτόματη Ανίχνευση Ομοιοτήτων σε Έγγραφα: Η Περίπτωση Αρχείων Πηγαίου Κώδικα
Languages:
Greek
Translated title:
Mechanical Detection of Similarities in Documents: The Source Code Files Case
Summary:
In the digital world anything can be copied: Documents, music, programs,
photographs, etc. Contrary to the analog world, these copies are not affected
by any data loss or distortion with respect to their originals. Frequently
though, partial copies of documents are made and their contents are even
slightly modified on purpose. Given such an altered document, our objective is
to detect its originals.
We include techniques and algorithms used for the mechanical detection of
similarities among a specific finite group of documents. In particular, we
examine the similarity among computer source code files.
Keywords:
Strings, Similarity, Lexical Analysis, Fingerprinting, Set Theory
Index:
Yes
Number of index pages:
5
Contains images:
Yes
Number of references:
12
Number of pages:
39
File:
File access is restricted only to the intranet of UoA.

document.pdf
434 KB
File access is restricted only to the intranet of UoA.

 


attachments.zip
7 KB
File access is restricted.