Al-Fahim Mubarak-Ali, Shahida Sulaiman


Pre-processing and transformation are the first two common processes that occur in a code clone detection process. The purpose of these two processes is to transform the source codes into a more representable form that can be used later on as input for code clone detection. Main issue arises in both of these processes is the application of the pre-processing and transformation rules might cause loss of critical information thus affecting the code clone detection results. Therefore, this work proposes a combination pre-processing and transformation process that can produce a better source unit representation of .Net platform source code which is C#. Net and VB.Net by enhancing an existing work that was done on Java language without affecting the critical information in the source code. The proposed enhancement was tested and the result showed that the proposed work was able to produce the expected source unit for the .Net platform languages together.


Pre-processing process, transformation process, code clone

Full Text:



Kawaguchi, S., Yamashina, T., Uwano, H., Fushida, K., Kamei, Y., Nagura, M., et al. 2009. SHINOBI: A Tool for Automatic Code Clone Detection in the IDE. Presented at the Reverse Engineering, 2009. WCRE '09. 16th Working Conference on, 2009.

Bellon, S., Koschke, R., Antoniol, G., Krinke, J., and Merlo, E. 2007. Comparison and Evaluation of Clone Detection Tools. IEEE Transactions on Software Engineering. 33: 577-591,

Roy, C. K. 2009. Detection and analysis of near-miss software clones. In IEEE International Conference on Software Maintenance. 447-450.

Mubarak-Ali A.-F., Syed-Mohamed S.-M., and Sulaiman S. 2011. An Enhanced Generic Pipeline Model for Code Clone Detection. In 5th Malaysian Conference in Software Engineering (MySEC). 434-438.

Hou, D., Jacob F., and Jablonski P. 2009. Exploring the Design Space of Proactive Tool Support for Copy-and-Paste Programming. Presented at the Proceedings of the 2009 conference of the Centre for Advanced Studies on Collaborative Research (CASCON '09), Toronto, Ontario, Canada.

Kapser, C. J. and Godfrey, M. W. 2006. Supporting the Analysis of Clones in Software Systems. Journal of Software Maintenance and Evolution: Research and Practice. 18: 61-82.

Kapser, C. J. and Godfrey, M. W. 2008. Cloning Considered Harmful Considered Harmful: Patterns of Cloning In Software. Empirical Software Engineering. 13: 645-692,

Roy, C. K. and Cordy, J. R. 2007. A Survey on Software Clone Detection Research. SCHOOL OF COMPUTING TR 2007-541. QUEEN’S UNIVERSITY. 115.

Johnson, J. H. 1994. Substring Matching for Clone Detection and Change Tracking. In International Conference on Software Maintenance. 120-126.

Ducasse, S., Rieger, M., and Demeyer, S. 1999. A Language Independent Approach for Detecting Duplicated Code. Presented at the Proceedings of the IEEE International Conference on Software Maintenance.

Kamiya, T., Kusumoto, S., and Inoue, K. 2002. CCFinder: A Multilinguistic Token-based Code Clone Detection System for Large Scale Source Code. IEEE Transactions on Software Engineering. 28: 654-670.

Mubarak-Ali A.-F. and Sulaiman S. 2014. A Hybrid Technique in Pre-Processing and Transformation Process for Code Clone Detection. In Software Engineering Conference (MySEC), 2014 8th Malaysian. 102-107.

FreeStudentsProjects. 2014. FreeStudents Projects. Available: http://www.freestudentprojects.com/.

Sourceforge.net. 2014. Sourceforge.net. Available: http://sourceforge.net/home.html.

DOI: https://doi.org/10.11113/jt.v77.6202


  • There are currently no refbacks.


Copyright © 2012 Penerbit UTM Press, Universiti Teknologi Malaysia.
Disclaimer : This website has been updated to the best of our knowledge to be accurate. However, Universiti Teknologi Malaysia shall not be liable for any loss or damage caused by the usage of any information obtained from this web site.
Best viewed: Mozilla Firefox 4.0 & Google Chrome at 1024 × 768 resolution.