Tree Slicing in Clone Detection: Syntactic Analysis Made (Semi)-Semantic

Marat Akhin; Vladimir Itsykson

doi:10.18255/1818-1015-2012-6-69-78

Tree Slicing in Clone Detection: Syntactic Analysis Made (Semi)-Semantic

Marat Akhin, Vladimir Itsykson

https://doi.org/10.18255/1818-1015-2012-6-69-78

Full Text:

PDF (Rus)

Generate QR code

Abstract

Nowadays most of software contains code duplication that leads to serious problems in software maintenance. A lot of different clone detection approaches have been proposed over the years to deal with this problem, but almost all of them do not consider semantic properties of the source code. We propose to reinforce traditional tree-based clone detection algorithms by using additional information about variable slices. This allows to find intertwined/gapped clones on variables; preliminary evaluation confirms applicability of our approach to real-world software.

Keywords

Clone Detection, Tree Slicing, Tree Patterns, Software Maintenance

About the Authors

Marat Akhin

Санкт-Петербургский государственный политехнический университет
Russian Federation
аспирант

Vladimir Itsykson

Санкт-Петербургский государственный политехнический университет
Russian Federation

доцент

References

1. Deshpande A., Riehle D. The Total Growth of Open Source // Conference on Open Source Systems. Springer Verlag, 2008.

2. Roy C. K., Cordy J. R. A Survey on Software Clone Detection Research. TR 2007-541, School of Computing, Queen’s University, 2007.

3. Ахин М.Х., Ицыксон В.М. Обнаружение клонов исходного кода: теория и практика // Системное программирование. 2010. 5. С. 145–163.

4. Li Z., Lu S., Myagmar S., Zhou Y. CP-Miner: a Tool for Finding Copy-Paste and Related Bugs in Operating System Code // Conference on Symposium on Opearting Systems Design & Implementation, USENIX Association, Berkeley, 2004. P. 20–20.

5. Kontogiannis K. Evaluation Experiments on the Detection of Programming Patterns Using Software Metrics // Working Conference on Reverse Engineering. IEEE Computer Society. Washington, DC, 1997. P. 44–44.

6. Koschke R., Falke R., Frenzel P. Clone Detection Using Abstract Syntax Suffix Trees // Working Conference on Reverse Engineering. IEEE Computer Society. Washington, DC, 2006. P. 253–262.

7. Hummel B., Juergens E., Heinemann L., Conradt M. Index-based Code Clone Detection: Incremental, Distributed, Scalable // International Conference on Software Maintenance. IEEE Computer Society. Washington, DC, 2010. P. 1–9.

8. Gabel M., Jiang L., Su S. Scalable Detection of Semantic Clones // International Conference on Software Engineering. IEEE Computer Society, Washington, DC, 2008. P. 321–330.

9. Krinke J. Identifying Similar Code with Program Dependence Graphs // Working Conference on Reverse Engineering. IEEE Computer Society. Washington, DC, 2001.

10. Jiang L., Misherghi G., Su S., Glondu S. DECKARD: Scalable and Accurate Treebased Detection of Code Clones // International Conference on Software Engineering. IEEE Computer Society. Washington, DC, 2007. P. 96–105.

11. Andoni A., Indyk P. Near-optimal Hashing Algorithms for Approximate Nearest Neighbour in High Dimensions. Commun. // ACM. 2008. 51, 1. P. 117–122.

12. Evans W., Fraser C., Ma F. Clone Detection via Structural Abstraction // Software Quality Journal. 2009. 17, 4. P. 309–330.

13. Bulychev P., Minea M. Duplicate Code Detection Using Anti-unification // Spring Young Researchers Colloquium on Software Engineering, 2008. P. 51–54.

14. Kamiya T., Kusumoto S., Inoue K. CCFinder: Multilinguistic Token-based Code Clone Detection System for Large Scale Source Code // IEEE Trans. Softw. Eng. 2002. 28, 7. P. 654–670.

Review

For citations:

Akhin M., Itsykson V. Tree Slicing in Clone Detection: Syntactic Analysis Made (Semi)-Semantic. Modeling and Analysis of Information Systems. 2012;19(6):69-78. (In Russ.) https://doi.org/10.18255/1818-1015-2012-6-69-78

JATS XML

This work is licensed under a Creative Commons Attribution 4.0 License.

ISSN 1818-1015 (Print)
ISSN 2313-5417 (Online)

Username
Password
	Remember me
Not a user? Register with this site Forgot your password?

User

Modeling and Analysis of Information Systems

Tree Slicing in Clone Detection: Syntactic Analysis Made (Semi)-Semantic

Full Text:

Abstract

Keywords

About the Authors

References

Review

For citations:

Cookies policy