Scalable and accurate detection of code clones

Belevantsev A. A.; Kurmangaleev S. F.; Sargsyan S.; A. Avetisyan

doi:10.1134/s0361768816010072

Publications

?

Scalable and accurate detection of code clones

Programming and Computer Software. 2016. Vol. 42. No. 1. P. 27–33.

Belevantsev A. A., Kurmangaleev S. F., Sargsyan S., Avetisyan A.

A detailed description of a method for detection of code clones is described. This method is based on the semantic analysis of programs and on new algorithms that make it scalable without affecting its accuracy. The proposed method involves two phases. In the first phase, the program dependence graph (PDG) is constructed while the program is compiled. LLVM is used as the compilation infrastructure. In the second phase, similar subgraphs of maximum size that represent code clones are detected. Before starting the search for similar subgraphs, the PDG is divided into subgraphs that will be considered as potential clones of each other. To ensure scalability of the search for similar subgraphs, the composition of algorithms is used. The first algorithm checks that a pair of graphs cannot have similar subgraphs of the desired size; this is done in a linear amount of time. If this algorithm fails, another (approximate) algorithm is executed to find similar subgraphs of maximum size. After similar subgraphs have been found, the program code is additionally checked for the position of the code lines corresponding to the detected clone candidates. Tests showed that the developed tool is more accurate than similar tools, such as MOSS, CCFinder, and CloneDR. Results obtained for the projects Linux-2.6, Firefox Mozilla, LLVM/Clang, and OpenSSL are presented

Priority areas: engineering science

Keywords: graph semantic analysis LLVM

Determining the boundary of dynamical chaos in the generalized Chirikov map via machine learning

Чернышов Д. П., Satanin A., Shchur L., / Series arXiv "math". 2025.

We investigate the boundary separating regular and chaotic dynamics in the generalized Chirikov map, an extension of the standard map with phase-shifted secondary kicks. Lyapunov maps were computed across the parameter space (K,K(α, τ)) and used to train a convolutional neural network (ResNet18) for binary classification of dynamical regimes. The model reproduces the known critical ...

Added: November 21, 2025

Geometry of unimodular systems

Artamkin I., / Series arXiv "math". 2023.

A collection of vectors in a real vector space is called a unimodular system if any of its maximal linearly independent subsets generates the same free abelian group. This notion is closely connected with totally unimodular matrices: rows or columns of a totally unimodular matrix form a unimodular system and the matrix of coefficients of ...

Added: November 1, 2025

Об одном комбинаторном приложении теории ультрафильтров: новая конструкция графов без треугольников и с произвольно большим хроматическим числом

Polyakov N. L., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 522 № 1 С. 40–49

The paper describes a new method for constructing graphs without triangles and with an arbitrarily large chromatic number. The properties of various types of ultrafilter extensions of functions and predicates are used to justify the method. ...

Added: June 3, 2025

Action of a Graph Automorphism on the Space of Flows

Spiridonov I., Mathematical notes 2019 Vol. 106 No. 1-2 P. 146 – 150

Added: April 27, 2025

Моделирование транспортно-логистических систем и исследование их структурной устойчивости.

Кочкаров А. А., Яцкин Д. В., Кочкаров Р. А., Управленческие науки 2020 Т. 10 № 1 С. 102–111

An important parameter of the transport and logistics task is the structural stability of the system to external influences. In modern literature, the concept of structural stability is defined in its own way for each individual task, as a result of which there are difficulties in applying the developed methods to new problems. The transport ...

Added: March 7, 2025

Проектирование транспортно-логистических систем, устойчивых к структурным разрушениям

Кочкаров А. А., Яцкин Д. В., Кочкаров Р. А., Теоретическая и прикладная экономика 2020 № 1 С. 1–9

This article is dedicated to designing of the transport and logistics systems with built-in resistance to structural failures. The sustainability indicators reflect the impact of the failure of one or several hubs (communication channels) upon working capacity of the already functioning system. In the process of designing the system, the sustainability indicators also provide opportunities ...

Added: March 7, 2025

Using Big Data for Foresight: Scientometric and Semantic Analysis for South Africa

Saritas O., Kotsemir M., , in: 21st Century Foresight: Shaping the Future for Sustainable Social, Economic and Environmental Development in South Africa.: Cham: Springer, 2024. P. 115–208.

The South Africa 2030 Foresight study, utilizing big data analytics, aimed at advancing the national innovation system and enhancing knowledge generation capacity. This chapter presents outcomes from bibliometric and semantic analyses conducted by ISSEK HSE. Bibliometric analysis explored South Africa’s research competences, scientific capacity and global collaborators, revealing mature and emerging research areas. Semantic analysis ...

Added: February 14, 2025

О последовательных факторах нижнего центрального ряда прямоугольных групп Кокстера

Veryovkin Y., Рахматуллаев Т. А., Математические заметки 2024 Т. 116 № 1 С. 10–33

We study the lower central series of a right-angled Coxeter group RCK and the corresponding associated graded Lie algebra L(RCK) and describe the basis of the fourth graded component of L(RCK) for any K. ...

Added: January 15, 2025

Doping dependence of low-energy charge collective excitations in high-Tc cuprates

Kagan M., Silkin V. M., Efremov D. V., / Series arXiv "math". 2024. No. 2411.12836.

In this study, we analyze the dielectric function of high-Tc cuprates as a function of doping level, taking into account the full energy band dispersion within the CuO2 monolayer. In addition to the conventional two-dimensional (2D) gapless plasmon mode, our findings reveal the existence of three anomalous branches within the plasmon spectrum. Two of these branches ...

Added: November 27, 2024

Influence of anisotropy on the study of critical behavior of spin models by machine learning methods

Sukhoverkhova D., Shchur L., / Series arXiv "math". 2024. No. 2410.14523.

In this paper, we applied a deep neural network to study the issue of knowledge transferability between statistical mechanics models. The following computer experiment was conducted. A convolutional neural network was trained to solve the problem of binary classification of snapshots of the Ising model's spin configuration on a two-dimensional lattice. During testing, snapshots of ...

Added: October 21, 2024

Cross-country analysis of science, technology and innovation policies: non-covid-19 related and Covid-19 specific STI policies in OECD countries

Russo M., Pavone P., Meissner D. et al., Quality and Quantity 2025 Vol. 59 No. Suppl 1 P. S343–S367

In OECD countries, Science, Technology and Innovation (STI) policies were seen as key aspects of coping with the Covid-19 pandemic. Now that the pandemic is over, identifying which policy mix portfolios characterised countries in terms of their non-Covid-19 related and Covid-19 specific STI policies fills a knowledge gap on changes in STI policies induced by ...

Added: September 27, 2024

Comparison of the microcanonical population annealing algorithm with the Wang-Landau algorithm

Mozolenko V., Fadeeva Marina, Shchur L., / Series arXiv "math". 2024. No. 2405.10865.

The development of new algorithms for simulations in physics is as important as the development of new analytical methods. In this paper we present a comparison of the recently developed microcanonical population annealing (MCPA) algorithm with the rather mature Wang-Landau algorithm. The comparison is performed on two cases of the Potts model exhibiting a first ...

Added: May 20, 2024

Majorana modes and Fano resonances in Aharonov- Bohm ring with topologically nontrivial superconducting bridge

Kagan M., Аксёнов С. В., / Series Research Square "Research Suqare". 2024. No. 1.

We study different resonances (first of all of the Fano type) in the interference device formed by the Aharonov-Bohm ring with superconducting (SC) wire in the topologically nontrivial state playing a role of a bridge between top and bottom arms. We analyze Majorana modes on the ends of the SC wire and show that the collapse of the ...

Added: April 10, 2024

High-frequency dielectric anomalies in a highly frustrated square kagome lattice nabokoite family compounds ACu7(TeO4)(SO4)5Cl (A=Na, K, Rb, Cs)

Ребров Я. В., Glazkov V., Murtazoev A. F. et al., / Series cond-mat "arxiv.org". 2023.

Nabokoite family compounds ACu7(TeO4)(SO4)5Cl (A=Na, K, Cs, Rb) are one of the candidates for the evasive spin-liquid state predicted for highly-frustrated square kagome lattice (SKL). Their magnetic subsystem includes SKL layers decorated by additional copper ions. All members of this family are characterized by quite high Curie-Weiss temperatures (∼80−200 K), but magnetic ordering was reported ...

Added: January 29, 2024

Spot the Bot: Distinguishing Human-Written and Bot-Generated Texts Using Clustering and Information Theory Techniques

Gromov V., Dang Q. N., , in: 10th International Conference, PReMI 2023, Kolkata, India, December 12–15, 2023, Proceedings. Pattern Recognition and Machine Intelligence. LNCS, volume 14301.: Cham: Springer, 2023. Ch. 3 P. 20–27.

Added: November 29, 2023

Nondestructive KPFM-assisted Quality Control in Fabrication of GaAs High-Speed Electronics

Shurakov A., Kaurova N., Belikov I. et al., / Series Physics "arxiv.org". 2022.

In this paper, we report on the method of nondestructive quality control that can be used in fabrication of GaAs high-speed electronics. The method relies on the surface potential mapping and enables rigid in vivo analysis of transport properties of an active electronic device incorporated into a complex integrated circuit. The study is inspired by ...

Added: September 15, 2023

Трнавац, Р. (1999). Концепт судьбЫ в русском и сербском языках

Trnavac R., Slavistika 1999 Vol. 3 P. 214–226

semantic analysis of the concept of fate in Russian and Serbian ...

Added: February 16, 2023

Концепт истины в русском и сербском языках.

Trnavac R., , in: Proceedings of the 5th International Congress: Sostojanie i perspektivy sopostavitel’nyh issledovanij russkogo i drugih jazykov.: [б.и.], 2000. P. 320–327.

Semantic analysis of the concept of "truth" ...

Added: February 16, 2023

Елементи концепта наде као хришћанске врлине.

Trnavac R., Српски језик 2001 No. 6 P. 469–478

Semantic elements of the Christian concept of "hope". ...

Added: February 16, 2023

Positive Appraisal in online news comments

Trnavac R., Taboada M., , in: Studies in Ethnopragmatics, Cultural Semantics, and Intercultural Communication: Ethnopragmatics and Semantic Analysis.: Singapore: Springer, 2020. P. 185–205.

This chapter investigates the linguistic expression of positive evaluation in English and describes a preliminary typology of linguistic devices used for positive evaluation. Using corpus-assisted analysis, we classify some of the resources that play a role in the expression of positive evaluation into phenomena in the lexicogrammar and phenomena that belong in discourse semantics and ...

Added: January 1, 2023

Studies in Ethnopragmatics, Cultural Semantics, and Intercultural Communication: Ethnopragmatics and Semantic Analysis

Singapore: Springer, 2020.

This book is the first in a three-volume set that celebrates the career and achievements of Cliff Goddard, a pioneer of the NSM approach in linguistics. It explores issues in ethnopragmatics and conversational humour, with a further focus on semantic analysis more broadly. ...

Added: January 1, 2023

О улози природног семантичког метајезика у дефинисању значења културолошких концепата мерак (срп.),кайф (рус.) (задовољство) и инат (срп.)

Радослава М. Трнавац, , in: Лексикографија и лексикологија у светлу актуелних проблема: Зборник научних радова.: Beograd: Институт за српски језик САНУ, 2021. P. 845–861.

THE ROLE OF THE NATURAL SEMANTIC METALANGUAGE IN DEFINING THE MEANING OF THE CULTURAL CONCEPTS MERAK, KAIF, AND INAT Summary In this paper, we describe the approach of the Natural Semantic Metalanguage towards the representation of semantic information in lexicography. We focus on the properties of the Natural Semantic Metalanguage that can improve the presentation ...

Added: December 28, 2022

Superconducting spin valves based on a single spiral magnetic layer

Pugach N., Safonchik M. O., Belotelov V. et al., / Series "cond-mat". 2022. No. 2110.00369.

A detailed investigation of a superconducting spin-triplet valve is presented. This spin-valve consists of a superconducting film covering a metal with an intrinsic spiral magnetic order, which could result from competing isotropic exchanges or, if the crystal lattice breaks central symmetry, from asymmetric Dzyaloshinskii-Moriya exchange. Depending on the anisotropy, such a metal may change its ...

Added: November 16, 2022

Charge transport in the spatially correlated exponential random energy landscape: effect of the non-positive correlation function

Novikov S. V., / Series cond-mat "arxiv.org". 2022. No. 2209.14955.

Charge transport in amorphous semiconductors having spatially correlated exponential density of states (DOS) has been considered for the arbitrary behavior of the correlation function of random energies. Average carrier velocity is exactly calculated for the quasi-equilibrium (nondispersive) transport regime. For the symmetric exponential DOS with exponential tails for low and high energies and non-positive correlation ...

Added: November 1, 2022