CCGraMi: An Effective Method for Mining Frequent Subgraphs in a Single Large Graph

Zobrazit/ otevřít
Datum
2021-12-21Altmetrics
10.13164/mendel.2021.2.090
Metadata
Zobrazit celý záznamAbstrakt
In modern applications, large graphs are usually applied in the simulation and analysis of large complex systems such as social networks, computer networks, maps, traffic networks. Therefore, graph mining is also an interesting subject attracting many researchers. Among them, frequent subgraph mining in a single large graph is one of the most important branches of graph mining, it is defined as finding all subgraphs whose occurrences in a dataset are greater than or equal to a given frequency threshold. In which, the GraMi algorithm is considered the state of the art approach and many algorithms have been proposed to improve this algorithm. In 2020, the SoGraMi algorithm was proposed to optimize the GraMi algorithm and presented an outstanding performance in terms of runtime and storage space. In this paper, we propose a new algorithm to improve SoGraMi based on connected components, called CCGraMi (Connected Components GraMi). Our experiments on four real datasets (both directed and undirected) show that the proposed algorithm outperforms SoGraMi in terms of running time as well as memory requirements.
Klíčová slova
Data mining, Pruning techniques, Single large graph, Subgraph mining, Weighted subgraphTrvalý odkaz
http://hdl.handle.net/11012/203385Typ dokumentu
Recenzovaný dokumentVerze dokumentu
Finální verze PDFZdrojový dokument
Mendel. 2021 vol. 27, č. 2, s. 90-99. ISSN 1803-3814https://mendel-journal.org/index.php/mendel/article/view/162
Kolekce
- Vol. 27, No. 2 [13]