Efficient frequent subgraph mining on large streaming graphs

Citation DataIntelligent Data Analysis, ISSN: 1571-4128, Vol: 23, Issue: 1, Page: 103-132

Publication Year2019

13
Citations
0
Usage
15
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
13
- Citation Indexes
  13
Captures
15
- Readers
  15

Article Description

We propose an efficient, approximate algorithm to solve the problem of finding frequent subgraphs in large streaming graphs. The graph stream is treated as batches of labeled nodes and edges. Our proposed algorithm finds the set of frequent subgraphs as the graph evolves after each batch. The computational complexity is bounded to linear limits by looking only at the changes made by the most recent batch, and the historical set of frequent subgraphs. As a part of our approach, we also propose a novel sampling algorithm that samples regions of the graph that have been changed by the most recent update to the graph. The performance of the proposed approach is evaluated using five large graph datasets, and our approach is shown to be faster than the state of the art large graph miners while maintaining their accuracy. We also compare our sampling algorithm against a well known sampling algorithm for network motif mining, and show that our sampling algorithm is faster, and capable of discovering more types of patterns. We provide theoretical guarantees of our algorithm's accuracy using the well known Chernoff bounds, as well as an analysis of the computational complexity of our approach.

Bibliographic Details

DOI10.3233/ida-173705

URL IDhttp://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85062241189&origin=inward; http://dx.doi.org/10.3233/ida-173705; https://journals.sagepub.com/doi/full/10.3233/IDA-173705; https://dx.doi.org/10.3233/ida-173705; https://content.iospress.com:443/articles/intelligent-data-analysis/ida173705

AUTHOR(S)

Abhik Ray; Lawrence B. Holder; Albert Bifet

PUBLISHER(S)

SAGE Publications

TAG(S)

Mathematics; Computer Science

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know