Efficient and Near-optimal Algorithms for Sampling Small Connected Subgraphs
-
Published:2023-06-24
Issue:3
Volume:19
Page:1-40
-
ISSN:1549-6325
-
Container-title:ACM Transactions on Algorithms
-
language:en
-
Short-container-title:ACM Trans. Algorithms
Affiliation:
1. Università degli Studi di Milano, Italy
Abstract
We study the following problem: Given an integer
k
≥ 3 and a simple graph
G
, sample a connected induced
k
-vertex subgraph of
G
uniformly at random. This is a fundamental graph mining primitive with applications in social network analysis, bioinformatics, and more. Surprisingly, no efficient algorithm is known for uniform sampling; the only somewhat efficient algorithms available yield samples that are only approximately uniform, with running times that are unclear or suboptimal. In this work, we provide: (i) a near-optimal mixing time bound for a well-known random walk technique, (ii) the first efficient algorithm for truly uniform graphlet sampling, and (iii) the first sublinear-time algorithm for ε-uniform graphlet sampling.
Funder
Algorithms and Learning for AI
Bertinoro International Center for Informatics
European Research Council under the Starting Grant
Department of Computer Science of the Sapienza University of Rome
Publisher
Association for Computing Machinery (ACM)
Subject
Mathematics (miscellaneous)
Reference41 articles.
1. Mixing time bounds for graphlet random walks
2. David Aldous and James Fill. 1995. Reversible Markov Chains and Random Walks on Graphs. Retrieved from https://www.stat.berkeley.edu/aldous/RWG/book.pdf.
3. Biomolecular network motif counting and discovery by color coding
4. Color-coding