Local search for constrained graph clustering in biological networks
•A constrained graph clustering problem with applications in cancer research.•Exact method is capable of finding optimal solutions while satisfying all constraints.•Iterated local search significantly outperforms in terms of computational runtime.•Multilevel approach is proposed for clustering large...
Saved in:
| Published in | Computers & operations research Vol. 132; p. 105299 |
|---|---|
| Main Authors | , , , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Elsevier Ltd
01.08.2021
Pergamon Press Inc |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0305-0548 0305-0548 |
| DOI | 10.1016/j.cor.2021.105299 |
Cover
| Summary: | •A constrained graph clustering problem with applications in cancer research.•Exact method is capable of finding optimal solutions while satisfying all constraints.•Iterated local search significantly outperforms in terms of computational runtime.•Multilevel approach is proposed for clustering large graphs.
Semi-supervised or constrained graph clustering incorporates prior information in order to improve clustering results. Pairwise constraints are often utilized to guide the clustering process. This work addresses a constrained graph clustering problem in biological networks where (1) subgraph connectivity constraints are strictly required to be satisfied and (2) clustering quality is assessed with respect to pairwise constraint violations. Existing constrained graph clustering methods often fail to fully satisfy connectivity constraints. This paper presents an iterated local search algorithm which aims to find the clustering with the highest quality in a short computing time. Experiments demonstrate how the proposed solutions are of good quality, often being optimal. Additionally, the proposed method significantly outperforms an existing branch-and-cut algorithm in terms of computational runtime and produces competitive results with regard to other local search techniques and graph clustering algorithms. Furthermore, a multilevel algorithm for clustering is designed to handle large-scale graphs. The performance of the overall scheme for a variety of coarsening methods from the literature is studied on a large number of biological networks exceeding 10,000 genes. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 0305-0548 0305-0548 |
| DOI: | 10.1016/j.cor.2021.105299 |