On 2-Clubs in Graph-Based Data Clustering: Theory and Algorithm Engineering

Authors

  • Aleksander Figiel
  • Anne-Sophie Himmel
  • André Nichterlein
  • Rolf Niedermeier

DOI:

https://doi.org/10.7155/jgaa.00570

Keywords:

graph modification problems , parameterized complexity , fixed-parameter tractability , problem kernel , data reduction , branch and bound , algorithm engineering

Abstract

Editing a graph into a disjoint union of clusters is a standard optimization task in graph-based data clustering. Here, complementing classical work where the clusters shall be cliques, we focus on clusters that shall be 2-clubs, that is, subgraphs of diameter at most two. This naturally leads to the two NP-hard problems $\rm{2-C \small{LUB}}$ $\rm{C{\small LUSTER}}$ $\rm{E{\small DITING}}$ (the editing operations are edge insertions and edge deletions) and $\rm{2-C \small{LUB}}$ $\rm{C{\small LUSTER}}$ $\rm{V{\small ERTEX}}$ $\rm{D{\small ELETION}}$ (the editing operations are vertex deletions). Answering an open question from the literature, we show that $\rm{2-C \small{LUB}}$ $\rm{C{\small LUSTER}}$ $\rm{E{\small DITING}}$ is W[2]-hard with respect to the number of edge modifications, thus contrasting the fixed-parameter tractability result for the classical $\rm{C{\small LUSTER}}$ $\rm{E{\small DITING}}$ problem (considering cliques instead of 2-clubs). Then, focusing on $\rm{2-C \small{LUB}}$ $\rm{C{\small LUSTER}}$ $\rm{V{\small ERTEX}}$ $\rm{D{\small ELETION}}$, which is easily seen to be fixed-parameter tractable, we show that under standard complexity-theoretic assumptions it does not have a polynomial-size problem kernel when parameterized by the number of vertex deletions. Nevertheless, we develop several effective data reduction and pruning rules, resulting in a competitive solver, outperforming a standard ILP formulation in most instances of an established biological test data set.

Downloads

Download data is not yet available.

Downloads

Published

2021-01-01

How to Cite

Figiel, A., Himmel, A.-S., Nichterlein, A., & Niedermeier, R. (2021). On 2-Clubs in Graph-Based Data Clustering: Theory and Algorithm Engineering. Journal of Graph Algorithms and Applications, 25(1), 521–547. https://doi.org/10.7155/jgaa.00570

Issue

Section

Articles

Categories