We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: On the power of linear programming for K-means clustering

Abstract: In [SIAM J. Optim., 2022], the authors introduced a new linear programming (LP) relaxation for K-means clustering. In this paper, we further investigate the theoretical properties of this relaxation. We focus on K-means clustering with two clusters, which is an NP-hard problem. As evident from our numerical experiments with both synthetic and real-world data sets, the proposed LP relaxation is almost always tight; i.e. its optimal solution is feasible for the original nonconvex problem. To better understand this unexpected behaviour, we obtain sufficient conditions under which the LP relaxation is tight. We further analyze the sufficient conditions when the input is generated according to a popular stochastic model and obtain recovery guarantees for the LP relaxation. Finally, we construct a family of inputs for which the LP relaxation is never tight.
Subjects: Optimization and Control (math.OC)
MSC classes: 90C05, 90C57, 62H30, 49Q20, 68Q87
Cite as: arXiv:2402.01061 [math.OC]
  (or arXiv:2402.01061v1 [math.OC] for this version)

Submission history

From: Antonio De Rosa [view email]
[v1] Thu, 1 Feb 2024 23:19:48 GMT (101kb)

Link back to: arXiv, form interface, contact.