Loading jsMath...
Weizmann Logo
ECCC
Electronic Colloquium on Computational Complexity

Under the auspices of the Computational Complexity Foundation (CCF)

Login | Register | Classic Style



REPORTS > DETAIL:

Revision(s):

Revision #2 to TR13-031 | 29th April 2013 16:49

Clustering in the Boolean Hypercube in a List Decoding Regime

RSS-Feed




Revision #2
Authors: Irit Dinur, Elazar Goldenberg
Accepted on: 29th April 2013 16:49
Downloads: 3647
Keywords: 


Abstract:

We consider the following clustering with outliers problem: Given a set of points X \subset \{-1,1\}^n, such that there is some point z \in \{-1,1\}^n for which at least \delta of the points are \epsilon-correlated with z, find z. We call such a point z a (\delta,\epsilon)-center of X.

In this work we give lower and upper bounds for the task of finding a (\delta,\epsilon)-center. Our main upper bound shows that for values of \epsilon and \delta that are larger than 1/polylog(n), there exists a polynomial time algorithm that finds a (\delta - o(1),\epsilon - o(1))-center. Moreover, it outputs a list of centers explaining all of the clusters in the input. Our main lower bound shows that given a set for which there exists a (\delta,\epsilon)-center, it is hard to find even a (\delta/n^c, \epsilon)-center for some constant c and \epsilon=1/poly(n), \delta=1/poly(n).


Revision #1 to TR13-031 | 2nd March 2013 13:00

Clustering in the Boolean Hypercube in a List Decoding Regime





Revision #1
Authors: Irit Dinur, Elazar Goldenberg
Accepted on: 2nd March 2013 13:00
Downloads: 2928
Keywords: 


Abstract:

We consider the following clustering with outliers problem: Given a set of points X \subset \{-1,1\}^n, such that there is some point z \in \{-1,1\}^n for which at least \delta of the points are \epsilon-correlated with z, find z. We call such a point z a (\delta,\epsilon)-center of X.

In this work we give lower and upper bounds for the task of finding a (\delta,\epsilon)-center. We first show that for
\delta=1-\nu close to 1, i.e. in the "unique decoding
regime", given a (1-\nu,\epsilon)-centered set our algorithm can
find a (1-(1+o(1))\nu,(1-o(1))\epsilon)-center. More
interestingly, we study the "list decoding regime", i.e. when
\delta is close to 0. Our main upper bound shows that for values of \epsilon and \delta that are larger than 1/polylog(n), there exists a polynomial time algorithm that finds a (\delta - o(1),\epsilon - o(1))-center. Moreover, it outputs a list of centers explaining all of the clusters in the input. Our main lower bound shows that given a set for which there exists a (\delta,\epsilon)-center, it is hard to find even a (\delta/n^c, \epsilon)-center for some constant c and \epsilon=1/poly(n), \delta=1/poly(n).


Paper:

TR13-031 | 22nd February 2013 16:00

Clustering in the Boolean Hypercube in a List Decoding Regime





TR13-031
Authors: Irit Dinur, Elazar Goldenberg
Publication: 22nd February 2013 16:01
Downloads: 3580
Keywords: 


Abstract:

We consider the following clustering with outliers problem: Given a set of points X \subset \{-1,1\}^n, such that there is some point z \in \{-1,1\}^n for which at least \delta of the points are \epsilon-correlated with z, find z. We call such a point z a (\delta,\epsilon)-center of X.

In this work we give lower and upper bounds for the task of finding a (\delta,\epsilon)-center. Our main upper bound shows that for values of \epsilon and \delta that are larger than 1/polylog(n), there exists a polynomial time algorithm that finds a (\delta - o(1),\epsilon - o(1))-center. Moreover, it outputs a list of centers explaining all of the clusters in the input. Our main lower bound shows that given a set for which there exists a (\delta,\epsilon)-center, it is hard to find even a (\delta/n^c, \epsilon)-center for some constant c and \epsilon=1/poly(n), \delta=1/poly(n).



ISSN 1433-8092 | Imprint