An improved data stream algorithm for clustering (Q904105): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/j.comgeo.2015.06.003 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2215609590 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5417724 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Data streams. Models and algorithms. / rank
 
Normal rank
Property / cites work
 
Property / cites work: COMPUTING <i>k</i> CENTERS OVER STREAMING DATA FOR SMALL <i>k</i> / rank
 
Normal rank
Property / cites work
 
Property / cites work: Streaming and dynamic algorithms for minimum enclosing balls in high dimensions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Incremental Clustering and Dynamic Information Retrieval / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4198056 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Data mining. Concepts and techniques / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive sampling for geometric problems over data streams / rank
 
Normal rank
Property / cites work
 
Property / cites work: Streaming Algorithms for k-Center Clustering with Outliers and with Anonymity / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Complexity of Some Common Geometric Location Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Streaming with minimum space: an algorithm for covering by two congruent balls / rank
 
Normal rank

Latest revision as of 08:56, 11 July 2024

scientific article
Language Label Description Also known as
English
An improved data stream algorithm for clustering
scientific article

    Statements

    An improved data stream algorithm for clustering (English)
    0 references
    0 references
    0 references
    15 January 2016
    0 references
    The paper deals with the problem of the \(k\)-center problem, i.e., given a set of points \(P\) in a metric space and an integer \(k\), we want to find \(k\) points \(c_1, c_2, \dots, c_k\) such that \(\max_{p \in P} \min_{1\leq i \leq k} |p c_i|\) is minimized, where \(|pq|\) denotes the distance between \(p\) and \(q\). The authors make some additional assumptions, namely they take the \(d\)-dimensional Euclidean metric space \(\mathbb R^d\). Moreover, they assume that the points arrive one by one in a stream and we are allowed to examine each point only once, and only a small amount of information can be stored in a device. The authors start with giving a single-pass data stream algorithm that for any \(d \geq 1\) uses the \(O(d / \varepsilon)\) space and returns two centers together with their radii that guarantee a \((1.8 + \varepsilon)\)-approximation for the Euclidean \(2\)-center problem. Each update in the algorithm takes \(O(d / \varepsilon)\) time. Later, they extend the algorithm to the Euclidean \(k\)-center problem. They design an approximation algorithm that for some constant \(k > 2\) and for any dimension \(d \geq 1\) uses \(O(2^k (k+3)! d / \varepsilon )\) space and returns \(k\) centers that guarantee a \((1.8 + \varepsilon)\)-approximation. Each update in the algorithm takes \(O(2^k (k+2)! d / \varepsilon)\) time.
    0 references
    Euclidean \(k\)-clustering
    0 references
    Euclidean \(k\)-center problem
    0 references
    streaming algorithms
    0 references
    computational geometry
    0 references

    Identifiers